New/llama

The newest discoveries in llama

github.com

K8sgpt: AI-Powered Kubernetes Diagnosis Tool

K8sgpt is a tool for diagnosing Kubernetes issues using AI, enhancing cluster management and troubleshooting.

aidevopsgithub-actionsgokubernetes

stack

github.com

TensorZero: Open-Source LLMOps Platform

TensorZero is an open-source LLMOps platform unifying API access, observability, evaluation, optimization, and experimentation for large language models.

aiai-engineeringanthropicartificial-intelligencedeep-learning

flux

github.com

Free LLM Inference API Resources List

A curated list of services offering free access to LLM APIs for developers and researchers.

aiclaudegeminillamallm

flux

github.com

Lemonade: Local AI Server for Optimized LLMs

Lemonade is a local AI server that allows users to run optimized LLMs on their own hardware, ensuring privacy and cost-effectiveness.

aiamdcppgenaigpu

flux

github.com

FastFlowLM: Run LLMs on AMD Ryzen AI NPUs

FastFlowLM enables efficient execution of large language models on AMD Ryzen AI NPUs, optimizing performance without GPU dependency.

amdcppdeepseekllamallm

flux

github.com

Python-free Rust Inference Server for OpenAI API

Shimmy is a Rust-based inference server providing local, OpenAI-compatible endpoints for machine learning models.

api-servercommand-line-tooldeveloper-toolsggufgpt

flux

github.com

High-Performance Framework for Language Models

SGLang is an open-source framework for efficient serving of large language and multimodal models, ensuring low-latency and high-throughput performance.

attentionblackwellcudadeepseekdiffusion