Community-curated code

Ollama lets developers download and run large language models locally, exposing a simple CLI and HTTP API for building LLM-powered applications while keeping data on-device.
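
For example, a minimal sketch of calling a locally running Ollama server over its HTTP API; it assumes ollama serve is running and the llama3 model has been pulled (the model name is illustrative):

```python
# Minimal sketch: query a locally running Ollama server.
# Assumes `ollama serve` is up and `llama3` has been pulled
# via `ollama pull llama3`; the model name is illustrative.
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",  # Ollama's default port
    json={
        "model": "llama3",
        "prompt": "Write a one-line docstring for a binary search function.",
        "stream": False,  # return a single JSON object, not a stream
    },
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["response"])
```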

FastFlowLM runs large language models efficiently on AMD Ryzen AI NPUs, with no GPU required.

Shimmy is a Rust-based inference server that exposes local, OpenAI-compatible endpoints for machine learning models.

SGLang is an open-source framework for serving large language and multimodal models with low latency and high throughput.

vLLM is a high-throughput engine for LLM inference and serving, built around PagedAttention for efficient KV-cache memory management. Shimmy, SGLang, and vLLM all speak the OpenAI API, so a single client works across them (see the sketch below).
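
A minimal client sketch using the openai Python package; the base URL, port, and model id are placeholders to replace with whatever your local server reports at startup:

```python
# Minimal sketch: one client for any local OpenAI-compatible server
# (Shimmy, SGLang, vLLM's `vllm serve`, and similar). The base URL
# and model id below are placeholders, not defaults for every server.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",  # e.g. vLLM's default port
    api_key="not-needed",  # local servers typically ignore the key
)

resp = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",  # placeholder model id
    messages=[{"role": "user", "content": "Explain KV-cache paging in one sentence."}],
)
print(resp.choices[0].message.content)
```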
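
vLLM can also be driven in-process for offline batch generation through its Python API; the model name here is just a small illustrative checkpoint:

```python
# Minimal sketch of vLLM's offline (in-process) batch inference API.
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")  # tiny model, for illustration only
params = SamplingParams(temperature=0.8, max_tokens=64)

# generate() batches prompts together for high-throughput decoding.
for out in llm.generate(["The capital of France is"], params):
    print(out.outputs[0].text)
```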

Oumi is an open-source platform for training and deploying LLMs and vision-language models (VLMs), with built-in tools for evaluation and data synthesis.

Unsloth is an open-source library for fast, memory-efficient fine-tuning of LLMs, using hand-optimized kernels to cut training time and VRAM use.
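
A minimal fine-tuning setup sketch, assuming Unsloth's documented FastLanguageModel API; the checkpoint name and LoRA hyperparameters are illustrative, not recommendations:

```python
# Minimal sketch of a LoRA fine-tuning setup with Unsloth.
# Checkpoint name and hyperparameters are illustrative only.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/llama-3-8b-bnb-4bit",  # illustrative checkpoint
    max_seq_length=2048,
    load_in_4bit=True,  # 4-bit quantization to reduce VRAM use
)

# Attach LoRA adapters so only a small fraction of weights is trained.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,               # LoRA rank
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
# From here, hand `model` and `tokenizer` to a standard HF-style trainer.
```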

LocalAI is an open-source, self-hosted drop-in replacement for the OpenAI API that runs models locally without a GPU, keeping data private; the client sketch above works against it with only the base URL and model id changed.

LangChain4j simplifies LLM integration in Java applications with a unified API and a comprehensive toolbox for developers.