flux/deepseek

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le...

Find, benchmark and install in CLI 200+ FREE coding LLM models across 20+ providers in real time - vava-nessa/free-coding-models

Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs. - FastFlowLM/FastFlowLM
SGLang is a high-performance serving framework for large language models and multimodal models. - sgl-project/sglang
A high-throughput and memory-efficient inference and serving engine for LLMs - vllm-project/vllm

A unified AI model hub for aggregation & distribution. It supports cross-converting various LLMs into OpenAI-compatible, Claude-compatible, or Gemini-compatible formats. A centralized gateway f...

Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce. - simstudioai/sim

The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration. - Mintplex-Labs/anything-llm

Context-aware AI assistant for your desktop. Ready to respond intelligently, seamlessly integrating multiple LLMs and MCP tools. - DearVa/Everywhere

⚡️ Open-source AI Gateway — Use any SDK to call 100+ LLMs. Built-in failover, load balancing, cost control & end-to-end tracing. - looplj/axonhub