
Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk - lemonade-sdk/lemonade

Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for AMD NPUs. - FastFlowLM/FastFlowLM
⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever. - Michael-A-Kuykendall/shimmy
SGLang is a high-performance serving framework for large language models and multimodal models. - sgl-project/sglang
A high-throughput and memory-efficient inference and serving engine for LLMs - vllm-project/vllm
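Several of the servers above (shimmy, SGLang, vLLM, and others) advertise an OpenAI-compatible HTTP API, so the same client code works against any of them. The sketch below builds a minimal chat-completions request using only the Python standard library; the port, route, and model name are placeholders, not taken from any one project's docs - check each server's README for its actual defaults.

```python
import json
import urllib.request

# Hypothetical local endpoint: OpenAI-compatible servers conventionally
# expose a /v1/chat/completions route. Port 8000 and the model name
# "qwen3" are assumptions for illustration only.
BASE_URL = "http://localhost:8000/v1/chat/completions"


def build_chat_request(model: str, prompt: str) -> dict:
    """Build a minimal OpenAI-style chat-completions payload."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.7,
    }


def send_chat_request(payload: dict) -> dict:
    """POST the payload to the local server and return the parsed JSON reply."""
    req = urllib.request.Request(
        BASE_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


# Example usage (requires one of the servers above running locally):
# reply = send_chat_request(build_chat_request("qwen3", "Say hello."))
# print(reply["choices"][0]["message"]["content"])
```

Because the wire format is shared, swapping vLLM for SGLang or shimmy should only mean changing `BASE_URL` and the model name.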

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM! - oumi-ai/oumi

Unified web UI for training and running open models like Qwen, DeepSeek, and Gemma locally. - unslothai/unsloth

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, and many more model architectures. - mudler/LocalAI

LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. - langchain4j/langchain4j