Popular Tags
No tags found in this context
The most inspiring discoveries in gguf

whichllm helps you find the best local LLM for your hardware, optimizing AI inference with real-time benchmarks.

AutoRound is a quantization toolkit for LLMs and VLMs, optimizing performance with high accuracy at low bit widths.
Shimmy is a Rust-based inference server providing local, OpenAI-compatible endpoints for machine learning models.

llmfit optimizes large language models for your hardware, ensuring efficient performance and compatibility.