Popular Tags
No tags found in this context
The most inspiring discoveries in evaluation

Promptfoo is a CLI tool for evaluating and securing LLM applications through automated testing and red teaming.

Langfuse is an open source platform for LLM observability and management, enabling teams to develop and debug AI applications efficiently.

Agenta is an open-source platform for building reliable LLM applications with integrated management, evaluation, and observability tools.

WeKnora is an LLM-powered framework for intelligent knowledge management and semantic retrieval, enhancing document understanding and Q&A capabilities.