Popular Tags
No tags found in this context
The most inspiring discoveries in llm evaluation

Promptfoo is a CLI tool for evaluating and securing LLM applications through automated testing and red teaming.

Langfuse is an open source platform for LLM observability and management, enabling teams to develop and debug AI applications efficiently.

Agenta is an open-source platform for building reliable LLM applications with integrated management, evaluation, and observability tools.