flux/evaluation

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23 ...
langchain, llm, open-source-coding-agent, open-source, playground
1 day

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place. - Agenta-AI/agenta
llm, open-source-coding-agent, evaluation, agents, observability
1 day

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm. - Tencent/WeKnora
llm, rag, agent, golang, multi-tenant
1 day

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line ...
gpt, claude, rag, testing, ci
1 day