Popular Tags
No tags found in this context

flux/evaluation

favicongithub.com
url image

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23 ...

langchainllmopen-source-coding-agentopen-sourceplayground
1 day
favicongithub.com
url image

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place. - Agenta-AI/agenta

llmopen-source-coding-agentevaluationagentsobservability
1 day
favicongithub.com
url image

LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm. - Tencent/WeKnora

llmragagentgolangmulti-tenant
1 day
favicongithub.com
url image

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line ...

gptclauderagtestingci
1 day