flux/llm

RELATED TAGS

open-source-coding-agent +86, python +56, ai-agents +55, ai +49, typescript +42, openai +36, claude +29, gpt +28, rag +24, agent +23

github.com

GitHub - e2b-dev/E2B: Open-source, secure environment with real-world tools for enterprise-grade agents.

open-source-coding-agent, ai-agents, react, javascript, python

10 hours

github.com

GitHub - can1357/oh-my-pi: ⌥ AI Coding agent for the terminal — hash-anchored edits, optimized tool harness, LSP, Python, browser, subagents, and more

open-source-coding-agent, sprawling-tooling, ai-agents, rust, cli

10 hours

github.com

GitHub - antoinezambelli/forge: A Python framework for self-hosted LLM tool-calling and multi-step agentic workflows

llm, ai-agents, open-source-coding-agent, python, self-hosted

22 hours

github.com

GitHub - HanaokaYuzu/Gemini-API: ✨ Reverse-engineered Python API for Google Gemini web app

open-source-coding-agent, python, api, google, ai

1 day

github.com

GitHub - multica-ai/andrej-karpathy-skills: A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

claude, llm, fine-tuning

1 day

github.com

GitHub - humanlayer/12-factor-agents: What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?

ai-agents, llm, open-source-coding-agent, framework, ai

2 days

github.com

GitHub - yichuan-w/LEANN: [MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

rag, python, privacy, ai, offline-first

3 days

github.com

GitHub - Light-Heart-Labs/DreamServer: Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.

llm, rag, ai-agents, docker, text-to-speech

3 days

github.com

GitHub - NirDiamant/agents-towards-production: End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.

ai-agents, open-source-coding-agent, sprawling-tooling, python, agent

3 days

arxiv.org

$δ$-mem: Efficient Online Memory for Large Language Models

Large language models increasingly need to accumulate and reuse historical information in long-term assistants and agent systems. Simply expanding the context window is costly and often fails to ensure effective context utilization. We propose $δ$-mem, a lightweight memory mechanism that augments a frozen full-attention backbone with a compact online state of associative memory. $δ$-mem compresses past information into a fixed-size state matrix updated by delta-rule learning, and uses its readout to generate low-rank corrections to the backbone’s attention computation during generation. With only an $8\times8$ online memory state, $δ$-mem improves the average score to $1.10\times$ that of the frozen backbone and $1.15\times$ that of the strongest non-$δ$-mem memory baseline. It achieves larger gains on memory-heavy benchmarks, reaching $1.31\times$ on MemoryAgentBench and $1.20\times$ on LoCoMo, while largely preserving general capabilities. These results show that effective memory can be realized through a compact online state directly coupled with attention computation, without full fine-tuning, backbone replacement, or explicit context extension.

llm, ai-agents, rag

4 days
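
The $δ$-mem abstract above mentions a fixed-size state matrix updated by delta-rule learning. As a rough illustration only, the sketch below shows the generic textbook delta rule, S ← S + β (v − S k) kᵀ, on an 8×8 state in plain Python. It is not the paper's implementation: the function names, the `beta` step size, and the toy key/value vectors are all assumptions made for this example, and the low-rank correction to attention that the abstract describes is not modelled here.

```python
def read(state, key):
    """Read from the memory: returns S k as a list."""
    return [sum(s_ij * k_j for s_ij, k_j in zip(row, key)) for row in state]


def delta_rule_update(state, key, value, beta=1.0):
    """Return a new state after one write: S + beta * (v - S k) k^T.

    `beta` is a hypothetical step size chosen for this sketch.
    """
    # What the memory currently recalls for this key.
    predicted = read(state, key)
    # Prediction error between the target value and the recall.
    error = [v_i - p_i for v_i, p_i in zip(value, predicted)]
    # Rank-one correction that moves the recall toward the target.
    return [
        [s_ij + beta * e_i * k_j for s_ij, k_j in zip(row, key)]
        for row, e_i in zip(state, error)
    ]


DIM = 8  # matches the 8x8 state size quoted in the abstract

# Start from an empty memory.
state = [[0.0] * DIM for _ in range(DIM)]

# Toy unit-norm key and an arbitrary value vector (assumed for illustration).
key = [1.0] + [0.0] * (DIM - 1)
value = [float(i) for i in range(DIM)]

# Write once, then read back with the same key.
state = delta_rule_update(state, key, value)
recalled = read(state, key)

# Writing a different value under the same key overwrites the old one,
# while the state stays the same fixed size.
new_value = [10.0] * DIM
state = delta_rule_update(state, key, new_value)
recalled_after_overwrite = read(state, key)
```

With a unit-norm key and `beta=1.0`, a single write makes the stored value exactly recoverable, and a later write under the same key replaces it without growing the state. That is the property that makes a fixed-size memory of this kind attractive compared with extending the context window.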