Trending
Top
New
Filters
Tech

Popular Tags

No tags found in this context

Popular Tags

typescript
react
open-source-coding-agent
llm
ui-components
ai-agents
shadcn-ui
tailwind
open-source
python

Top Sources

github.com
clerk.com
1771technologies.com
21st.dev
abui.io
activepieces.com
ai-sdk.dev
alash3al.github.io
alchemy.run
altsendme.com

Browse by Type

Tools
Code

bookmrks.io - Discovery, refined.

Top/efficient-inference

The most inspiring discoveries in efficient inference

github.com

url image

Orthrus: Memory-Efficient Parallel Token Generation

Orthrus is a framework for efficient parallel token generation in LLMs, ensuring lossless output and significant speed improvements.

diffusion-language-modelsefficient-inferencelarge-language-modelsllmllm-efficiency

flux

RELATED TAGS

diffusion-language-models+1large-language-models+1llm+1llm-efficiency+1model-architecture+1natural-language-processing+1open-source-coding-agent+1python+1

Top