ervin's web review
ervin's web review
Tag cloud
Picture wall
Daily
RSS Feed
Login
Remember me
4172
shaares
4172
shaares
Filters
Links per page
20
50
100
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Interesting technique to speed up the generation of large language models.
tech
·
ai
·
machine-learning
·
gpt
·
optimization
December 20, 2023 at 11:11:12 AM GMT+1 * ·
permalink
·
·
https://sites.google.com/view/medusa-llm
·
Filters
Links per page
20
50
100
Fold
Fold all
Expand
Expand all
Are you sure you want to delete this link?
Are you sure you want to delete this tag?
The personal, minimalist, super fast, database-free, bookmarking service by the Shaarli community