FasterDecoding

community
https://github.com/FasterDecoding

AI & ML interests

Making model inference more efficient through model-system co-design.

Recent Activity

tianlecai  authored a paper 11 days ago
FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction
tianlecai  authored a paper over 1 year ago
SnapKV: LLM Knows What You are Looking for Before Generation
tianlecai  authored a paper over 1 year ago
JetMoE: Reaching Llama2 Performance with 0.1M Dollars

Team members: Zhengyang Geng, Tianle Cai, James Liu
Organization Card

Think deeper, decode faster

models 8

FasterDecoding/BitDelta_Mistral_combo

Updated Feb 14, 2024

FasterDecoding/medusa-1.0-vicuna-13b-v1.5

Text Generation • Updated Jan 25, 2024 • 29 • 1

FasterDecoding/medusa-1.0-vicuna-33b-v1.3

Text Generation • Updated Dec 18, 2023 • 6

FasterDecoding/medusa-1.0-zephyr-7b-beta

Text Generation • Updated Dec 18, 2023 • 12 • 1

FasterDecoding/medusa-v1.0-vicuna-7b-v1.5

Text Generation • Updated Oct 29, 2023 • 239

FasterDecoding/medusa-vicuna-33b-v1.3

Updated Sep 11, 2023 • 3 • 4

FasterDecoding/medusa-vicuna-13b-v1.3

Updated Sep 11, 2023 • 26 • 5

FasterDecoding/medusa-vicuna-7b-v1.3

Updated Sep 11, 2023 • 414 • 16

datasets 0

None public yet