Sehoon Kim's picture

3

Sehoon Kim

kssteven

·

https://sehoonkim.org/

AI & ML interests

Efficient AI, AI Systems, Model Compression

Organizations

None yet

upvoted a paper 3 months ago

XQuant: Breaking the Memory Wall for LLM Inference with KV Cache Rematerialization

Paper • 2508.10395 • Published Aug 14 • 42

upvoted an article 10 months ago

Article

Hugging Face and FriendliAI partner to supercharge model deployment on the Hub

Jan 22

•

36

upvoted a paper over 1 year ago

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Paper • 2403.15042 • Published Mar 22, 2024 • 27