Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
shijiecao's picture
3 1

shijiecao

shijiecao
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 months ago
Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning
authored a paper 6 months ago
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation
authored a paper 6 months ago
SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs
View all activity

Organizations

None yet

upvoted a paper 4 months ago

Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning

Paper • 2508.07101 • Published Aug 9 • 13
authored 2 papers 6 months ago

BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation

Paper • 2402.10631 • Published Feb 16, 2024 • 2

SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

Paper • 2410.13276 • Published Oct 17, 2024 • 29
upvoted 2 papers 6 months ago

Rectified Sparse Attention

Paper • 2506.04108 • Published Jun 4 • 10

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Paper • 2506.08889 • Published Jun 10 • 23
liked a model 9 months ago

SeerAttention/SeerAttention-Llama-3.1-8B-AttnGates

Text Generation • Updated Mar 3 • 3.49k • 4
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs