Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
rubbyninja's picture
37

rubbyninja

rubbyninja
Iliassti's profile picture
·

AI & ML interests

None yet

Recent Activity

updated a collection 20 days ago
advancing research
upvoted a paper 20 days ago
A Fingerprint for Large Language Models
updated a collection 2 months ago
advancing research
View all activity

Organizations

None yet

Collections 1

advancing research
  • STaR: Bootstrapping Reasoning With Reasoning

    Paper • 2203.14465 • Published Mar 28, 2022 • 8
  • DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

    Paper • 2401.06066 • Published Jan 11, 2024 • 55
  • DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Paper • 2405.04434 • Published May 7, 2024 • 21
  • Prompt Cache: Modular Attention Reuse for Low-Latency Inference

    Paper • 2311.04934 • Published Nov 7, 2023 • 34
advancing research
  • STaR: Bootstrapping Reasoning With Reasoning

    Paper • 2203.14465 • Published Mar 28, 2022 • 8
  • DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

    Paper • 2401.06066 • Published Jan 11, 2024 • 55
  • DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Paper • 2405.04434 • Published May 7, 2024 • 21
  • Prompt Cache: Modular Attention Reuse for Low-Latency Inference

    Paper • 2311.04934 • Published Nov 7, 2023 • 34

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs