Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Fugaku-LLM
community
Activity Feed
Follow
79
AI & ML interests
None defined yet.
Recent Activity
Taishi-N324
authored
a paper
about 1 month ago
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources
Taishi-N324
authored
a paper
3 months ago
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
Taishi-N324
authored
a paper
5 months ago
Rewriting Pre-Training Data Boosts LLM Performance in Math and Code
View all activity
Team members
10
Fugaku-LLM
's models
3
Sort: Recently updated
Fugaku-LLM/Fugaku-LLM-13B
Text Generation
•
13B
•
Updated
Jan 10
•
129
Fugaku-LLM/Fugaku-LLM-13B-instruct-gguf
13B
•
Updated
May 9, 2024
•
54
•
41
Fugaku-LLM/Fugaku-LLM-13B-instruct
Text Generation
•
13B
•
Updated
May 9, 2024
•
3
•
28