Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Fugaku-LLM
community
Activity Feed
Follow
79
AI & ML interests
None defined yet.
Recent Activity
Taishi-N324
authored
a paper
20 days ago
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources
Taishi-N324
authored
a paper
about 2 months ago
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
Taishi-N324
authored
a paper
4 months ago
Rewriting Pre-Training Data Boosts LLM Performance in Math and Code
View all activity
Team members
10
models
3
Sort: Recently updated
Fugaku-LLM/Fugaku-LLM-13B
Text Generation
•
13B
•
Updated
Jan 10
•
2
•
129
Fugaku-LLM/Fugaku-LLM-13B-instruct-gguf
13B
•
Updated
May 9, 2024
•
33
•
41
Fugaku-LLM/Fugaku-LLM-13B-instruct
Text Generation
•
13B
•
Updated
May 9, 2024
•
34
•
28
datasets
0
None public yet