Fugaku-LLM

community

AI & ML interests

None defined yet.

Recent Activity

Taishi-N324 authored a paper about 1 month ago

MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources

Taishi-N324 authored a paper 3 months ago

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

Taishi-N324 authored a paper 5 months ago

Rewriting Pre-Training Data Boosts LLM Performance in Math and Code

View all activity

Fugaku-LLM 's models 3

Fugaku-LLM/Fugaku-LLM-13B

Text Generation • 13B • Updated Jan 10 • 129

Fugaku-LLM/Fugaku-LLM-13B-instruct-gguf

13B • Updated May 9, 2024 • 54 • 41

Fugaku-LLM/Fugaku-LLM-13B-instruct

Text Generation • 13B • Updated May 9, 2024 • 3 • 28