Pretrained LLMs from scratch.
Y. Yu
PursuitOfDataScience
AI & ML interests
LLM, GPU Computing, PyTorch
Recent Activity
updated
a model
3 days ago
PursuitOfDataScience/Llama-3.2-1B-GRPO
published
a model
3 days ago
PursuitOfDataScience/Llama-3.2-1B-GRPO
updated
a collection
17 days ago
ArgonneAI
Organizations
None yet
Sandbox Models
Trial & Error models for various tasks.
-
PursuitOfDataScience/roberta-large-ner
Token Classification • 0.4B • Updated • 4 -
PursuitOfDataScience/distilbert-base-cased-ner
Token Classification • 65.2M • Updated • 1 -
PursuitOfDataScience/bert-base-ner
Token Classification • 0.1B • Updated • 1 -
PursuitOfDataScience/t5-large-summary-model
0.7B • Updated • 1
ArgonneAI
Pretrained LLMs from scratch.
Sandbox Models
Trial & Error models for various tasks.
-
PursuitOfDataScience/roberta-large-ner
Token Classification • 0.4B • Updated • 4 -
PursuitOfDataScience/distilbert-base-cased-ner
Token Classification • 65.2M • Updated • 1 -
PursuitOfDataScience/bert-base-ner
Token Classification • 0.1B • Updated • 1 -
PursuitOfDataScience/t5-large-summary-model
0.7B • Updated • 1
models
23
PursuitOfDataScience/Llama-3.2-1B-GRPO
Text Generation
•
1B
•
Updated
•
19
PursuitOfDataScience/Argonne-2.0
Text Generation
•
6B
•
Updated
•
35
PursuitOfDataScience/llama3.2-1b-thinking
Text Generation
•
1B
•
Updated
•
1
PursuitOfDataScience/llama-3-2-1b-open-r1-mot-sft
Text Generation
•
1B
•
Updated
•
3
PursuitOfDataScience/qwen2.5-0.5b-r1-dpo
Text Generation
•
0.5B
•
Updated
PursuitOfDataScience/qwen2.5-0.5b-dpo
Text Generation
•
0.5B
•
Updated
•
4
PursuitOfDataScience/qwen2.5-0.5b-open-r1-mot-cot-sft
Text Generation
•
0.5B
•
Updated
•
2
PursuitOfDataScience/llama3.2-1b-dpo
Text Generation
•
1B
•
Updated
•
1
PursuitOfDataScience/qwen2.5-0.5b-ultrachat-sft-multi-turn
0.5B
•
Updated
•
1
PursuitOfDataScience/finetuned-llama-3.2-3b-math-reasoning
3B
•
Updated
•
1
datasets
44
PursuitOfDataScience/toucan-agentic-thinking
Viewer
•
Updated
•
119k
•
47
PursuitOfDataScience/arxiv-qa-thinking
Viewer
•
Updated
•
215k
•
23
PursuitOfDataScience/0.9M-thinking
Viewer
•
Updated
•
898k
•
160
PursuitOfDataScience/0.5M-thinking
Viewer
•
Updated
•
499k
•
104
PursuitOfDataScience/MiniMax-M2.1-Mixture-of-Thoughts
Viewer
•
Updated
•
349k
•
483
•
2
PursuitOfDataScience/gsm8k-thinking
Viewer
•
Updated
•
8.79k
•
12
PursuitOfDataScience/bbc-news-llama4-maverick-summary
Viewer
•
Updated
•
174k
•
11
PursuitOfDataScience/govreport-llama4-maverick-summary
Viewer
•
Updated
•
19.5k
•
17
•
1
PursuitOfDataScience/arxiv-llama4-maverick-abstract
Viewer
•
Updated
•
198k
•
17
PursuitOfDataScience/xsum-llama4-maverick-summary
Viewer
•
Updated
•
227k
•
13