wangrui's picture

wangrui

varuy322

·

varuy322

AI & ML interests

None yet

Recent Activity

upvoted a collection about 14 hours ago

Nemotron-Cascade

liked a dataset 5 days ago

Open-Bee/Honey-Data-15M

upvoted a collection 13 days ago

Nemotron-Pre-Training-Datasets

View all activity

Organizations

None yet

upvoted a collection about 14 hours ago

Nemotron-Cascade

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 3 days ago • 41

liked a dataset 5 days ago

Open-Bee/Honey-Data-15M

Viewer • Updated Nov 5, 2025 • 14.8M • 29.9k • 104

upvoted a collection 13 days ago

Nemotron-Pre-Training-Datasets

Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 12 days ago • 87

upvoted a collection 18 days ago

Multimodal Implementations

Comprehensive Demo of Multimodal VLMs on the Hub • 23 items • Updated 4 days ago • 11

liked a dataset 18 days ago

google/deepsearchqa

Viewer • Updated 18 days ago • 900 • 1.79k • 94

liked 4 datasets 19 days ago

allenai/Molmo2-Cap

Viewer • Updated 19 days ago • 108k • 683 • 7

allenai/dolma3_mix-5.5T-1125

Viewer • Updated 18 days ago • 218k • 2.95k • 8

Anthropic/alignment-faking-rl

Viewer • Updated 19 days ago • 2.14M • 358 • 5

openai/frontierscience

Viewer • Updated 19 days ago • 160 • 7.22k • 144

liked a model 19 days ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Text Generation • 32B • Updated 12 days ago • 261k • 520

liked a model 20 days ago

nvidia/Eagle2-2B

Image-Text-to-Text • 2B • Updated Apr 27, 2025 • 687 • 32

upvoted a paper 24 days ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 111

upvoted 2 collections 24 days ago

Multimodal Dataset

87 items • Updated 3 days ago • 7

MiniCPM4

MiniCPM4: Ultra-Efficient LLMs on End Devices • 29 items • Updated Sep 8, 2025 • 82

liked 2 models 26 days ago

Alibaba-NLP/gme-Qwen2-VL-7B-Instruct

Sentence Similarity • 8B • Updated Jun 9, 2025 • 3.77k • 70

deepseek-ai/DeepSeek-V3.2-Exp

Text Generation • 685B • Updated Nov 18, 2025 • 71.6k • • 930

liked 2 datasets 27 days ago

MathLLMs/MathVision

Viewer • Updated Nov 27, 2025 • 3.34k • 8.09k • 114

MMMU/MMMU

Viewer • Updated Sep 19, 2024 • 11.6k • 54.1k • 307

upvoted a paper about 1 month ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 85

upvoted a collection about 1 month ago

Olmo 3 Post-training

All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated 12 days ago • 46