Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

jzwong's picture

jzwong

jzwong

·

AI & ML interests

None yet

Organizations

None yet

Collections 4

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4, 2025 • 104
MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14, 2025 • 300
Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14, 2025 • 62
Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published Jan 26, 2025 • 72

Redundancy Principles for MLLMs Benchmarks

Paper • 2501.13953 • Published Jan 20, 2025 • 29
Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22, 2025 • 44
Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12, 2025 • 47
Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14, 2025 • 125

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4, 2025 • 104
MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14, 2025 • 300
Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14, 2025 • 62
Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published Jan 26, 2025 • 72

Redundancy Principles for MLLMs Benchmarks

Paper • 2501.13953 • Published Jan 20, 2025 • 29
Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22, 2025 • 44
Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12, 2025 • 47
Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14, 2025 • 125

View 4 collections

models 0

None public yet

datasets 0

None public yet

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs