Zhicheng Wang

Dicer

https://blog.dicer.fun

Dicer-Zz

AI & ML interests

NLP

Recent Activity

liked a model 5 months ago

Qwen/Qwen3-Embedding-0.6B

Feature Extraction • 0.6B • Updated Jun 20, 2025 • 1.98M • 829

liked a model 7 months ago

thenlper/gte-large-zh

updated a model 11 months ago

Dicer/ppo-Huggy

Reinforcement Learning • Updated Feb 25, 2025 • 1

published a model 11 months ago

Dicer/ppo-Huggy

Reinforcement Learning • Updated Feb 25, 2025 • 1

updated a model 11 months ago

Dicer/ppo-LunarLander-v2

Reinforcement Learning • Updated Feb 25, 2025

published a model 11 months ago

Dicer/ppo-LunarLander-v2

Reinforcement Learning • Updated Feb 25, 2025

upvoted 2 articles 11 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025 • 271

Article

Vision Language Models Explained

Apr 11, 2024 • 510

liked 5 datasets about 1 year ago

liked a model over 1 year ago

XLabs-AI/flux-controlnet-collections

Text-to-Image • Updated Aug 30, 2024 • 6.5k • 540

liked a Space almost 2 years ago

MTEB Leaderboard

🥇

6.93k

Embedding Leaderboard

liked a model almost 2 years ago

openbmb/MiniCPM-2B-sft-fp32

Text Generation • Updated Sep 7, 2024 • 311 • 296

liked a dataset almost 2 years ago

bigscience/P3

Viewer • Updated Mar 4, 2024 • 122M • 16.1k • 231

liked a model almost 2 years ago

mistralai/Mistral-7B-Instruct-v0.2

Text Generation • 7B • Updated Jul 24, 2025 • 2.13M • 3.05k

liked a dataset about 2 years ago

Muennighoff/natural-instructions

Viewer • Updated Dec 23, 2022 • 7.15M • 2.12k • 74

liked a model over 2 years ago

huggyllama/llama-13b

Text Generation • 13B • Updated Apr 7, 2023 • 3.51k • 143
