jiakai

real-jiakai

https://blog.gujiakai.top

AI & ML interests

LLM && Smart QA

Recent Activity

upvoted an article about 16 hours ago

Understanding Model Reasoning Through Thought Anchors: A Comparative Study of Qwen3 and DeepSeek-R1

Article • 5 days ago • 3

upvoted a paper about 17 hours ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 173

upvoted a paper about 18 hours ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 66

upvoted 2 articles about 18 hours ago

Fast LoRA inference for Flux with Diffusers and PEFT

Article • and 1 other • 5 days ago • 24

Consilium: When Multiple LLMs Collaborate

Article • 11 days ago • 17

upvoted a paper about 20 hours ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 4 days ago • 117

upvoted a paper 4 days ago

Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning

Paper • 2507.16784 • Published 5 days ago • 106

upvoted an article 4 days ago

TimeScope: How Long Can Your Video Large Multimodal Model Go?

Article • and 3 others • 5 days ago • 27

upvoted a collection 4 days ago

Medical & Clinical NER

Collection

State-of-the-art medical, biomedical, and clinical Named Entity Recognition models • 389 items • Updated 9 days ago • 23

upvoted an article 5 days ago

Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever.

Article • 11 days ago • 122

upvoted a collection 7 days ago

OpenReasoning-Nemotron

Collection

Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science. • 6 items • Updated 6 days ago • 37

upvoted a paper 9 days ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published 10 days ago • 198

upvoted a collection 9 days ago

Seed-X

Collection

A powerful open-source multilingual translation language model series, including instruction and reasoning models. • 3 items • Updated 11 days ago • 59

upvoted 2 articles 9 days ago

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

Article • and 3 others • 9 days ago • 45

Back to The Future: Evaluating AI Agents on Predicting Future Events

Article • and 6 others • 11 days ago • 26

upvoted a paper 10 days ago

Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs

Paper • 2507.09477 • Published 15 days ago • 73

upvoted a paper 11 days ago

EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes

Paper • 2507.11407 • Published 12 days ago • 49

upvoted an article 11 days ago

ScreenEnv: Deploy your full stack Desktop Agent

Article • and 1 other • 18 days ago • 53

upvoted an article 12 days ago

Agents vs. Workflows

Article • May 6 • 2

upvoted a paper 12 days ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published 13 days ago • 78
