1 459 1064

jiakai

real-jiakai

https://blog.gujiakai.top

AI & ML interests

LLM && Smart QA

Recent Activity

liked a Space about 13 hours ago

HuggingFaceTB/smol-training-playbook

upvoted a paper about 17 hours ago

Kimi Linear: An Expressive, Efficient Attention Architecture

upvoted a paper about 17 hours ago

Can Agent Conquer Web? Exploring the Frontiers of ChatGPT Atlas Agent in Web Games

View all activity

Organizations

liked a Space about 13 hours ago

813

The Smol Training Playbook: The Secrets to Building World-Class LLMs

📝

upvoted 2 papers about 17 hours ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published 2 days ago • 55

Can Agent Conquer Web? Exploring the Frontiers of ChatGPT Atlas Agent in Web Games

Paper • 2510.26298 • Published 3 days ago • 40

upvoted a paper 3 days ago

InteractComp: Evaluating Search Agents With Ambiguous Queries

Paper • 2510.24668 • Published 4 days ago • 94

liked a model 3 days ago

openai/gpt-oss-safeguard-20b

Text Generation • 22B • Updated 3 days ago • 4.07k • • 99

upvoted a paper 5 days ago

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published 8 days ago • 90

liked a model 6 days ago

MiniMaxAI/MiniMax-M2

Text Generation • 229B • Updated 3 days ago • 530k • • 905

liked 2 models 9 days ago

PokeeAI/pokee_research_7b

Text Generation • 8B • Updated 10 days ago • 5.57k • 94

mirth/chonky_mmbert_small_multilingual_1

Token Classification • 0.1B • Updated 9 days ago • 198 • 22

upvoted a paper 10 days ago

LightMem: Lightweight and Efficient Memory-Augmented Generation

Paper • 2510.18866 • Published 11 days ago • 106

upvoted a paper 11 days ago

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

Paper • 2510.16872 • Published 13 days ago • 90

liked a Space 11 days ago

DeepSeek OCR Demo

🖼

An interactive demo for the DeepSeek-OCR model.

upvoted a paper 12 days ago

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

Paper • 2510.15444 • Published 16 days ago • 144

liked a model 12 days ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated 8 days ago • 1.66M • 2.33k

liked a model 15 days ago

nanonets/Nanonets-OCR2-3B

Image-Text-to-Text • 4B • Updated 17 days ago • 65k • 429

upvoted a paper 15 days ago

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA

Paper • 2510.04849 • Published 26 days ago • 109

liked 2 models 15 days ago

PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated 1 day ago • 25.8k • 1.19k

facebook/MobileLLM-Pro

Text Generation • 1B • Updated 8 days ago • 4.49k • 138

upvoted a paper 18 days ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published 19 days ago • 169

upvoted a paper 19 days ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published 27 days ago • 112