Xiaosen Zheng's picture

Xiaosen Zheng

xszheng2020

·

https://xszheng2020.github.io

AI & ML interests

Code AI and Data-Centric AI.

Recent Activity

liked a Space 4 days ago

HuggingFaceTB/smol-training-playbook

updated a dataset 7 days ago

xszheng2020/OpenThoughts3-1.2M-8k

published a dataset 7 days ago

xszheng2020/OpenThoughts3-1.2M-8k

View all activity

Organizations

authored a paper 2 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2 • 83

authored a paper about 1 year ago

Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates

Paper • 2410.07137 • Published Oct 9, 2024 • 8

authored 5 papers over 1 year ago

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1, 2024 • 40

Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses

Paper • 2406.01288 • Published Jun 3, 2024 • 1

Intriguing Properties of Data Attribution on Diffusion Models

Paper • 2311.00500 • Published Nov 1, 2023 • 2

Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

Paper • 2402.08567 • Published Feb 13, 2024 • 2

An Empirical Study of Memorization in NLP

Paper • 2203.12171 • Published Mar 23, 2022 • 1