arXiv:2511.04583

Jr. AI Scientist and Its Risk Report: Autonomous Scientific Exploration from a Baseline Paper

Published on Nov 6

· Submitted by

Atsuyuki Miyai on Nov 6

Hal Lab UTokyo

Upvote

Authors:

Atsuyuki Miyai ,

Abstract

Jr. AI Scientist, an autonomous AI system, mimics novice researcher workflows to generate scientifically valuable papers, outperforming fully automated systems but with identified limitations and risks.

AI-generated summary

Understanding the current capabilities and risks of AI Scientist systems is essential for ensuring trustworthy and sustainable AI-driven scientific progress while preserving the integrity of the academic ecosystem. To this end, we develop Jr. AI Scientist, a state-of-the-art autonomous AI scientist system that mimics the core research workflow of a novice student researcher: Given the baseline paper from the human mentor, it analyzes its limitations, formulates novel hypotheses for improvement, validates them through rigorous experimentation, and writes a paper with the results. Unlike previous approaches that assume full automation or operate on small-scale code, Jr. AI Scientist follows a well-defined research workflow and leverages modern coding agents to handle complex, multi-file implementations, leading to scientifically valuable contributions. For evaluation, we conducted automated assessments using AI Reviewers, author-led evaluations, and submissions to Agents4Science, a venue dedicated to AI-driven scientific contributions. The findings demonstrate that Jr. AI Scientist generates papers receiving higher review scores than existing fully automated systems. Nevertheless, we identify important limitations from both the author evaluation and the Agents4Science reviews, indicating the potential risks of directly applying current AI Scientist systems and key challenges for future research. Finally, we comprehensively report various risks identified during development. We hope these insights will deepen understanding of current progress and risks in AI Scientist development.

View arXiv page View PDF Add to collection

Community

AtsuMiyai

Paper author Paper submitter 3 days ago

This paper presents a comprehensive report on the development of a state-of-the-art (SOTA) AI Scientist and the associated risks.

💎 Achievements

Developed Jr. AI Scientist, a SOTA autonomous AI Scientist system specialized in baseline extension research that mirrors the workflow of an early-stage student researcher.
With the permission of the original paper authors, allowed Jr. AI Scientist to generate papers building upon real NeurIPS and ICLR papers — achieving higher scores than existing AI-generated research papers.
Conducted an in-depth analysis of the fundamental limitations of Jr. AI Scientist through evaluations by the authors and Agents4Science.
Comprehensively reported various risks identified during development (e.g., review hacking and incorrect citations).

Through this report, we aim to contribute to a deeper understanding of AI Scientists.

librarian-bot

2 days ago

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2511.04583 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2511.04583 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2511.04583 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.