Aryo Pradipta Gema's picture

Aryo Pradipta Gema PRO

aryopg

·

https://aryopg.github.io

AI & ML interests

Clinical NLP, Knowledge Graph Embedding, Protein Language Model

Recent Activity

updated a model 15 days ago

aryopg/Qwen3-8B_tool_call_wmdp_bsd_rl_curriculum_skipeasy_ckpt30

published a model 15 days ago

aryopg/Qwen3-8B_tool_call_wmdp_bsd_rl_curriculum_skipeasy_ckpt30

upvoted a paper 17 days ago

Learning GUI Grounding with Spatial Reasoning from Visual Feedback

View all activity

Organizations

authored a paper about 2 months ago

PiCSAR: Probabilistic Confidence Selection And Ranking

Paper • 2508.21787 • Published Aug 29 • 4

authored 4 papers 3 months ago

Self-Training Large Language Models for Tool-Use Without Demonstrations

Paper • 2502.05867 • Published Feb 9

Parameter-Efficient Fine-Tuning of LLaMA for the Clinical Domain

Paper • 2307.03042 • Published Jul 6, 2023

Scalpel vs. Hammer: GRPO Amplifies Existing Capabilities, SFT Replaces Them

Paper • 2507.10616 • Published Jul 13 • 1

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19 • 27

authored a paper 7 months ago

An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering

Paper • 2503.23415 • Published Mar 30 • 1

authored a paper 9 months ago

Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs

Paper • 2502.05092 • Published Feb 7 • 8

authored 4 papers about 1 year ago

CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning

Paper • 2410.10336 • Published Oct 14, 2024 • 2

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Paper • 2410.15999 • Published Oct 21, 2024 • 20

Analysing the Residual Stream of Language Models Under Knowledge Conflicts

Paper • 2410.16090 • Published Oct 21, 2024 • 7

DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations

Paper • 2410.18860 • Published Oct 24, 2024 • 11

authored 2 papers over 1 year ago

The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models

Paper • 2404.05904 • Published Apr 8, 2024 • 9

Are We Done with MMLU?

Paper • 2406.04127 • Published Jun 6, 2024 • 38