RLAIF-V

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

Yirany authored a paper about 1 month ago

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages

Yirany authored a paper about 1 month ago

A Topic-level Self-Correctional Approach to Mitigate Hallucinations in MLLMs

Yirany authored a paper about 1 month ago

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents

View all activity

Yirany

authored 4 papers about 1 month ago

Yirany

updated 2 models about 1 month ago

openbmb/RLPR-Llama3.1-8B-Inst

Text Generation • 8B • Updated Jun 30 • 8 • 2

openbmb/RLPR-Gemma2-2B-it

Text Generation • 3B • Updated Jun 30 • 29 • 3

orris27

updated a model about 1 month ago

openbmb/RLPR-Llama3.1-8B-Inst

Text Generation • 8B • Updated Jun 30 • 8 • 2

orris27

published a model about 1 month ago

openbmb/RLPR-Llama3.1-8B-Inst

Text Generation • 8B • Updated Jun 30 • 8 • 2

Yirany

updated a model about 1 month ago

RLAIF-V/RLPR-Qwen2.5-7B-Base

8B • Updated Jun 22 • 3 • 1

Yirany

updated 2 datasets about 1 month ago

RLAIF-V/RLPR-Benchmarks

Viewer • Updated Jun 22 • 638 • 49

RLAIF-V/RLPR-Train-Dataset

Viewer • Updated Jun 22 • 77.7k • 34

resilience

updated a dataset about 1 month ago

RLAIF-V/RLPR-Train-Dataset

Viewer • Updated Jun 22 • 77.7k • 34

resilience

updated a model about 1 month ago

RLAIF-V/RLPR-Qwen2.5-7B-Base

8B • Updated Jun 22 • 3 • 1

resilience

updated a dataset about 1 month ago

RLAIF-V/RLPR-Benchmarks

Viewer • Updated Jun 22 • 638 • 49

Yirany

authored a paper 6 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 62

Yirany

authored a paper 8 months ago

Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions

Paper • 2412.08737 • Published Dec 11, 2024 • 55

HaoyeZhang

authored a paper 12 months ago

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 85

Yirany

authored a paper 12 months ago

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 85

HaoyeZhang

authored a paper over 1 year ago

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Paper • 2312.00849 • Published Dec 1, 2023 • 12

Yirany

authored a paper over 1 year ago

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Paper • 2312.00849 • Published Dec 1, 2023 • 12

AI & ML interests

Recent Activity

Team members 5

RLAIF-V's activity