Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Adhi Setiawan's picture

Adhi Setiawan

adhisetiawan

hmb's profile picture

Mi6paulino's profile picture

adamm-hf's profile picture

·

adhiiisetiawan

AI & ML interests

Computer Vision, Reinforcement Learning

Organizations

adhisetiawan 's collections 6

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 260
Audiobox: Unified Audio Generation with Natural Language Prompts

Paper • 2312.15821 • Published Dec 25, 2023 • 17
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Paper • 2312.16862 • Published Dec 28, 2023 • 31
LLaMA Pro: Progressive LLaMA with Block Expansion

Paper • 2401.02415 • Published Jan 4, 2024 • 54

microsoft/phi-2

Text Generation • 3B • Updated Dec 8, 2025 • 1.4M • 3.42k
TinyLlama/TinyLlama-1.1B-Chat-v1.0

Text Generation • 1B • Updated Mar 17, 2024 • 1.76M • 1.52k
TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T

Text Generation • 1B • Updated Sep 27, 2024 • 35.2k • • 184

openai/whisper-large-v3

Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 6.16M • • 5.39k
facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 57.1k • 952
nvidia/parakeet-rnnt-1.1b

Automatic Speech Recognition • Updated Nov 27, 2025 • 738 • 164

Multimodal Models

microsoft/kosmos-2-patch14-224

Image-to-Text • 2B • Updated Nov 28, 2023 • 144k • 183
Tyrannosaurus/TinyGPT-V

Updated Jan 19, 2024 • 50
naver-clova-ix/donut-base

Image-to-Text • Updated Aug 13, 2022 • 92.7k • 247
llava-hf/llava-v1.6-34b-hf

Image-Text-to-Text • 35B • Updated Jan 27, 2025 • 3.11k • 93

mistralai/Mixtral-8x7B-Instruct-v0.1

47B • Updated Jul 24, 2025 • 490k • 4.64k
mistralai/Mixtral-8x7B-v0.1

47B • Updated Jul 24, 2025 • 78.5k • 1.79k
meta-llama/Llama-2-7b-chat-hf

Text Generation • 7B • Updated Apr 17, 2024 • 428k • 4.7k

Multimodal Papers

Woodpecker: Hallucination Correction for Multimodal Large Language Models

Paper • 2310.16045 • Published Oct 24, 2023 • 17
SILC: Improving Vision Language Pretraining with Self-Distillation

Paper • 2310.13355 • Published Oct 20, 2023 • 9
To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning

Paper • 2311.07574 • Published Nov 13, 2023 • 16
MyVLM: Personalizing VLMs for User-Specific Queries

Paper • 2403.14599 • Published Mar 21, 2024 • 17

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Paper • 2312.11514 • Published Dec 12, 2023 • 260
Audiobox: Unified Audio Generation with Natural Language Prompts

Paper • 2312.15821 • Published Dec 25, 2023 • 17
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

Paper • 2312.16862 • Published Dec 28, 2023 • 31
LLaMA Pro: Progressive LLaMA with Block Expansion

Paper • 2401.02415 • Published Jan 4, 2024 • 54

Multimodal Models

microsoft/kosmos-2-patch14-224

Image-to-Text • 2B • Updated Nov 28, 2023 • 144k • 183
Tyrannosaurus/TinyGPT-V

Updated Jan 19, 2024 • 50
naver-clova-ix/donut-base

Image-to-Text • Updated Aug 13, 2022 • 92.7k • 247
llava-hf/llava-v1.6-34b-hf

Image-Text-to-Text • 35B • Updated Jan 27, 2025 • 3.11k • 93

microsoft/phi-2

Text Generation • 3B • Updated Dec 8, 2025 • 1.4M • 3.42k
TinyLlama/TinyLlama-1.1B-Chat-v1.0

Text Generation • 1B • Updated Mar 17, 2024 • 1.76M • 1.52k
TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T

Text Generation • 1B • Updated Sep 27, 2024 • 35.2k • • 184

mistralai/Mixtral-8x7B-Instruct-v0.1

47B • Updated Jul 24, 2025 • 490k • 4.64k
mistralai/Mixtral-8x7B-v0.1

47B • Updated Jul 24, 2025 • 78.5k • 1.79k
meta-llama/Llama-2-7b-chat-hf

Text Generation • 7B • Updated Apr 17, 2024 • 428k • 4.7k

openai/whisper-large-v3

Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 6.16M • • 5.39k
facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 57.1k • 952
nvidia/parakeet-rnnt-1.1b

Automatic Speech Recognition • Updated Nov 27, 2025 • 738 • 164

Multimodal Papers

Woodpecker: Hallucination Correction for Multimodal Large Language Models

Paper • 2310.16045 • Published Oct 24, 2023 • 17
SILC: Improving Vision Language Pretraining with Self-Distillation

Paper • 2310.13355 • Published Oct 20, 2023 • 9
To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning

Paper • 2311.07574 • Published Nov 13, 2023 • 16
MyVLM: Personalizing VLMs for User-Specific Queries

Paper • 2403.14599 • Published Mar 21, 2024 • 17

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs