MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published Jun 16, 2025 • 260
Time Blindness: Why Video-Language Models Can't See What Humans Can? Paper • 2505.24867 • Published May 30, 2025 • 80
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Paper • 2504.18415 • Published Apr 25, 2025 • 46 • see the Hadamard sketch after this list
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16, 2025 • 163
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1, 2024 • 152 • see the L-Mul sketch after this list
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models Paper • 2409.18943 • Published Sep 27, 2024 • 30
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models Paper • 2409.17066 • Published Sep 25, 2024 • 29 • see the vector-quantization sketch after this list
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs Paper • 2408.07055 • Published Aug 13, 2024 • 67
Scalify: scale propagation for efficient low-precision LLM training Paper • 2407.17353 • Published Jul 24, 2024 • 13
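BitNet v2's title names its key mechanism: rotate activations with a Hadamard transform so outliers get spread across channels before rounding to native 4-bit values. The sketch below is a minimal NumPy illustration of that rotate-then-quantize idea, not the paper's implementation; the function names and the symmetric per-tensor quantizer are our own assumptions.

```python
import numpy as np

def fwht(x: np.ndarray) -> np.ndarray:
    """Fast Walsh-Hadamard transform over the last axis.
    Length must be a power of two; scaled so the transform is orthonormal
    (and therefore its own inverse)."""
    x = x.copy().astype(np.float64)
    n = x.shape[-1]
    assert n & (n - 1) == 0, "length must be a power of two"
    h = 1
    while h < n:
        # butterfly step: combine adjacent blocks of size h
        for i in range(0, n, h * 2):
            a = x[..., i:i + h].copy()
            b = x[..., i + h:i + 2 * h].copy()
            x[..., i:i + h] = a + b
            x[..., i + h:i + 2 * h] = a - b
        h *= 2
    return x / np.sqrt(n)

def quantize_4bit(x: np.ndarray):
    """Symmetric per-tensor 4-bit quantization (integer levels -8..7)."""
    scale = np.abs(x).max() / 7.0
    q = np.clip(np.round(x / scale), -8, 7)
    return q, scale

# Rotating first smears outliers across all channels, so the max-based scale
# wastes fewer quantization levels than quantizing raw activations directly.
acts = np.random.standard_cauchy(size=(4, 256))   # heavy-tailed, outlier-prone
q, s = quantize_4bit(fwht(acts))
dequant = fwht(q * s)   # orthonormal Hadamard inverts itself
```

"Addition is All You Need" proposes L-Mul, which replaces floating-point multiplication with an operation built almost entirely from additions: since (1 + m_x)(1 + m_y) ≈ 1 + m_x + m_y + 2^{-l}, the mantissa product collapses into a sum plus a small correction term. Below is a rough Python emulation of that approximation; the parameter names are ours, and real hardware would perform the mantissa sum as an integer add on the bit patterns.

```python
import math

def l_mul(x: float, y: float, l: int = 4) -> float:
    """Approximate x*y via (1 + m_x + m_y + 2**-l) * 2**(e_x + e_y),
    where |x| = (1 + m_x) * 2**e_x and 0 <= m_x < 1."""
    if x == 0.0 or y == 0.0:
        return 0.0
    sign = math.copysign(1.0, x) * math.copysign(1.0, y)
    fx, ex = math.frexp(abs(x))   # abs(x) = fx * 2**ex with 0.5 <= fx < 1
    fy, ey = math.frexp(abs(y))
    mx, my = 2 * fx - 1, 2 * fy - 1        # rewrite as (1 + m) * 2**(e - 1)
    mantissa = 1 + mx + my + 2.0 ** -l     # one addition replaces m_x * m_y
    return sign * mantissa * 2.0 ** (ex - 1 + ey - 1)

print(l_mul(3.0, 5.0), 3.0 * 5.0)   # ~14.5 vs 15.0
```

VPTQ's title describes vector (rather than scalar) post-training quantization: weights are grouped into short vectors, and each vector is replaced by an index into a learned codebook, pushing the effective bit-width well below one scalar value per weight. Below is a toy k-means (Lloyd's algorithm) rendering of that idea; the group size, codebook size, and helper names are illustrative assumptions, and the actual VPTQ codebook construction is considerably more sophisticated.

```python
import numpy as np

def vector_quantize(w: np.ndarray, dim: int = 4, k: int = 16, iters: int = 20):
    """Toy vector quantization: split a weight matrix into length-`dim`
    vectors and fit a k-entry codebook with plain k-means."""
    vecs = w.reshape(-1, dim)
    rng = np.random.default_rng(0)
    codebook = vecs[rng.choice(len(vecs), size=k, replace=False)]
    for _ in range(iters):
        # assign each vector to its nearest codeword
        d = ((vecs[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
        idx = d.argmin(axis=1)
        # update each codeword as the mean of its assigned vectors
        for j in range(k):
            members = vecs[idx == j]
            if len(members):
                codebook[j] = members.mean(axis=0)
    return idx.reshape(w.shape[0], -1), codebook

def dequantize(idx, codebook, shape):
    return codebook[idx].reshape(shape)

w = np.random.randn(64, 64).astype(np.float32)
idx, cb = vector_quantize(w)
w_hat = dequantize(idx, cb, w.shape)
# Storage drops to log2(k) = 4 bits per dim = 4 weights, i.e. 1 bit/weight,
# plus the small codebook.
```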