vitalyr
AI & ML interests
None yet
Organizations
None yet
llm
- LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
  Paper • 2404.05961 • Published • 66
- Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
  Paper • 2404.07143 • Published • 111
- Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies
  Paper • 2404.08197 • Published • 30
- Pre-training Small Base LMs with Fewer Tokens
  Paper • 2404.08634 • Published • 36
mamba
models
0
None public yet
datasets
0
None public yet