nitishpandey04's Collections
Reading List
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning
Paper
•
2504.07128
•
Published
•
86
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper
•
2412.09871
•
Published
•
107
BitNet b1.58 2B4T Technical Report
Paper
•
2504.12285
•
Published
•
74
FAST: Efficient Action Tokenization for Vision-Language-Action Models
Paper
•
2501.09747
•
Published
•
25
Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models
Paper
•
2412.14058
•
Published
•
1
π_0: A Vision-Language-Action Flow Model for General Robot Control
Paper
•
2410.24164
•
Published
•
23
Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation
Paper
•
2502.16707
•
Published
•
13
OpenVLA: An Open-Source Vision-Language-Action Model
Paper
•
2406.09246
•
Published
•
40
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Paper
•
2307.15818
•
Published
•
30
A Dual Process VLA: Efficient Robotic Manipulation Leveraging VLM
Paper
•
2410.15549
•
Published
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Paper
•
2310.08864
•
Published
•
2
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper
•
2408.03314
•
Published
•
64
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics
Paper
•
2506.01844
•
Published
•
122
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Paper
•
1910.01108
•
Published
•
17
Block Pruning For Faster Transformers
Paper
•
2109.04838
•
Published
•
2
The case for 4-bit precision: k-bit Inference Scaling Laws
Paper
•
2212.09720
•
Published
•
3
Matryoshka Representation Learning
Paper
•
2205.13147
•
Published
•
19
Language Models are Few-Shot Learners
Paper
•
2005.14165
•
Published
•
14
Scaling Vision Transformers to 22 Billion Parameters
Paper
•
2302.05442
•
Published
•
2
Robust Speech Recognition via Large-Scale Weak Supervision
Paper
•
2212.04356
•
Published
•
35
Emu3: Next-Token Prediction is All You Need
Paper
•
2409.18869
•
Published
•
96
Neural Architecture Search with Reinforcement Learning
Paper
•
1611.01578
•
Published
•
2
Regularized Evolution for Image Classifier Architecture Search
Paper
•
1802.01548
•
Published
•
2
High-Resolution Image Synthesis with Latent Diffusion Models
Paper
•
2112.10752
•
Published
•
13
Denoising Diffusion Probabilistic Models
Paper
•
2006.11239
•
Published
•
4
Scalable Diffusion Models with Transformers
Paper
•
2212.09748
•
Published
•
18
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Paper
•
2112.10741
•
Published
•
4
Diffusion Models Beat GANs on Image Synthesis
Paper
•
2105.05233
•
Published
•
2