AI & ML interests
LLM/RAG/Agents/LSTM/CNN
Organizations
Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)
ariG23498
makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch
AviSoori1x
How to generate text: using different decoding methods for language generation with Transformers
patrickvonplaten