-
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 43 -
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers
Paper • 2311.10642 • Published • 26 -
Orca 2: Teaching Small Language Models How to Reason
Paper • 2311.11045 • Published • 76
Lejon
Annelies
AI & ML interests
speech recognition
Organizations
None yet
nlp
-
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 43 -
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers
Paper • 2311.10642 • Published • 26 -
Orca 2: Teaching Small Language Models How to Reason
Paper • 2311.11045 • Published • 76
pose
models
0
None public yet
datasets
0
None public yet