It is also surprising to me; quite a cool side effect. You'll find similar supporting results on p. 5 of 'Matryoshka Representation Learning'. In fact, the graphs on that page suggest that MRL-E/tied MRL underperforms their vanilla FF model at low dimensions, and the differences between their baseline and MRL aren't that significant either (in those specific graphs). Funnily enough, Sentence Transformers implements MRL-E/tied MRL, not untied MRL.
@tomaarsen's results seem to be quite different, in that MRL-E is winning against no MRL.
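For reference, here's a minimal PyTorch sketch of the tied vs. untied distinction mentioned above, in the paper's classification setting (not Sentence Transformers' actual contrastive implementation). The class names, dimension list, and loss weighting are illustrative assumptions, not code from either source.

```python
# Sketch only: untied MRL uses a separate classifier per truncated embedding
# size, while tied MRL-E reuses slices of a single full-dimension classifier.
import torch
import torch.nn as nn
import torch.nn.functional as F

NESTING_DIMS = [64, 128, 256, 512]   # truncation points, smallest first (illustrative)
FULL_DIM, NUM_CLASSES = 512, 10      # illustrative sizes

class UntiedMRLHead(nn.Module):
    """Untied MRL: one independent classifier per nesting dimension."""
    def __init__(self):
        super().__init__()
        self.heads = nn.ModuleList([nn.Linear(d, NUM_CLASSES) for d in NESTING_DIMS])

    def forward(self, z):  # z: (batch, FULL_DIM)
        return [head(z[:, :d]) for d, head in zip(NESTING_DIMS, self.heads)]

class TiedMRLEHead(nn.Module):
    """Tied MRL-E: the first d columns of a single classifier serve each prefix."""
    def __init__(self):
        super().__init__()
        self.head = nn.Linear(FULL_DIM, NUM_CLASSES)

    def forward(self, z):
        return [F.linear(z[:, :d], self.head.weight[:, :d], self.head.bias)
                for d in NESTING_DIMS]

def mrl_loss(logits_per_dim, labels):
    # Unweighted sum of cross-entropy over all nesting dims (the paper allows weights).
    return sum(F.cross_entropy(logits, labels) for logits in logits_per_dim)

# Toy usage
z = torch.randn(4, FULL_DIM)
labels = torch.randint(0, NUM_CLASSES, (4,))
loss_untied = mrl_loss(UntiedMRLHead()(z), labels)
loss_tied = mrl_loss(TiedMRLEHead()(z), labels)
```

The tied variant adds no parameters beyond the full-dimension head, which is presumably part of why it's the more convenient form for a library to ship.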