Gated Associative Memory: A Parallel O(N) Architecture for Efficient Sequence Modeling • Paper 2509.00605 • Published 12 days ago
On the Expressiveness of Softmax Attention: A Recurrent Neural Network Perspective • Paper 2507.23632 • Published Jul 31
Cottention: Linear Transformers With Cosine Attention • Paper 2409.18747 • Published Sep 27, 2024