AI & ML interests

LLM

Recent Activity

ljbro updated a dataset 2 days ago
fnlp/VehicleWorld
ljbro published a dataset 2 days ago
fnlp/VehicleWorld
rulerman updated a model 15 days ago
fnlp/MOSS-TTSD-v0.5

fnlp's collections (6)

MHA2MLA-refactor
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"
MHA2MLA
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"