30 10 84

devngho PRO

devngho

https://ngho.dev

devngho

AI & ML interests

Efficient Korean NLP, Fine Korean datasets

Recent Activity

liked a model 15 days ago

datalab-to/chandra

liked a dataset 16 days ago

HuggingFaceFW/finewiki

liked a dataset 24 days ago

nick007x/github-code-2025

View all activity

Organizations

upvoted a paper about 1 year ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 53

upvoted a collection about 1 year ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 653

upvoted a paper about 1 year ago

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Paper • 2309.09400 • Published Sep 17, 2023 • 85

upvoted an article about 1 year ago

Article

Mergoo: Efficiently Build Your Own MoE LLM

•

Jun 3, 2024

• 48

upvoted 4 papers about 1 year ago

Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP

Paper • 2408.04303 • Published Aug 8, 2024 • 22

upvoted 2 articles over 1 year ago

Article

Expanding Model Context and Creating Chat Models with a Single Click

•

Apr 28, 2024

• 38

Article

Can We Train Chat Models with Raw Data?

•

Apr 25, 2024

• 19

devngho PRO

AI & ML interests

Recent Activity

Organizations

devngho's activity

Mergoo: Efficiently Build Your Own MoE LLM

Expanding Model Context and Creating Chat Models with a Single Click

Can We Train Chat Models with Raw Data?