Mads PRO

mhenrichsen

AI & ML interests

None yet

Recent Activity

replied to hannayukhymenko's post 1 day ago

Do you translate your benchmarks from English correctly? 🤔 It turns out that for many languages this is much harder than you might imagine! Introducing Recovered in Translation 🌍 together with @aalexandrov ritranslation.insait.ai
Translating benchmarks is a painful process that requires a lot of manual inspection and adjustment. You start by setting up the whole pipeline and adapting it to every format type, including task specifics. Some massive translated benchmarks already exist, but they still contain simple (and sometimes silly) bugs that can hurt evaluations :( We present a novel automated translation framework to help with that!
Eastern and Southern European languages have richer linguistic structures than English, so for benchmarks that rely heavily on grammatical coherence, machine translation risks harming evaluations. We find that the grammatical structure of a translated question can leak the answer or mislead the model. Some benchmarks are also simply outdated and need to be retranslated with newer and better models.
Our framework uses novel test-time scaling methods that let you control the time and cost invested while reducing the need for human-in-the-loop verification. While working on the Ukrainian-focused MamayLM models, we had to translate 10+ benchmarks in a short span of time. Finding human evaluators is costly and time-consuming, and so is hiring professional translators. With our pipeline we were able to do it in 3 days 🏎️
We hope our findings will help enable stronger multilingual evaluations and developments. We release all produced benchmarks on Hugging Face together with the source code and arXiv paper 🤗
Paper: https://huggingface.co/papers/2602.22207
Code: https://github.com/insait-institute/ritranslation
Benchmarks: https://huggingface.co/collections/INSAIT-Institute/multilingual-benchmarks

new activity 5 days ago

syvai/plapre-nano:Add CoRal-TTS as data source

updated a model 14 days ago

syvai/plapre-nano

spaces 3

syv.ai TTS

Try syv.ai TTS

Axolotl_Launcher

Create a training configuration for Axolotl

DanskGPT

models 14

mhenrichsen/tts-v1-finetuned-Q8_0-GGUF

3B • Updated Apr 27, 2025 • 18

mhenrichsen/gemma-2b-it

Text Generation • 3B • Updated Feb 21, 2024 • 3

mhenrichsen/gemma-2b

Text Generation • 3B • Updated Feb 21, 2024 • 6 • 1

mhenrichsen/gemma-7b-it

Text Generation • 9B • Updated Feb 21, 2024 • 4

mhenrichsen/gemma-7b

Text Generation • 9B • Updated Feb 21, 2024 • 59 • 4

mhenrichsen/danskgpt-tiny-chat

Text Generation • 1B • Updated Jan 27, 2024 • 36 • 13

mhenrichsen/hestenettetLM

Text Generation • 7B • Updated Jan 16, 2024 • 705 • 3

mhenrichsen/danskgpt-tiny

Text Generation • 1B • Updated Jan 13, 2024 • 54 • 19

mhenrichsen/tinymix-8x1b

Text Generation • 6B • Updated Jan 2, 2024 • 2 • 1

mhenrichsen/hviske

Automatic Speech Recognition • 2B • Updated Dec 9, 2023 • 18

datasets 7

mhenrichsen/danish-tts-soprano-encoded

Viewer • Updated Jan 30 • 942k • 9

mhenrichsen/creator

Viewer • Updated Dec 12, 2023 • 1k • 6

mhenrichsen/context-aware-splits-english

Viewer • Updated Nov 16, 2023 • 28k • 11 • 5

mhenrichsen/hestenettet

Viewer • Updated Nov 5, 2023 • 14.5k • 20 • 3

mhenrichsen/terra

Viewer • Updated Sep 27, 2023 • 25.4M • 119

mhenrichsen/context-aware-splits

Viewer • Updated Sep 17, 2023 • 12.3k • 10 • 3

mhenrichsen/alpaca_2k_test

Viewer • Updated Jul 22, 2023 • 2k • 4.57k • 26