Hugging Face
19.4
TFLOPS
31
Mads
PRO
mhenrichsen
57 followers · 7 following
AI & ML interests
None yet
Recent Activity
replied to hannayukhymenko's post 1 day ago
Do you translate your benchmarks from English correctly? Turns out, for many languages it is much harder than you can imagine! Introducing Recovered in Translation, together with @aalexandrov: ritranslation.insait.ai

Translating benchmarks is a painful process that requires a lot of manual inspection and adjustment. You start by setting up the whole pipeline and adapting it to every format type, including task specifics. Some massive benchmarks already exist, but they still contain simple (and sometimes silly) bugs, which can hurt evaluations :( We present a novel automated translation framework to help with that!

Eastern and Southern European languages have richer linguistic structures than English, and for benchmarks that rely heavily on grammatical coherence, machine translation risks harming evaluations. We find that the grammatical structure of a question can leak the answer or mislead the model. Some benchmarks are also simply outdated and need to be retranslated with newer and better models. We present a framework with novel test-time scaling methods that allow control over time and cost investments while mitigating the need for human-in-the-loop verification.

While working on the Ukrainian-focused MamayLM models, we had to translate 10+ benchmarks in a short span of time. Finding human evaluators is costly and time-consuming, and the same goes for using professional translators. With our pipeline we were able to do it in 3 days.

We hope our findings will help enable stronger multilingual evaluations and developments. We release all produced benchmarks on Hugging Face together with the source code and arXiv paper.

Paper: https://huggingface.co/papers/2602.22207
Code: https://github.com/insait-institute/ritranslation
Benchmarks: https://huggingface.co/collections/INSAIT-Institute/multilingual-benchmarks
new activity 5 days ago: syvai/plapre-nano: Add CoRal-TTS as data source
updated a model 14 days ago: syvai/plapre-nano
Organizations
spaces (3)
syv.ai TTS • Runtime error • 5 • Try syv.ai TTS
Axolotl_Launcher • Running • 1 • Create a training configuration for Axolotl
DanskGPT • Runtime error • 4
models (14)
mhenrichsen/tts-v1-finetuned-Q8_0-GGUF • 3B • Updated Apr 27, 2025 • 18
mhenrichsen/gemma-2b-it • Text Generation • 3B • Updated Feb 21, 2024 • 3
mhenrichsen/gemma-2b • Text Generation • 3B • Updated Feb 21, 2024 • 6 • 1
mhenrichsen/gemma-7b-it • Text Generation • 9B • Updated Feb 21, 2024 • 4
mhenrichsen/gemma-7b • Text Generation • 9B • Updated Feb 21, 2024 • 59 • 4
mhenrichsen/danskgpt-tiny-chat • Text Generation • 1B • Updated Jan 27, 2024 • 36 • 13
mhenrichsen/hestenettetLM • Text Generation • 7B • Updated Jan 16, 2024 • 705 • 3
mhenrichsen/danskgpt-tiny • Text Generation • 1B • Updated Jan 13, 2024 • 54 • 19
mhenrichsen/tinymix-8x1b • Text Generation • 6B • Updated Jan 2, 2024 • 2 • 1
mhenrichsen/hviske • Automatic Speech Recognition • 2B • Updated Dec 9, 2023 • 18
datasets (7)
mhenrichsen/danish-tts-soprano-encoded • Viewer • Updated Jan 30 • 942k • 9
mhenrichsen/creator • Viewer • Updated Dec 12, 2023 • 1k • 6
mhenrichsen/context-aware-splits-english • Viewer • Updated Nov 16, 2023 • 28k • 11 • 5
mhenrichsen/hestenettet • Viewer • Updated Nov 5, 2023 • 14.5k • 20 • 3
mhenrichsen/terra • Viewer • Updated Sep 27, 2023 • 25.4M • 119
mhenrichsen/context-aware-splits • Viewer • Updated Sep 17, 2023 • 12.3k • 10 • 3
mhenrichsen/alpaca_2k_test • Viewer • Updated Jul 22, 2023 • 2k • 4.57k • 26