hallbayes (https://github.com/leochlon/hallbayes) is an interesting project by Leon Chlon (Hassana Labs) for checking hallucination risk before text generation: it uses a powerful approach to decide whether an LLM is confident enough to answer (or not).
https://arxiv.org/html/2509.11208v1
Predictable Compression Failures: Why Language Models Actually Hallucinate (2509.11208)
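The core idea is a pre-generation gate: estimate the hallucination risk for a prompt first, and only let the model answer when that risk is acceptably low, otherwise abstain. A minimal sketch of that decision pattern in Python (the estimate_hallucination_risk callable and the 5% threshold are hypothetical placeholders, not hallbayes's actual API):

```python
from dataclasses import dataclass

@dataclass
class GateDecision:
    answer: bool          # True -> let the model generate, False -> abstain
    risk_estimate: float  # estimated probability of hallucination for this prompt

def gate(prompt: str, estimate_hallucination_risk, max_risk: float = 0.05) -> GateDecision:
    """Pre-generation gate: answer only if the estimated hallucination risk
    for this prompt is below the target threshold.

    estimate_hallucination_risk is a hypothetical callable standing in for
    the per-prompt risk score a tool like hallbayes would provide.
    """
    risk = estimate_hallucination_risk(prompt)
    return GateDecision(answer=risk <= max_risk, risk_estimate=risk)

# Usage sketch: abstain instead of generating when the gate says no.
# decision = gate(prompt, estimate_hallucination_risk=my_risk_fn)
# reply = llm(prompt) if decision.answer else "I'm not confident enough to answer."
```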
I've just integrated the hallbayes library into my completionist project (a synthetic dataset generation CLI tool) to do exactly that, adding a new quality-control layer to synthetic data generation.

Ran a small test on 10 samples from google/boolq with a 4B Qwen Instruct model, Qwen/Qwen3-4B-Instruct-2507.
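For reference, pulling 10 BoolQ rows like the ones used in this test is a one-liner with the datasets library (the validation split is my assumption; the post doesn't say which split was sampled):

```python
from datasets import load_dataset

# Ten question/passage/answer rows from google/boolq.
# Split choice is an assumption; the post only says "10 samples".
samples = load_dataset("google/boolq", split="validation[:10]")
for row in samples:
    print(row["question"], "->", row["answer"])
```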
The output dataset now contains a hallucination_info column, flagging each sample with detailed metrics. The inference server is LM Studio, running on a MacBook Air M4 (16 GB).

Test w/ hallucination flags: ethicalabs/google-boolq-hallbayes-test-qwen3-4b-2507
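Anyone can inspect the flags by loading the published dataset from the Hub; a quick sketch (the split name and the exact schema of hallucination_info are assumptions, check the dataset card):

```python
from datasets import load_dataset

# Load the published test output; "train" split is an assumption.
ds = load_dataset("ethicalabs/google-boolq-hallbayes-test-qwen3-4b-2507", split="train")

# Each row carries a hallucination_info column with per-sample metrics;
# its exact fields depend on the hallbayes integration, so just print it.
for row in ds:
    print(row["hallucination_info"])
```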
Implementation PRs:
https://github.com/leochlon/hallbayes/pull/16
https://github.com/ethicalabs-ai/completionist/pull/11