Data Mining and Information Systems Lab
dmis-lab
Recent Activity
Updated a collection 19 days ago: Med-PRM
Updated a dataset 19 days ago: dmis-lab/llama-3.1-medprm-reward-raw-training-set
Published a dataset 19 days ago: dmis-lab/llama-3.1-medprm-reward-raw-training-set
Meerkat
This collection hosts the Meerkat series introduced in the paper "Small Language Models Learn Enhanced Reasoning Skills from Medical Textbooks".
- dmis-lab/meerkat-7b-v1.0 (Text Generation • 7B • 597 downloads • 24 likes)
- dmis-lab/llama-3-meerkat-8b-v1.0 (Text Generation • 8B • 1.12k downloads • 6 likes)
- dmis-lab/llama-3-meerkat-70b-v1.0 (Text Generation • 71B • 862 downloads • 6 likes)
- Small Language Models Learn Enhanced Reasoning Skills from Medical Textbooks (Paper • 2404.00376 • 5 upvotes)
OLAPH
This collection hosts models introduced in OLAPH: Improving Factuality in Biomedical Long-form Question Answering.
TouR
This collection hosts the phrase reranker models introduced in TouR (ACL 2023 Findings), which optimizes test-time query representations for dense retrieval.
BioBERT
This collection hosts the BioBERT (Bioinformatics 2020) series, a domain-specific adaptation of BERT pre-trained on biomedical corpora.
Med-PRM
This collection hosts the Med-PRM series introduced in the paper "Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards".
ANGEL
This collection hosts the ANGEL series introduced in the paper "Learning from Negative Samples in Generative Biomedical Entity Linking".
Self-BioRAG
This collection hosts the Self-BioRAG (ISMB 2024) models, which improve medical reasoning through retrieval and self-reflection.
BioSyn
This collection hosts the BioSyn (ACL 2020) series for learning representations of biomedical entities based on their synonyms.
Outlier-Safe Pre-Training (OSP)
A collection of ablation and final models trained on the Outlier-Safe Pre-Training (OSP) framework.