kuleshov-group/caduceus-ph_seqlen-1k_d_model-118_n_layer-4_lr-8e-3 • Fill-Mask • 471k • Updated Oct 20, 2025 • 11 • 1
kuleshov-group/caduceus-ph_seqlen-1k_d_model-256_n_layer-4_lr-8e-3 • Fill-Mask • 1.93M • Updated Oct 20, 2025 • 42 • 1
kuleshov-group/caduceus-ph_seqlen-131k_d_model-256_n_layer-16 • Fill-Mask • 7.73M • Updated Oct 20, 2025 • 1.23k • 6
kuleshov-group/caduceus-ps_seqlen-1k_d_model-118_n_layer-4_lr-8e-3 • Fill-Mask • 471k • Updated Oct 20, 2025 • 11 • 1
kuleshov-group/caduceus-ps_seqlen-1k_d_model-256_n_layer-4_lr-8e-3 • Fill-Mask • 1.93M • Updated Oct 20, 2025 • 27 • 2
kuleshov-group/caduceus-ps_seqlen-131k_d_model-256_n_layer-16 • Fill-Mask • 7.73M • Updated Oct 20, 2025 • 1.75k • 14
kuleshov-group/bd3lm-owt-block_size1024-pretrain • Text Generation • 0.2B • Updated Mar 18, 2025 • 534 • 1