rasyosef
/

splade-tiny

@@ -1,60 +1,38 @@
 ---
-language:
-- en
-license: mit
 tags:
 - sentence-transformers
 - sparse-encoder
 - sparse
 - splade
 - generated_from_trainer
-- dataset_size:496123
 - loss:SpladeLoss
 - loss:SparseMarginMSELoss
 - loss:FlopsLoss
-base_model: prajjwal1/bert-tiny
 widget:
-- text: >-
-    Hurley doesn't just want to be your go-to for surf gear, but the be the
-    brand that represents your lifestyle. Of course you have your pick up board
-    shorts, tanks and a Hurley hat while you're on the beach, but you can also
-    look at graphic tees, sandals, and accessories when you're on the street.
-- text: >-
-    Electric field of a positive and a negative point charge. Electric charge is
-    the physical property of matter that causes it to experience a force when
-    placed in an electromagnetic field.There are two types of electric charges:
-    positive and negative.lectric charge is a characteristic property of many
-    subatomic particles. The charges of free-standing particles are integer
-    multiples of the elementary charge e; we say that electric charge is
-    quantized. Michael Faraday, in his electrolysis experiments, was the first
-    to note the discrete nature of electric charge.
-- text: >-
-    The term mechanical digestion refers to the physical breakdown of large
-    pieces of food into smaller pieces which can subsequently be accessed by
-    digestive enzymes. In chemical digestion, enzymes break down food into the
-    small molecules the body can use.
-- text: >-
-    Kids and Quick Solutions. Children learn to put away their clothes when they
-    can reach the hanging rods. This is actually fun for little ones -- they may
-    spend a long stretch of time putting hangers on and taking them off the rods
-    -- as long as the rods are child-height.So take your stand against piles of
-    clothes on the floor of the teen's bedroom early by re-sizing the closet to
-    fit the kid.his is actually fun for little ones -- they may spend a long
-    stretch of time putting hangers on and taking them off the rods -- as long
-    as the rods are child-height. So take your stand against piles of clothes on
-    the floor of the teen's bedroom early by re-sizing the closet to fit the
-    kid.
-- text: >-
-    About EUS (endoscopic ultrasound). An EUS, or endoscopic ultrasound, is an
-    outpatient procedure used to closely examine the tissues in the digestive
-    tract. The procedure is done using a standard endoscope and a tiny
-    ultrasound device.The ultrasound sensor sends back visual images of the
-    digestive tract to a screen, allowing the physician to see deeper into the
-    tissues and the organs beneath the surface of the intestines.. In general,
-    an EUS is a very safe procedure. If your procedure is being done on the
-    upper GI tract, you may have a sore throat for a few days. As a result of
-    the sedation, you should not drive, operate heavy machinery or make any
-    important decisions for up to six hours following the procedure.
 pipeline_tag: feature-extraction
 library_name: sentence-transformers
 metrics:
@@ -78,7 +56,7 @@ metrics:
 - corpus_active_dims
 - corpus_sparsity_ratio
 model-index:
-- name: SPLADE-BERT-Tiny-Distil
   results:
   - task:
       type: sparse-information-retrieval
@@ -88,86 +66,94 @@ model-index:
       type: unknown
     metrics:
     - type: dot_accuracy@1
-      value: 0.4602
       name: Dot Accuracy@1
     - type: dot_accuracy@3
-      value: 0.7768
       name: Dot Accuracy@3
     - type: dot_accuracy@5
-      value: 0.885
       name: Dot Accuracy@5
     - type: dot_accuracy@10
-      value: 0.9548
       name: Dot Accuracy@10
     - type: dot_precision@1
-      value: 0.4602
       name: Dot Precision@1
     - type: dot_precision@3
-      value: 0.2653333333333333
       name: Dot Precision@3
     - type: dot_precision@5
-      value: 0.18391999999999997
       name: Dot Precision@5
     - type: dot_precision@10
-      value: 0.10024
       name: Dot Precision@10
     - type: dot_recall@1
-      value: 0.4461833333333334
       name: Dot Recall@1
     - type: dot_recall@3
-      value: 0.7631166666666666
       name: Dot Recall@3
     - type: dot_recall@5
-      value: 0.8761
       name: Dot Recall@5
     - type: dot_recall@10
-      value: 0.9500333333333334
       name: Dot Recall@10
     - type: dot_ndcg@10
-      value: 0.7094495794736737
       name: Dot Ndcg@10
     - type: dot_mrr@10
-      value: 0.6344716666666689
       name: Dot Mrr@10
     - type: dot_map@100
-      value: 0.6306882016403095
       name: Dot Map@100
     - type: query_active_dims
-      value: 16.77560043334961
       name: Query Active Dims
     - type: query_sparsity_ratio
-      value: 0.9994503767632085
       name: Query Sparsity Ratio
     - type: corpus_active_dims
-      value: 102.47956598021874
       name: Corpus Active Dims
     - type: corpus_sparsity_ratio
-      value: 0.9966424360795421
       name: Corpus Sparsity Ratio
-datasets:
-- microsoft/ms_marco
 ---
-# SPLADE-BERT-Tiny-Distil
-This is a SPLADE sparse retrieval model based on BERT-Tiny (4M) that was trained by distilling a Cross-Encoder on the MSMARCO dataset. The cross-encoder used was [ms-marco-MiniLM-L6-v2](https://huggingface.co/cross-encoder/ms-marco-MiniLM-L6-v2).
-This Tiny SPLADE model beats `BM25` by `65.6%` on the MSMARCO benchmark. While this model is `15x` smaller than Naver's official `splade-v3-distilbert`, is posesses `80%` of it's performance on MSMARCO. This model is small enough to be used without a GPU on a dataset of a few thousand documents.
-- `Collection:` https://huggingface.co/collections/rasyosef/splade-tiny-msmarco-687c548c0691d95babf65b70
-- `Distillation Dataset:` https://huggingface.co/datasets/yosefw/msmarco-train-distil-v2
-- `Code:` https://github.com/rasyosef/splade-tiny-msmarco
-## Performance
-The splade models were evaluated on 55 thousand queries and 8 million documents from the [MSMARCO](https://huggingface.co/datasets/microsoft/ms_marco) dataset.
-||Size (# Params)|MRR@10 (MS MARCO dev)|
-|:---|:----|:-------------------|
-|`BM25`|-|18.6|-|-|
-|`rasyosef/splade-tiny`|4.4M|30.8|
-|`rasyosef/splade-mini`|11.2M|32.8|
-|`naver/splade-v3-distilbert`|67.0M|38.7|
 ## Usage
@@ -184,15 +170,15 @@ Then you can load this model and run inference.
 from sentence_transformers import SparseEncoder
 # Download from the 🤗 Hub
-model = SparseEncoder("rasyosef/splade-tiny")
 # Run inference
 queries = [
-    "what is eus appointment",
 ]
 documents = [
-    "Endoscopic Ultrasound (EUS). You've been referred to have an endoscopic ultrasound, or EUS, which will help your doctor, evaluate or treat your condition. This brochure will give you a basic understanding of the procedure-how it is performed, how it can help, and what side effects you might experience.our doctor can use EUS to diagnose the cause of conditions such as abdominal pain or abnormal weight loss. Or, if your doctor has ruled out certain conditions, EUS can confirm your diagnosis and give you a clean bill of health.",
-    'About EUS (endoscopic ultrasound). An EUS, or endoscopic ultrasound, is an outpatient procedure used to closely examine the tissues in the digestive tract. The procedure is done using a standard endoscope and a tiny ultrasound device.The ultrasound sensor sends back visual images of the digestive tract to a screen, allowing the physician to see deeper into the tissues and the organs beneath the surface of the intestines.. In general, an EUS is a very safe procedure. If your procedure is being done on the upper GI tract, you may have a sore throat for a few days. As a result of the sedation, you should not drive, operate heavy machinery or make any important decisions for up to six hours following the procedure.',
-    'Endoscopic Ultrasound (EUS) allows your doctor to examine the lining and the walls of your upper and lower gastrointestinal tract.The upper tract is the esophagus, stomach, and duodenum; the lower tract includes your colon and rectum.Doctors also use EUS to study internal organs that lie next to the gastrointestinal tract, such as the gall bladder and the pancreas. Your endoscopist will use a thin, flexible tube called an endoscope.he upper tract is the esophagus, stomach, and duodenum; the lower tract includes your colon and rectum. Doctors also use EUS to study internal organs that lie next to the gastrointestinal tract, such as the gall bladder and the pancreas.',
 ]
 query_embeddings = model.encode_query(queries)
 document_embeddings = model.encode_document(documents)
@@ -202,7 +188,7 @@ print(query_embeddings.shape, document_embeddings.shape)
 # Get the similarity scores for the embeddings
 similarities = model.similarity(query_embeddings, document_embeddings)
 print(similarities)
-# tensor([[12.9370, 14.3277, 12.9725]])
 ```
 <!--
@@ -229,37 +215,6 @@ You can finetune this model on your own dataset.
 *List how the model may foreseeably be misused and address what users ought not to do with the model.*
 -->
-## Model Details
-### Model Description
-- **Model Type:** SPLADE Sparse Encoder
-- **Base model:** [prajjwal1/bert-tiny](https://huggingface.co/prajjwal1/bert-tiny) <!-- at revision 6f75de8b60a9f8a2fdf7b69cbd86d9e64bcb3837 -->
-- **Maximum Sequence Length:** 512 tokens
-- **Output Dimensionality:** 30522 dimensions
-- **Similarity Function:** Dot Product
-<!-- - **Training Dataset:** Unknown -->
-- **Language:** en
-- **License:** mit
-### Model Sources
-- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
-- **Documentation:** [Sparse Encoder Documentation](https://www.sbert.net/docs/sparse_encoder/usage/usage.html)
-- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
-- **Hugging Face:** [Sparse Encoders on Hugging Face](https://huggingface.co/models?library=sentence-transformers&other=sparse-encoder)
-### Full Model Architecture
-```
-SparseEncoder(
-  (0): MLMTransformer({'max_seq_length': 512, 'do_lower_case': False, 'architecture': 'BertForMaskedLM'})
-  (1): SpladePooling({'pooling_strategy': 'max', 'activation_function': 'relu', 'word_embedding_dimension': 30522})
-)
-```
-## More
-<details><summary>Click to expand</summary>
 ## Evaluation
 ### Metrics
@@ -270,25 +225,25 @@ SparseEncoder(
 | Metric                | Value      |
 |:----------------------|:-----------|
-| dot_accuracy@1        | 0.4602     |
-| dot_accuracy@3        | 0.7768     |
-| dot_accuracy@5        | 0.885      |
-| dot_accuracy@10       | 0.9548     |
-| dot_precision@1       | 0.4602     |
-| dot_precision@3       | 0.2653     |
-| dot_precision@5       | 0.1839     |
-| dot_precision@10      | 0.1002     |
-| dot_recall@1          | 0.4462     |
-| dot_recall@3          | 0.7631     |
-| dot_recall@5          | 0.8761     |
-| dot_recall@10         | 0.95       |
-| **dot_ndcg@10**       | **0.7094** |
-| dot_mrr@10            | 0.6345     |
-| dot_map@100           | 0.6307     |
-| query_active_dims     | 16.7756    |
-| query_sparsity_ratio  | 0.9995     |
-| corpus_active_dims    | 102.4796   |
-| corpus_sparsity_ratio | 0.9966     |
 <!--
 ## Bias, Risks and Limitations
@@ -308,25 +263,25 @@ SparseEncoder(
 #### Unnamed Dataset
-* Size: 496,123 training samples
-* Columns: <code>query</code>, <code>positive</code>, <code>negative_1</code>, <code>negative_2</code>, <code>negative_3</code>, <code>negative_4</code>, and <code>label</code>
 * Approximate statistics based on the first 1000 samples:
-  |         | query                                                                            | positive                                                                            | negative_1                                                                          | negative_2                                                                         | negative_3                                                                          | negative_4                                                                         | label                              |
-  |:--------|:---------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|:-----------------------------------|
-  | type    | string                                                                           | string                                                                              | string                                                                              | string                                                                             | string                                                                              | string                                                                             | list                               |
-  | details | <ul><li>min: 4 tokens</li><li>mean: 9.09 tokens</li><li>max: 37 tokens</li></ul> | <ul><li>min: 14 tokens</li><li>mean: 80.68 tokens</li><li>max: 215 tokens</li></ul> | <ul><li>min: 20 tokens</li><li>mean: 78.57 tokens</li><li>max: 238 tokens</li></ul> | <ul><li>min: 18 tokens</li><li>mean: 77.8 tokens</li><li>max: 253 tokens</li></ul> | <ul><li>min: 18 tokens</li><li>mean: 76.46 tokens</li><li>max: 248 tokens</li></ul> | <ul><li>min: 16 tokens</li><li>mean: 75.9 tokens</li><li>max: 190 tokens</li></ul> | <ul><li>size: 4 elements</li></ul> |
 * Samples:
-  | query                                            | positive                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        | negative_1                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | negative_2                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | negative_3                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                | negative_4                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | label                                                                                       |
-  |:-------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------|
-  | <code>could Nexium antacid cause sweating</code> | <code>Summary: Sweating-excessive is found among people who take Nexium, especially for people who are 60+ old, have been taking the drug for.Personalized health information: on eHealthMe you can find out what patients like me (same gender, age) reported their drugs and conditions on FDA and social media since 1977. I am a 56 year old female who has been taking Nexium for 13 years and has been plagued by shingles.. 2  Support group for people who have Sweating-Excessive.  3 Been on warfarin for 6 days and having sweating at times.</code> | <code>More questions for: Nexium, Sweating-excessive. You may be interested at these reviews (Write a review): 1  Xarelto caused shortness of breath. 2  After taking Xarelto for 3 years I suddently experienced shortness of breath, sweating and pain in my arms. 3  Myrbetriq & hyperhidrosis (night sweats). I am a 56 year old female who has been taking Nexium for 13 years and has been plagued by shingles.. 2  Support group for people who have Sweating-Excessive.  3 Been on warfarin for 6 days and having sweating at times.</code> | <code>NEXIUM may help your acid-related symptoms, but you could still have serious stomach problems. Talk with your doctor. NEXIUM can cause serious side effects, including: 1  Diarrhea. 2  NEXIUM may increase your risk of getting severe diarrhea.3  This diarrhea may be caused by an infection (Clostridium difficile) in your intestines.EXIUM can cause serious side effects, including: 1  Diarrhea. 2  NEXIUM may increase your risk of getting severe diarrhea. 3  This diarrhea may be caused by an infection (Clostridium difficile) in your intestines.</code> | <code>Treatment for sweating. The treatment you have will depend on the cause of your sweating. If you have an infection, antibiotics will treat the infection and stop the sweating. If your sweating is due to cancer, treating the cancer can get rid of the sweating.If you have sweating because treatment has changed your hormone levels, it may settle down after a few weeks or months, once your body is used to the treatment. Talk to your doctor or nurse about your sweats.nfection. Infection is one of the most common causes of sweating in people who have cancer. Infection can give you a high temperature and your body sweats to try and reduce it. Treating the infection can control or stop the sweating.</code> | <code>Esomeprazole is used to treat certain stomach and esophagus problems (such as acid reflux, ulcers). It works by decreasing the amount of acid your stomach makes.ide Effects. See also Precautions section. Headache or abdominal pain may occur. If any of these effects persist or worsen, tell your doctor or pharmacist promptly. Remember that your doctor has prescribed this medication because he or she has judged that the benefit to you is greater than the risk of side effects.</code> | <code>[0.5, 6.390576362609863, 11.97206974029541, 16.409034729003906]</code>                |
-  | <code>what is electronic document access</code>  | <code>Electronic Document Access (EDA) is a web-based system that provides secure online access, storage, and retrieval of contracts, contract modifications, Government Bills of Lading (GBLs), DFAS Transactions for Others (E110), vouchers, and Contract Deficiency Reports (CDR) to authorized users throughout the Department of Defense (DoD).</code>                                                                                                                                                                                                    | <code>An electronic document management system (EDMS) is a software system for organizing and storing different kinds of documents. This type of system is a more particular kind of document management system, a more general type of storage system that helps users to organize and store paper or digital documents.</code>                                                                                                                                                                                                                    | <code>In many cases, the specific documentation for original storage protocols is a major part of what makes an electronic document management system so valuable to a business or organization.</code>                                                                                                                                                                                                                                                                                                                                                                       | <code>Benefits derived from DoD EDA include: 1  Single-source, timely information. 2  Electronic search and retrieval – 24/7 access/retrieval capability. 3  Increased visibility of all procurement & payment actions.  Reduction in data entry/human 1  error. Lower postage, handling, retention and document management costs.</code>                                                                                                                                                                                                                                                                                                                                                                                                 | <code>If YES, go to www.docusign.net and log in with your email and password. On the DocuSign Web Application, select the Documents tab. Your documents are listed there. If NO, you can access the document by opening the DocuSign Completed email. This email is sent to you once you have finished signing a DocuSign document. See the instructions below. Note: In some cases, your documents might be attached to the Completed email. 1. Open the DocuSign Completed email.</code>                 | <code>[4.681269645690918, 9.322907447814941, 14.813400268554688, 20.356698989868164]</code> |
-  | <code>does hpv cause uti</code>                  | <code>So now you get in the acidic environment can hpv cause urinary tract infection for the area of the blockage of the fruits and fiber as a completely eliminate urinate at all. Spending money on prescription of antibiotics will kill all of the bacterial infection keeps happening to your veterinarian will work to cure the condition.</code>                                                                                                                                                                                                         | <code>HPV & Urinary Tract Infections. Human Papillomavirus (HPV) is a group of viruses that can cause warts and cancers of the cervix, anus and genitals. Urinary tract infection (UTI) occurs when bacteria multiply within the bladder, causing pain and urinary urgency. (Thomas Northcut/Digital Vision/Getty Images) Other People Are Reading.</code>                                                                                                                                                                                          | <code>Some types of the HPV virus can infect the genital epithelial cells (skin and mucous membranes). Some types of HPV virus cause warts that appear on the genitals (vagina, vulva, penis, etc.) and anus of women and men.</code>                                                                                                                                                                                                                                                                                                                                         | <code>Most women with HPV have no signs of infection. Since most HPV infections go away on their own within two years, many women never know they had an infection. Some HPV infections cause genital warts that can be seen or felt. The only way to know if you have HPV is to ask your health care provider to do an HPV test.</code>                                                                                                                                                                                                                                                                                                                                                                                                  | <code>Genital warts are caused by low-risk types of human papillomavirus (HPV). These viruses may not cause warts in everyone. Women can get genital warts from sexual contact with someone who has HPV. Genital warts are spread by skin-to-skin contact, usually from contact with the warts. It can be spread by vaginal, anal, oral, or handgenital sexual contact. Genital warts will spread HPV while visible, and after recent treatment.</code>                                                    | <code>[0.5, 2.4958395957946777, 3.76273775100708, 4.114340305328369]</code>                 |
 * Loss: [<code>SpladeLoss</code>](https://sbert.net/docs/package_reference/sparse_encoder/losses.html#spladeloss) with these parameters:
   ```json
   {
       "loss": "SparseMarginMSELoss",
-      "document_regularizer_weight": 0.3,
-      "query_regularizer_weight": 0.5
   }
   ```
@@ -334,15 +289,15 @@ SparseEncoder(
 #### Non-Default Hyperparameters
 - `eval_strategy`: epoch
-- `per_device_train_batch_size`: 48
-- `per_device_eval_batch_size`: 48
-- `learning_rate`: 8e-05
-- `num_train_epochs`: 6
 - `lr_scheduler_type`: cosine
-- `warmup_ratio`: 0.025
 - `fp16`: True
 - `load_best_model_at_end`: True
 - `optim`: adamw_torch_fused
 #### All Hyperparameters
 <details><summary>Click to expand</summary>
@@ -351,24 +306,24 @@ SparseEncoder(
 - `do_predict`: False
 - `eval_strategy`: epoch
 - `prediction_loss_only`: True
-- `per_device_train_batch_size`: 48
-- `per_device_eval_batch_size`: 48
 - `per_gpu_train_batch_size`: None
 - `per_gpu_eval_batch_size`: None
 - `gradient_accumulation_steps`: 1
 - `eval_accumulation_steps`: None
 - `torch_empty_cache_steps`: None
-- `learning_rate`: 8e-05
 - `weight_decay`: 0.0
 - `adam_beta1`: 0.9
 - `adam_beta2`: 0.999
 - `adam_epsilon`: 1e-08
 - `max_grad_norm`: 1.0
-- `num_train_epochs`: 6
 - `max_steps`: -1
 - `lr_scheduler_type`: cosine
 - `lr_scheduler_kwargs`: {}
-- `warmup_ratio`: 0.025
 - `warmup_steps`: 0
 - `log_level`: passive
 - `log_level_replica`: warning
@@ -425,7 +380,7 @@ SparseEncoder(
 - `dataloader_persistent_workers`: False
 - `skip_memory_metrics`: True
 - `use_legacy_prediction_loop`: False
-- `push_to_hub`: False
 - `resume_from_checkpoint`: None
 - `hub_model_id`: None
 - `hub_strategy`: every_save
@@ -468,21 +423,20 @@ SparseEncoder(
 </details>
 ### Training Logs
-| Epoch   | Step      | Training Loss | dot_ndcg@10 |
-|:-------:|:---------:|:-------------:|:-----------:|
-| 1.0     | 10336     | 16309.8824    | 0.6698      |
-| 2.0     | 20672     | 14.4047       | 0.6920      |
-| 3.0     | 31008     | 13.0742       | 0.7004      |
-| 4.0     | 41344     | 11.8023       | 0.7060      |
-| 5.0     | 51680     | 11.0464       | 0.7085      |
-| **6.0** | **62016** | **10.6766**   | **0.7094**  |
 * The bold row denotes the saved checkpoint.
 ### Framework Versions
 - Python: 3.11.13
 - Sentence Transformers: 5.0.0
-- Transformers: 4.53.2
 - PyTorch: 2.6.0+cu124
 - Accelerate: 1.8.1
 - Datasets: 4.0.0
@@ -556,6 +510,4 @@ SparseEncoder(
 ## Model Card Contact
 *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
--->
-</details>

 ---
 tags:
 - sentence-transformers
 - sparse-encoder
 - sparse
 - splade
 - generated_from_trainer
+- dataset_size:1200000
 - loss:SpladeLoss
 - loss:SparseMarginMSELoss
 - loss:FlopsLoss
+base_model: yosefw/SPLADE-BERT-Tiny-BS256
 widget:
+- text: 'Most Referenced:report - Return to the USDOJ/OIG Home Page - US Department
+    of JusticeReturn to the USDOJ/OIG Home Page - US Department of Justice. Opinion:Roberts:
+    Feds to stop using private prisons.'
+- text: 'Paul O''Neill, the founder of the Trans-Siberian Orchestra (pictured) has
+    died at age 61. Paul O''Neill, the founder of the popular Christmas-themed rock
+    ensemble Trans-Siberian Orchestra has died. A statement on the group''s Facebook
+    page reads: The entire Trans-Siberian Orchestra family, past and present, is heartbroken
+    to share the devastating news that Paul O’Neill has passed away from chronic illness.'
+- text: meaning for concern
+- text: 'Additional Tips. 1  Do not rub the ink stains as it can spread the stains
+    further. 2  Make sure you test the cleaning solution on a small, hidden area to
+    check if it is suitable for the material. 3  In case an ink stain has become old
+    and dried, the above mentioned home remedies may not be effective.arpet: For ink
+    stained spots on a carpet, you may apply a paste of cornstarch and milk. Leave
+    it for a few hours before brushing it off. Finally, clean the residue with a vacuum
+    cleaner. Leather: Try using a leather shampoo or a leather ink remover for removing
+    ink stains from leather items.'
+- text: 'See below: 1. Get your marriage license. Before you can change your name,
+    you''ll need the original (or certified) marriage license with the raised seal
+    and your new last name on it. Call the clerk''s office where your license was
+    filed to get copies if one wasn''t automatically sent to you. 2. Change your Social
+    Security card.'
 pipeline_tag: feature-extraction
 library_name: sentence-transformers
 metrics:
 - corpus_active_dims
 - corpus_sparsity_ratio
 model-index:
+- name: SPLADE Sparse Encoder
   results:
   - task:
       type: sparse-information-retrieval
       type: unknown
     metrics:
     - type: dot_accuracy@1
+      value: 0.4772
       name: Dot Accuracy@1
     - type: dot_accuracy@3
+      value: 0.793
       name: Dot Accuracy@3
     - type: dot_accuracy@5
+      value: 0.8964
       name: Dot Accuracy@5
     - type: dot_accuracy@10
+      value: 0.96
       name: Dot Accuracy@10
     - type: dot_precision@1
+      value: 0.4772
       name: Dot Precision@1
     - type: dot_precision@3
+      value: 0.2713333333333333
       name: Dot Precision@3
     - type: dot_precision@5
+      value: 0.18644000000000002
       name: Dot Precision@5
     - type: dot_precision@10
+      value: 0.10094000000000002
       name: Dot Precision@10
     - type: dot_recall@1
+      value: 0.4616666666666666
       name: Dot Recall@1
     - type: dot_recall@3
+      value: 0.7798833333333334
       name: Dot Recall@3
     - type: dot_recall@5
+      value: 0.8874
       name: Dot Recall@5
     - type: dot_recall@10
+      value: 0.95595
       name: Dot Recall@10
     - type: dot_ndcg@10
+      value: 0.721747648718731
       name: Dot Ndcg@10
     - type: dot_mrr@10
+      value: 0.6489996031746051
       name: Dot Mrr@10
     - type: dot_map@100
+      value: 0.6446961471449598
       name: Dot Map@100
     - type: query_active_dims
+      value: 18.334199905395508
       name: Query Active Dims
     - type: query_sparsity_ratio
+      value: 0.9993993119747921
       name: Query Sparsity Ratio
     - type: corpus_active_dims
+      value: 121.65303042911474
       name: Corpus Active Dims
     - type: corpus_sparsity_ratio
+      value: 0.9960142510179831
       name: Corpus Sparsity Ratio
 ---
+# SPLADE Sparse Encoder
+This is a [SPLADE Sparse Encoder](https://www.sbert.net/docs/sparse_encoder/usage/usage.html) model finetuned from [yosefw/SPLADE-BERT-Tiny-BS256](https://huggingface.co/yosefw/SPLADE-BERT-Tiny-BS256) using the [sentence-transformers](https://www.SBERT.net) library. It maps sentences & paragraphs to a 30522-dimensional sparse vector space   and can be used for semantic search and sparse retrieval.
+## Model Details
+### Model Description
+- **Model Type:** SPLADE Sparse Encoder
+- **Base model:** [yosefw/SPLADE-BERT-Tiny-BS256](https://huggingface.co/yosefw/SPLADE-BERT-Tiny-BS256) <!-- at revision 239bb34bbfcf6cc8b465eb5b94c76a20c574b47f -->
+- **Maximum Sequence Length:** 512 tokens
+- **Output Dimensionality:** 30522 dimensions
+- **Similarity Function:** Dot Product
+<!-- - **Training Dataset:** Unknown -->
+<!-- - **Language:** Unknown -->
+<!-- - **License:** Unknown -->
+### Model Sources
+- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
+- **Documentation:** [Sparse Encoder Documentation](https://www.sbert.net/docs/sparse_encoder/usage/usage.html)
+- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
+- **Hugging Face:** [Sparse Encoders on Hugging Face](https://huggingface.co/models?library=sentence-transformers&other=sparse-encoder)
+### Full Model Architecture
+```
+SparseEncoder(
+  (0): MLMTransformer({'max_seq_length': 512, 'do_lower_case': False, 'architecture': 'BertForMaskedLM'})
+  (1): SpladePooling({'pooling_strategy': 'max', 'activation_function': 'relu', 'word_embedding_dimension': 30522})
+)
+```
 ## Usage
 from sentence_transformers import SparseEncoder
 # Download from the 🤗 Hub
+model = SparseEncoder("yosefw/SPLADE-BERT-Tiny-BS256-distil-v3")
 # Run inference
 queries = [
+    "what do i need to change my name on my license in ma",
 ]
 documents = [
+    'Change your name on MA state-issued ID such as driver’s license or MA ID card. All documents you bring to RMV need to be originals or certified copies by the issuing agency. PAPERWORK NEEDED: Proof of legal name change — A court order showing your legal name change. Your Social Security Card with your new legal name change',
+    "See below: 1. Get your marriage license. Before you can change your name, you'll need the original (or certified) marriage license with the raised seal and your new last name on it. Call the clerk's office where your license was filed to get copies if one wasn't automatically sent to you. 2. Change your Social Security card.",
+    "You'll keep the same number—just your name will be different. Mail in your application to the local Social Security Administration office. You should get your new card within 10 business days. 3. Change your license at the DMV. Take a trip to the local Department of Motor Vehicles office to get a new license with your new last name. Bring every form of identification you can get your hands on—your old license, your certified marriage certificate and, most importantly, your new Social Security card.",
 ]
 query_embeddings = model.encode_query(queries)
 document_embeddings = model.encode_document(documents)
 # Get the similarity scores for the embeddings
 similarities = model.similarity(query_embeddings, document_embeddings)
 print(similarities)
+# tensor([[16.6297, 13.4552, 10.1923]])
 ```
 <!--
 *List how the model may foreseeably be misused and address what users ought not to do with the model.*
 -->
 ## Evaluation
 ### Metrics
 | Metric                | Value      |
 |:----------------------|:-----------|
+| dot_accuracy@1        | 0.4772     |
+| dot_accuracy@3        | 0.793      |
+| dot_accuracy@5        | 0.8964     |
+| dot_accuracy@10       | 0.96       |
+| dot_precision@1       | 0.4772     |
+| dot_precision@3       | 0.2713     |
+| dot_precision@5       | 0.1864     |
+| dot_precision@10      | 0.1009     |
+| dot_recall@1          | 0.4617     |
+| dot_recall@3          | 0.7799     |
+| dot_recall@5          | 0.8874     |
+| dot_recall@10         | 0.9559     |
+| **dot_ndcg@10**       | **0.7217** |
+| dot_mrr@10            | 0.649      |
+| dot_map@100           | 0.6447     |
+| query_active_dims     | 18.3342    |
+| query_sparsity_ratio  | 0.9994     |
+| corpus_active_dims    | 121.653    |
+| corpus_sparsity_ratio | 0.996      |
 <!--
 ## Bias, Risks and Limitations
 #### Unnamed Dataset
+* Size: 1,200,000 training samples
+* Columns: <code>query</code>, <code>positive</code>, <code>negative_1</code>, <code>negative_2</code>, and <code>label</code>
 * Approximate statistics based on the first 1000 samples:
+  |         | query                                                                            | positive                                                                            | negative_1                                                                          | negative_2                                                                          | label                              |
+  |:--------|:---------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|:-----------------------------------|
+  | type    | string                                                                           | string                                                                              | string                                                                              | string                                                                              | list                               |
+  | details | <ul><li>min: 4 tokens</li><li>mean: 9.08 tokens</li><li>max: 35 tokens</li></ul> | <ul><li>min: 23 tokens</li><li>mean: 79.02 tokens</li><li>max: 192 tokens</li></ul> | <ul><li>min: 18 tokens</li><li>mean: 78.24 tokens</li><li>max: 230 tokens</li></ul> | <ul><li>min: 13 tokens</li><li>mean: 75.26 tokens</li><li>max: 230 tokens</li></ul> | <ul><li>size: 2 elements</li></ul> |
 * Samples:
+  | query                                                           | positive                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | negative_1                                                                                                                                                                                                                                                                                                                                                                                                                                  | negative_2                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   | label                                                |
+  |:----------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------|
+  | <code>does alzheimer's affect sleep</code>                      | <code>People with Alzheimer’s disease go through many changes, and sleep problems are often some of the most noticeable. Most adults have changes in their sleep patterns as they age. But the problems are more severe and happen more often for people with Alzheimer’s.</code>                                                                                                                                                                                                                                                                                                                                                                                                                                                        | <code>Could the position you SLEEP in affect your risk of Alzheimer's? People who sleep on their side enable their brain to 'detox' better while they rest. While asleep, brain is hard at work removing toxins that build up in the day. If left to build up, these toxins can cause Alzheimer's and Parkinson's.</code>                                                                                                                   | <code>The Scary Connection Between Snoring and Dementia. For more, visit TIME Health. If you don't snore, you likely know someone who does. Between 19% and 40% of adults snore when they sleep, and that percentage climbs even higher, particularly for men, as we age.</code>                                                                                                                                                                                                                                                                             | <code>[1.407266616821289, 10.169305801391602]</code> |
+  | <code>what is fy in steel design</code>                         | <code>Since the yield strength of the steel is quite clearly defined and controlled, this establishes a very precise reference in structural investigations. An early design decision is that for the yield strength (specified by the Grade of steel used) that is to be used in the design work.Several different grades of steel may be used for large projects, with a minimum grade for ordinary tasks and higher grades for more demanding ones.ost steel used for reinforcement is highly ductile in nature. Its usable strength is its yield strength, as this stress condition initiates such a magnitude of deformation (into the plastic yielding range of the steel), that major cracking will occur in the concrete.</code> | <code>fy is the yield point of the material. E is the symbol for Young's Modulus of the material. E can be measured by dividing the elastic stress by the elastic strain.That is, this measurement must be made before the yield point of the material is reached.y is the yield point of the material. E is the symbol for Young's Modulus of the material. E can be measured by dividing the elastic stress by the elastic strain.</code> | <code>The longest dimension of the cant. WT is 13'. Using ASTM A992 carbon steel, a WT9x35.5 is at full bending stress and deflection limits. (Fy = 50 ksi). The only information I've found about using stainless for structural design is that type 304 is usually used.This yield strength (Fy) is only equal to 39 or 42ksi.sing ASTM A992 carbon steel, a WT9x35.5 is at full bending stress and deflection limits. (Fy = 50 ksi). The only information I've found about using stainless for structural design is that type 304 is usually used.</code> | <code>[0.5, 0.5]</code>                              |
+  | <code>most common nutritional deficiencies for teenagers</code> | <code>: Appendix B: Vitamin and Mineral Deficiencies in the U.S. Some American adults get too little vitamin D, vitamin E, magnesium, calcium, vitamin A and vitamin C (Table B1). More than 40 percent of adults have dietary intakes of vitamin A, C, D and E, calcium and magnesium below the average requirement for their age and gender. Inadequate intake of vitamins and minerals is most common among 14-to-18-year-old teenagers. Adolescent girls have lower nutrient intake than boys (Berner 2014; Fulgoni 2011). But nutrient deficiencies are rare among younger American children; the exceptions are dietary vitamin D and E, for which intake is low for all Americans, and calcium.</code>                            | <code>Common Nutritional Deficiencies. 10 Most Common Nutritional Deficiencies.. Calcium. Calcium is one of the most abundant minerals in your body, yet most people still manage to have a calcium deficiency. Calcium is best know for adding strength to your bones and teeth.</code>                                                                                                                                                    | <code>1) Vitamin D–Vitamin D deficiency is common in infants born to mothers with low levels of Vitamin D. Severe deficiency of this nutrient in infancy and early childhood can lead to the development of Rickets, a disease that affects bone formation and causes bow-legs.</code>                                                                                                                                                                                                                                                                       | <code>[3.182860851287842, 7.834665775299072]</code>  |
 * Loss: [<code>SpladeLoss</code>](https://sbert.net/docs/package_reference/sparse_encoder/losses.html#spladeloss) with these parameters:
   ```json
   {
       "loss": "SparseMarginMSELoss",
+      "document_regularizer_weight": 0.2,
+      "query_regularizer_weight": 0.3
   }
   ```
 #### Non-Default Hyperparameters
 - `eval_strategy`: epoch
+- `per_device_train_batch_size`: 32
+- `per_device_eval_batch_size`: 32
+- `num_train_epochs`: 5
 - `lr_scheduler_type`: cosine
+- `warmup_ratio`: 0.05
 - `fp16`: True
 - `load_best_model_at_end`: True
 - `optim`: adamw_torch_fused
+- `push_to_hub`: True
 #### All Hyperparameters
 <details><summary>Click to expand</summary>
 - `do_predict`: False
 - `eval_strategy`: epoch
 - `prediction_loss_only`: True
+- `per_device_train_batch_size`: 32
+- `per_device_eval_batch_size`: 32
 - `per_gpu_train_batch_size`: None
 - `per_gpu_eval_batch_size`: None
 - `gradient_accumulation_steps`: 1
 - `eval_accumulation_steps`: None
 - `torch_empty_cache_steps`: None
+- `learning_rate`: 5e-05
 - `weight_decay`: 0.0
 - `adam_beta1`: 0.9
 - `adam_beta2`: 0.999
 - `adam_epsilon`: 1e-08
 - `max_grad_norm`: 1.0
+- `num_train_epochs`: 5
 - `max_steps`: -1
 - `lr_scheduler_type`: cosine
 - `lr_scheduler_kwargs`: {}
+- `warmup_ratio`: 0.05
 - `warmup_steps`: 0
 - `log_level`: passive
 - `log_level_replica`: warning
 - `dataloader_persistent_workers`: False
 - `skip_memory_metrics`: True
 - `use_legacy_prediction_loop`: False
+- `push_to_hub`: True
 - `resume_from_checkpoint`: None
 - `hub_model_id`: None
 - `hub_strategy`: every_save
 </details>
 ### Training Logs
+| Epoch   | Step       | Training Loss | dot_ndcg@10 |
+|:-------:|:----------:|:-------------:|:-----------:|
+| 1.0     | 37500      | 11.4095       | 0.7103      |
+| 2.0     | 75000      | 10.5305       | 0.7139      |
+| 3.0     | 112500     | 9.5368        | 0.7197      |
+| **4.0** | **150000** | **8.717**     | **0.7216**  |
+| 5.0     | 187500     | 8.3094        | 0.7217      |
 * The bold row denotes the saved checkpoint.
 ### Framework Versions
 - Python: 3.11.13
 - Sentence Transformers: 5.0.0
+- Transformers: 4.54.0
 - PyTorch: 2.6.0+cu124
 - Accelerate: 1.8.1
 - Datasets: 4.0.0
 ## Model Card Contact
 *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
+-->

config.json CHANGED Viewed

@@ -17,7 +17,7 @@
   "pad_token_id": 0,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
-  "transformers_version": "4.53.2",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 30522

   "pad_token_id": 0,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
+  "transformers_version": "4.54.0",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 30522

config_sentence_transformers.json CHANGED Viewed

@@ -2,7 +2,7 @@
   "model_type": "SparseEncoder",
   "__version__": {
     "sentence_transformers": "5.0.0",
-    "transformers": "4.53.2",
     "pytorch": "2.6.0+cu124"
   },
   "prompts": {

   "model_type": "SparseEncoder",
   "__version__": {
     "sentence_transformers": "5.0.0",
+    "transformers": "4.54.0",
     "pytorch": "2.6.0+cu124"
   },
   "prompts": {

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c5578e5c58d8ff1c071f9ef9a555c2694c08a5b4c196697e4e199218dcc64ff0
 size 17671560

 version https://git-lfs.github.com/spec/v1
+oid sha256:5c31a966edf1180fdc74e1e2056564d353711ec001c395cc9bd00d5368ea38ee
 size 17671560