jnjj commited on
Commit
832ea2d
·
verified ·
1 Parent(s): 8270f31

Update README.md via script

Browse files
Files changed (1) hide show
  1. README.md +6 -6
README.md CHANGED
@@ -23,14 +23,14 @@ The fully merged model weights and tokenizer are updated periodically at the roo
23
  - **Dynamic Dataset Source:** The script iterates through a wide array of Hugging Face Hub datasets.
24
  - **Rapid Iteration Strategy:** Training per dataset configuration is brief (`max_steps=1`), prioritizing breadth of exposure over depth on any single dataset.
25
  ## Training Progress
26
- - **Datasets Processed (Successfully trained on at least one config):** 11
27
- - **Text Examples Streamed (Total):** 66
28
- - **Tokens Processed (Total):** 33792
29
- - **Last Successful Model Update:** 2025-05-08 15:53:23 UTC
30
  ### Evaluation Snapshot (Approximate)
31
 
32
- - **Current Perplexity (wikitext Subset):** 283.21
33
- - **Perplexity Change:** `-0.22` ⬇️ (vs previous cycle's perplexity)
34
 
35
  #### Generated Examples (Qualitative Assessment)
36
 
 
23
  - **Dynamic Dataset Source:** The script iterates through a wide array of Hugging Face Hub datasets.
24
  - **Rapid Iteration Strategy:** Training per dataset configuration is brief (`max_steps=1`), prioritizing breadth of exposure over depth on any single dataset.
25
  ## Training Progress
26
+ - **Datasets Processed (Successfully trained on at least one config):** 1
27
+ - **Text Examples Streamed (Total):** 6
28
+ - **Tokens Processed (Total):** 3072
29
+ - **Last Successful Model Update:** 2025-05-08 18:02:08 UTC
30
  ### Evaluation Snapshot (Approximate)
31
 
32
+ - **Current Perplexity (wikitext Subset):** 282.70
33
+ - **Perplexity Change:** `-0.51` ⬇️ (vs previous cycle's perplexity)
34
 
35
  #### Generated Examples (Qualitative Assessment)
36