vincenzodentamaro/wersa

Transformers
English
wersa
causal-lm
qwen
long-context
wersa
90.4 kB
  • 2 contributors
History: 12 commits
Latest commit: e2b3703 (verified) by vincenzodentamaro, "Update wersa/modeling_wersa.py", about 2 months ago
  • wersa/: Update wersa/modeling_wersa.py, about 2 months ago
  • .gitattributes (1.52 kB): initial commit, 2 months ago
  • README.md (4.47 kB): Update README.md, 2 months ago
  • setup.py (565 Bytes): Initial release of WERSA source code, 2 months ago
  • train_and_generate_0.6b.py (5.05 kB): Create train_and_generate_0.6b.py, 2 months ago
  • train_and_generate_8b.py (5.02 kB): Create train_and_generate_8b.py, 2 months ago
  • train_wersa_single_gpu.py (18.2 kB): Upload 2 files, about 2 months ago
  • watch_and_test.py (14.2 kB): Upload 2 files, about 2 months ago