Sao10K / L3-8B-Stheno-v3.2


Sao10K committed on Jun 5, 2024

Commit 1520110 · verified · 1 Parent(s): 2b2b409

Create README.md

Files changed (1)

README.md +33 -0

README.md ADDED

	@@ -0,0 +1,33 @@

+---
+license: cc-by-nc-4.0
+language:
+- en
+datasets:
+- Gryphe/Opus-WritingPrompts
+- Sao10K/Claude-3-Opus-Instruct-15K
+- Sao10K/Short-Storygen-v2
+- Sao10K/c2-Logs-Filtered
+---
+Stheno-v3.2-Zeta
+I tested multiple variations of the model, merged back into the base at various weights and across different training runs, and this sixth iteration is the one I like most.
+Changes compared to v3.1
+<br>\- Included a mix of SFW and NSFW Storywriting Data, thanks to [Gryphe](https://huggingface.co/datasets/Gryphe/Opus-WritingPrompts)
+<br>\- Included More Instruct / Assistant-Style Data
+<br>\- Further cleaned up Roleplaying Samples from c2 Logs -> A few really bad samples escaped heavy filtering. A manual pass removed them.
+<br>\- Hyperparameter tinkering for training, resulting in lower loss levels.
+Testing Notes - Compared to v3.1
+<br>\- Handles SFW / NSFW separately better. Not as excessive with NSFW now.
+<br>\- Better at Storywriting / Narration.
+<br>\- Better at Assistant-type Tasks.
+<br>\- Better Multi-Turn Coherency -> Reduced Issues?
+<br>\- Slightly less creative? A worthy tradeoff. Still creative.
+<br>\- Better prompt / instruction adherence.
+---