Goes off the rails

#22

by Hypersniper - opened Jan 28

Jan 28

Not a bad release! Needs some more training for sure. After a minute or so of conversation it starts to degrade in quality. Maybe more turn based conversations are needed for the dataset?

MeghanaKap

Jan 28

•

edited Jan 28

Could u share your inference script

royrajarshi

NVIDIA org Jan 28

Right. Its focus was to showcase naturalness, and basic instruction following, voice prompting. And it has a 2048 token window and only on minimal SFT data. Future versions will have proper post-training/alignment.

royrajarshi changed discussion status to closed Jan 28

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment