Too BIG! max_model_length in tokenizer_config.json
#1
by Hambaobao - opened
max_model_length in tokenizer_config.json seems wrong. Why is it so big, I see max_model_length in other reps is only 2048.
@Hambaobao yeah I had this issue too when I wrote a script to pad my datasets in max_model_length, boy that was insane to have 1000000000000000019884624838656 length. I made a pull request in https://huggingface.co/NousResearch/Llama-2-7b-hf/discussions/2/files to make it 4k
It doesn't actually matter so long as its 4096 or greater, but it is confusing. Updating
teknium changed discussion status to closed