rl-llm-agent
/

Llama-3.2-3B-Instruct-reward-alfworld-iqlearn-iter1

Model card Files Files and versions

Llama-3.2-3B-Instruct-reward-alfworld-iqlearn-iter1

14.4 GB

1 contributor

History: 4 commits

SFconvertbot's picture

Adding `safetensors` variant of this model

1d57640 verified 10 months ago

.gitattributes

1.57 kB

upload checkpoint 10 months ago
config.json

965 Bytes

upload checkpoint 10 months ago
generation_config.json

184 Bytes

upload checkpoint 10 months ago
pytorch_model-00001-of-00002.bin
Detected Pickle imports (3)
- "collections.OrderedDict",
- "torch._utils._rebuild_tensor_v2",
- "torch.BFloat16Storage"
What is a pickle import?
4.97 GB
xet

upload checkpoint 10 months ago
pytorch_model-00001-of-00002.safetensors

4.97 GB
xet

Adding `safetensors` variant of this model 10 months ago
pytorch_model-00002-of-00002.bin
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.BFloat16Storage",
- "collections.OrderedDict"
What is a pickle import?
2.25 GB
xet

upload checkpoint 10 months ago
pytorch_model-00002-of-00002.safetensors

2.25 GB
xet

Adding `safetensors` variant of this model 10 months ago
pytorch_model.bin.index.json

21 kB

upload checkpoint 10 months ago
special_tokens_map.json

439 Bytes

upload checkpoint 10 months ago
tokenizer.json

17.2 MB
xet

upload checkpoint 10 months ago
tokenizer_config.json

54.7 kB

upload checkpoint 10 months ago