allenai/swag
How to use thesven/distilbert-swag with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForMultipleChoice
tokenizer = AutoTokenizer.from_pretrained("thesven/distilbert-swag")
model = AutoModelForMultipleChoice.from_pretrained("thesven/distilbert-swag")

This model is a fine-tuned version of distilbert-base-uncased on the SWAG dataset. Evaluation loss and accuracy after each epoch are listed in the training results table below.
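The loading snippet above stops before running a prediction. The following is a minimal sketch of how the checkpoint might be used to pick the most plausible ending for a context, assuming the standard Transformers multiple-choice input layout of `(batch_size, num_choices, seq_len)`. The helper names `build_choice_pairs` and `predict_ending` are hypothetical and not part of the model card or the library.

```python
def build_choice_pairs(context, endings):
    """Repeat the shared context once per candidate ending.

    Returns two parallel lists that can be passed to a tokenizer
    as (first sentence, second sentence) pairs.
    """
    return [context] * len(endings), list(endings)


def predict_ending(context, endings, model_id="thesven/distilbert-swag"):
    """Return the index of the ending the model scores highest.

    Hypothetical helper: downloads the checkpoint on first use.
    """
    # Heavy imports stay local so build_choice_pairs has no dependencies.
    import torch
    from transformers import AutoTokenizer, AutoModelForMultipleChoice

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForMultipleChoice.from_pretrained(model_id)
    model.eval()

    first, second = build_choice_pairs(context, endings)
    encoded = tokenizer(
        first, second, return_tensors="pt", padding=True, truncation=True
    )
    # The tokenizer yields (num_choices, seq_len); add a batch dimension
    # so each tensor becomes (1, num_choices, seq_len).
    inputs = {name: tensor.unsqueeze(0) for name, tensor in encoded.items()}

    with torch.no_grad():
        logits = model(**inputs).logits  # shape: (1, num_choices)
    return int(logits.argmax(dim=-1).item())
```

Calling `predict_ending("She opens the oven door.", ["She takes out the tray.", "She dives into the pool."])` would return the position of the preferred ending in the list. The sentences here are illustrative and not taken from SWAG.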
More information needed
The training hyperparameters are not listed here. Training results by epoch:
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|---|---|---|---|---|
| 0.9186 | 1.0 | 2298 | 0.7658 | 0.6950 |
| 0.5843 | 2.0 | 4597 | 0.7453 | 0.7059 |
| 0.3548 | 3.0 | 6894 | 0.9079 | 0.7105 |
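The table shows validation loss and accuracy moving in different directions in the final epoch, which may indicate the onset of overfitting. As a rough sketch, the rows can be compared programmatically to see which epoch is best under each metric. The dictionary keys below are illustrative names, and the values are copied from the table.

```python
# Rows copied from the training results table above.
rows = [
    {"epoch": 1, "step": 2298, "train_loss": 0.9186, "val_loss": 0.7658, "accuracy": 0.6950},
    {"epoch": 2, "step": 4597, "train_loss": 0.5843, "val_loss": 0.7453, "accuracy": 0.7059},
    {"epoch": 3, "step": 6894, "train_loss": 0.3548, "val_loss": 0.9079, "accuracy": 0.7105},
]

# Pick the best epoch under each criterion.
best_by_loss = min(rows, key=lambda row: row["val_loss"])
best_by_accuracy = max(rows, key=lambda row: row["accuracy"])

print("lowest validation loss: epoch", best_by_loss["epoch"])    # epoch 2
print("highest accuracy: epoch", best_by_accuracy["epoch"])      # epoch 3
```

Which checkpoint to prefer depends on the use case: accuracy is the task metric, while the lower validation loss at epoch 2 suggests better-calibrated scores.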
Base model
distilbert/distilbert-base-uncased