s3prl/superb
Viewer • Updated • 304k • 1.62k • 33
How to use jialicheng/whisper-base-speech_commands with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("audio-classification", model="jialicheng/whisper-base-speech_commands") # Load model directly
from transformers import AutoProcessor, AutoModelForAudioClassification
processor = AutoProcessor.from_pretrained("jialicheng/whisper-base-speech_commands")
model = AutoModelForAudioClassification.from_pretrained("jialicheng/whisper-base-speech_commands")# Load model directly
from transformers import AutoProcessor, AutoModelForAudioClassification
processor = AutoProcessor.from_pretrained("jialicheng/whisper-base-speech_commands")
model = AutoModelForAudioClassification.from_pretrained("jialicheng/whisper-base-speech_commands")This model is a fine-tuned version of openai/whisper-base on the superb dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|---|---|---|---|---|
| 0.6909 | 1.0 | 1597 | 0.1572 | 0.9651 |
| 0.0891 | 2.0 | 3194 | 0.1597 | 0.9660 |
| 0.0676 | 3.0 | 4791 | 0.1304 | 0.9719 |
| 0.0475 | 4.0 | 6388 | 0.0999 | 0.9796 |
| 0.0433 | 5.0 | 7985 | 0.1079 | 0.9798 |
| 0.0284 | 6.0 | 9582 | 0.1089 | 0.9803 |
| 0.0236 | 7.0 | 11179 | 0.1162 | 0.9819 |
| 0.0193 | 8.0 | 12776 | 0.1152 | 0.9834 |
| 0.0111 | 9.0 | 14373 | 0.1272 | 0.9821 |
| 0.0088 | 10.0 | 15970 | 0.1306 | 0.9826 |
Base model
openai/whisper-base
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("audio-classification", model="jialicheng/whisper-base-speech_commands")