Spaces:
Sleeping
Sleeping
docs : fix VAD section heading levels (#3186)
Browse files
README.md
CHANGED
|
@@ -733,7 +733,7 @@ let package = Package(
|
|
| 733 |
)
|
| 734 |
```
|
| 735 |
|
| 736 |
-
|
| 737 |
Support for Voice Activity Detection (VAD) can be enabled using the `--vad`
|
| 738 |
argument to `whisper-cli`. In addition to this option a VAD model is also
|
| 739 |
required.
|
|
@@ -747,7 +747,7 @@ transcription process.
|
|
| 747 |
|
| 748 |
The following VAD models are currently supported:
|
| 749 |
|
| 750 |
-
|
| 751 |
[Silero-vad](https://github.com/snakers4/silero-vad) is a lightweight VAD model
|
| 752 |
written in Python that is fast and accurate.
|
| 753 |
|
|
@@ -792,7 +792,7 @@ $ ./build/bin/whisper-cli \
|
|
| 792 |
--vad-model ./models/silero-v5.1.2-ggml.bin
|
| 793 |
```
|
| 794 |
|
| 795 |
-
|
| 796 |
|
| 797 |
* --vad-threshold: Threshold probability for speech detection. A probability
|
| 798 |
for a speech segment/frame above this threshold will be considered as speech.
|
|
|
|
| 733 |
)
|
| 734 |
```
|
| 735 |
|
| 736 |
+
## Voice Activity Detection (VAD)
|
| 737 |
Support for Voice Activity Detection (VAD) can be enabled using the `--vad`
|
| 738 |
argument to `whisper-cli`. In addition to this option a VAD model is also
|
| 739 |
required.
|
|
|
|
| 747 |
|
| 748 |
The following VAD models are currently supported:
|
| 749 |
|
| 750 |
+
### Silero-VAD
|
| 751 |
[Silero-vad](https://github.com/snakers4/silero-vad) is a lightweight VAD model
|
| 752 |
written in Python that is fast and accurate.
|
| 753 |
|
|
|
|
| 792 |
--vad-model ./models/silero-v5.1.2-ggml.bin
|
| 793 |
```
|
| 794 |
|
| 795 |
+
### VAD Options
|
| 796 |
|
| 797 |
* --vad-threshold: Threshold probability for speech detection. A probability
|
| 798 |
for a speech segment/frame above this threshold will be considered as speech.
|