Update README.md
Browse files
README.md
CHANGED
|
@@ -2,7 +2,7 @@
|
|
| 2 |
license: apache-2.0
|
| 3 |
---
|
| 4 |
# SALMONN: Speech Audio Language Music Open Neural Network
|
| 5 |
-
<div align=center><img src="
|
| 6 |
|
| 7 |
Welcome to the repo of **SALMONN**!
|
| 8 |
|
|
@@ -19,7 +19,7 @@ We will open source the code and the model checkpoint soon. Stay tuned!
|
|
| 19 |
|
| 20 |
SALMONN adopts a speech & audio encoder to encode generic audio representation, then uses an audio-text aligner to map the audio feature into textual space. Finally, the large language model answers based on the textual prompt and the auditory tokens.
|
| 21 |
|
| 22 |
-
<div align=center><img src="
|
| 23 |
|
| 24 |
## Demos
|
| 25 |
|
|
|
|
| 2 |
license: apache-2.0
|
| 3 |
---
|
| 4 |
# SALMONN: Speech Audio Language Music Open Neural Network
|
| 5 |
+
<div align=center><img src="https://cdn-uploads.huggingface.co/production/uploads/63770389cdcc1bf630870758/sr9ABG_rv6P-VgesTMhOC.png" height="256px" width="256px"/></div>
|
| 6 |
|
| 7 |
Welcome to the repo of **SALMONN**!
|
| 8 |
|
|
|
|
| 19 |
|
| 20 |
SALMONN adopts a speech & audio encoder to encode generic audio representation, then uses an audio-text aligner to map the audio feature into textual space. Finally, the large language model answers based on the textual prompt and the auditory tokens.
|
| 21 |
|
| 22 |
+
<div align=center><img src="https://cdn-uploads.huggingface.co/production/uploads/63770389cdcc1bf630870758/TEZzr54VZ5yc34LeixFbi.png" height="75%" width="75%"/></div>
|
| 23 |
|
| 24 |
## Demos
|
| 25 |
|