| # Seed-VC | |
| [](https://huggingface.co/spaces/Plachta/Seed-VC) [](https://arxiv.org/abs/2411.09943) | |
| *[English](README.md) | [ç®äœäžæ](README-ZH.md) | æ¥æ¬èª* | |
| [real-time-demo.webm](https://github.com/user-attachments/assets/86325c5e-f7f6-4a04-8695-97275a5d046c) | |
| *(泚æïŒãã®ææžã¯æ©æ¢°ç¿»èš³ã«ãã£ãŠçæããããã®ã§ããæ£ç¢ºæ§ã確ä¿ããããåªããŠããŸãããäžæç¢ºãªç¹ãããããŸãããè±èªçããåç §ãã ãããç¿»èš³ã®æ¹åæ¡ãããããŸããããPRãæè¿ããããŸãã)* | |
| çŸåšãªãªãŒã¹ãããŠããã¢ãã«ã¯ã*ãŒãã·ã§ããé³å£°å€æ* ðã*ãŒãã·ã§ãããªã¢ã«ã¿ã€ã é³å£°å€æ* ð£ïžã*ãŒãã·ã§ããæå£°å€æ* ð¶ ã«å¯Ÿå¿ããŠããŸãããã¬ãŒãã³ã°ãªãã§ã1ã30ç§ã®åç §é³å£°ãããã€ã¹ã¯ããŒãã³ã°ãå¯èœã§ãã | |
| ã«ã¹ã¿ã ããŒã¿ã§ã®è¿œå ãã¡ã€ã³ãã¥ãŒãã³ã°ããµããŒãããŠãããç¹å®ã®è©±è /話è 矀ã«å¯Ÿããããã©ãŒãã³ã¹ãåäžãããããšãã§ããŸããããŒã¿èŠä»¶ã¯æ¥µããŠå°ãªãïŒ**話è ãããæäœ1çºè©±**ïŒããã¬ãŒãã³ã°é床ãéåžžã«éãïŒ**æäœ100ã¹ããããT4ã§2å**ïŒã§ãïŒ | |
| **ãªã¢ã«ã¿ã€ã é³å£°å€æ**ã«å¯Ÿå¿ããŠãããã¢ã«ãŽãªãºã ã®é å»¶ã¯çŽ300msãããã€ã¹åŽã®é å»¶ã¯çŽ100msã§ããªã³ã©ã€ã³äŒè°ãã²ãŒã ãã©ã€ãé ä¿¡ã«é©ããŠããŸãã | |
| ãã¢ã以åã®é³å£°å€æã¢ãã«ãšã®æ¯èŒã«ã€ããŠã¯ã[ãã¢ããŒãž](https://plachtaa.github.io/seed-vc/)ðãš[è©äŸ¡](EVAL.md)ðãã芧ãã ããã | |
| ã¢ãã«ã®å質åäžãšæ©èœè¿œå ãç¶ç¶çã«è¡ã£ãŠããŸãã | |
| ## è©äŸ¡ð | |
| 客芳çè©äŸ¡çµæãšä»ã®ããŒã¹ã©ã€ã³ãšã®æ¯èŒã«ã€ããŠã¯[EVAL.md](EVAL.md)ãã芧ãã ããã | |
| ## ã€ã³ã¹ããŒã«ð¥ | |
| Windows ãŸã㯠Linux ã§ Python 3.10 ãæšå¥šããŸãã | |
| ```bash | |
| pip install -r requirements.txt | |
| ``` | |
| ## äœ¿çšæ¹æ³ð ïž | |
| ç®çã«å¿ããŠ3ã€ã®ã¢ãã«ããªãªãŒã¹ããŠããŸãïŒ | |
| | ããŒãžã§ã³ | åç§° | ç®ç | ãµã³ããªã³ã°ã¬ãŒã | ã³ã³ãã³ããšã³ã³ãŒã | ãã³ãŒã | é ãæ¬¡å | ã¬ã€ã€ãŒæ° | ãã©ã¡ãŒã¿æ° | åè | | |
| |---------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------|---------------|-----------------|---------|------------|----------|--------|--------------------------------------------------------| | |
| | v1.0 | seed-uvit-tat-xlsr-tiny ([ð€](https://huggingface.co/Plachta/Seed-VC/blob/main/DiT_uvit_tat_xlsr_ema.pth)[ð](configs/presets/config_dit_mel_seed_uvit_xlsr_tiny.yml)) | é³å£°å€æ (VC) | 22050 | XLSR-large | HIFT | 384 | 9 | 25M | ãªã¢ã«ã¿ã€ã é³å£°å€æã«é©ããŠããŸã | | |
| | v1.0 | seed-uvit-whisper-small-wavenet ([ð€](https://huggingface.co/Plachta/Seed-VC/blob/main/DiT_seed_v2_uvit_whisper_small_wavenet_bigvgan_pruned.pth)[ð](configs/presets/config_dit_mel_seed_uvit_whisper_small_wavenet.yml)) | é³å£°å€æ (VC) | 22050 | Whisper-small | BigVGAN | 512 | 13 | 98M | ãªãã©ã€ã³é³å£°å€æã«é©ããŠããŸã | | |
| | v1.0 | seed-uvit-whisper-base ([ð€](https://huggingface.co/Plachta/Seed-VC/blob/main/DiT_seed_v2_uvit_whisper_base_f0_44k_bigvgan_pruned_ft_ema.pth)[ð](configs/presets/config_dit_mel_seed_uvit_whisper_base_f0_44k.yml)) | æå£°å€æ (SVC) | 44100 | Whisper-small | BigVGAN | 768 | 17 | 200M | 匷åãªãŒãã·ã§ããããã©ãŒãã³ã¹ãæå£°å€æ | | |
| ææ°ã®ã¢ãã«ãªãªãŒã¹ã®ãã§ãã¯ãã€ã³ãã¯ãæåã®æšè«å®è¡æã«èªåçã«ããŠã³ããŒããããŸãã | |
| ãããã¯ãŒã¯ã®çç±ã§huggingfaceã«ã¢ã¯ã»ã¹ã§ããªãå Žåã¯ããã¹ãŠã®ã³ãã³ãã®åã« `HF_ENDPOINT=https://hf-mirror.com` ã远å ããŠãã©ãŒã䜿çšããŠãã ããã | |
| ã³ãã³ãã©ã€ã³æšè«ïŒ | |
| ```bash | |
| python inference.py --source <source-wav> | |
| --target <referene-wav> | |
| --output <output-dir> | |
| --diffusion-steps 25 # æå£°å€æã«ã¯30ã50ãæšå¥š | |
| --length-adjust 1.0 | |
| --inference-cfg-rate 0.7 | |
| --f0-condition False # æå£°å€æã®å Žåã¯Trueã«èšå® | |
| --auto-f0-adjust False # ãœãŒã¹ããããã¿ãŒã²ãããããã¬ãã«ã«èªå調æŽããå Žåã¯Trueãéåžžã¯æå£°å€æã§ã¯äœ¿çšããªã | |
| --semi-tone-shift 0 # æå£°å€æã®ãããã·ããïŒåé³åäœïŒ | |
| --checkpoint <path-to-checkpoint> | |
| --config <path-to-config> | |
| --fp16 True | |
| ``` | |
| åãã©ã¡ãŒã¿ã®èª¬æïŒ | |
| - `source` ã¯å€æãããé³å£°ãã¡ã€ã«ã®ãã¹ | |
| - `target` ã¯åç §é³å£°ãã¡ã€ã«ã®ãã¹ | |
| - `output` ã¯åºåãã£ã¬ã¯ããªã®ãã¹ | |
| - `diffusion-steps` ã¯æ¡æ£ã¹ãããæ°ãããã©ã«ãã¯25ãæé«å質ã«ã¯30-50ãæéæšè«ã«ã¯4-10ãäœ¿çš | |
| - `length-adjust` ã¯é·ã調æŽä¿æ°ãããã©ã«ãã¯1.0ã<1.0ã§é³å£°ççž®ã>1.0ã§é³å£°äŒžé· | |
| - `inference-cfg-rate` ã¯åºåã«åŸ®åŠãªéãããããããããã©ã«ãã¯0.7 | |
| - `f0-condition` ã¯ãœãŒã¹é³å£°ã®ããããåºåã«æ¡ä»¶ä»ããããã©ã°ãããã©ã«ãã¯Falseãæå£°å€æã®å Žåã¯True | |
| - `auto-f0-adjust` ã¯ãœãŒã¹ããããã¿ãŒã²ãããããã¬ãã«ã«èªå調æŽãããã©ã°ãããã©ã«ãã¯Falseãéåžžã¯æå£°å€æã§ã¯äœ¿çšããªã | |
| - `semi-tone-shift` ã¯æå£°å€æã®ãããã·ããïŒåé³åäœïŒãããã©ã«ãã¯0 | |
| - `checkpoint` ã¯ç¬èªã®ã¢ãã«ããã¬ãŒãã³ã°ãŸãã¯ãã¡ã€ã³ãã¥ãŒãã³ã°ããå Žåã®ã¢ãã«ãã§ãã¯ãã€ã³ããžã®ãã¹ã空çœã®å Žåã¯huggingfaceããããã©ã«ãã¢ãã«ãèªåããŠã³ããŒãïŒ`f0-condition`ã`False`ã®å Žåã¯`seed-uvit-whisper-small-wavenet`ããã以å€ã¯`seed-uvit-whisper-base`ïŒ | |
| - `config` ã¯ç¬èªã®ã¢ãã«ããã¬ãŒãã³ã°ãŸãã¯ãã¡ã€ã³ãã¥ãŒãã³ã°ããå Žåã®ã¢ãã«èšå®ãžã®ãã¹ã空çœã®å Žåã¯huggingfaceããããã©ã«ãèšå®ãèªåããŠã³ããŒã | |
| - `fp16` ã¯float16æšè«ã䜿çšãããã©ã°ãããã©ã«ãã¯True | |
| é³å£°å€æWeb UIïŒ | |
| ```bash | |
| python app_vc.py --checkpoint <path-to-checkpoint> --config <path-to-config> --fp16 True | |
| ``` | |
| - `checkpoint` ã¯ç¬èªã®ã¢ãã«ããã¬ãŒãã³ã°ãŸãã¯ãã¡ã€ã³ãã¥ãŒãã³ã°ããå Žåã®ã¢ãã«ãã§ãã¯ãã€ã³ããžã®ãã¹ã空çœã®å Žåã¯huggingfaceããããã©ã«ãã¢ãã«ãèªåããŠã³ããŒãïŒ`seed-uvit-whisper-small-wavenet`ïŒ | |
| - `config` ã¯ç¬èªã®ã¢ãã«ããã¬ãŒãã³ã°ãŸãã¯ãã¡ã€ã³ãã¥ãŒãã³ã°ããå Žåã®ã¢ãã«èšå®ãžã®ãã¹ã空çœã®å Žåã¯huggingfaceããããã©ã«ãèšå®ãèªåããŠã³ããŒã | |
| ãã©ãŠã¶ã§`http://localhost:7860/`ã«ã¢ã¯ã»ã¹ããŠWebã€ã³ã¿ãŒãã§ãŒã¹ã䜿çšã§ããŸãã | |
| æå£°å€æWeb UIïŒ | |
| ```bash | |
| python app_svc.py --checkpoint <path-to-checkpoint> --config <path-to-config> --fp16 True | |
| ``` | |
| - `checkpoint` ã¯ç¬èªã®ã¢ãã«ããã¬ãŒãã³ã°ãŸãã¯ãã¡ã€ã³ãã¥ãŒãã³ã°ããå Žåã®ã¢ãã«ãã§ãã¯ãã€ã³ããžã®ãã¹ã空çœã®å Žåã¯huggingfaceããããã©ã«ãã¢ãã«ãèªåããŠã³ããŒãïŒ`seed-uvit-whisper-base`ïŒ | |
| - `config` ã¯ç¬èªã®ã¢ãã«ããã¬ãŒãã³ã°ãŸãã¯ãã¡ã€ã³ãã¥ãŒãã³ã°ããå Žåã®ã¢ãã«èšå®ãžã®ãã¹ã空çœã®å Žåã¯huggingfaceããããã©ã«ãèšå®ãèªåããŠã³ããŒã | |
| çµ±åWeb UIïŒ | |
| ```bash | |
| python app.py | |
| ``` | |
| ããã¯ãŒãã·ã§ããæšè«çšã®äºååŠç¿æžã¿ã¢ãã«ã®ã¿ãèªã¿èŸŒã¿ãŸããã«ã¹ã¿ã ãã§ãã¯ãã€ã³ãã䜿çšããå Žåã¯ãäžèšã®`app_vc.py`ãŸãã¯`app_svc.py`ãå®è¡ããŠãã ããã | |
| ãªã¢ã«ã¿ã€ã é³å£°å€æGUIïŒ | |
| ```bash | |
| python real-time-gui.py --checkpoint-path <path-to-checkpoint> --config-path <path-to-config> | |
| ``` | |
| - `checkpoint` ã¯ç¬èªã®ã¢ãã«ããã¬ãŒãã³ã°ãŸãã¯ãã¡ã€ã³ãã¥ãŒãã³ã°ããå Žåã®ã¢ãã«ãã§ãã¯ãã€ã³ããžã®ãã¹ã空çœã®å Žåã¯huggingfaceããããã©ã«ãã¢ãã«ãèªåããŠã³ããŒãïŒ`seed-uvit-tat-xlsr-tiny`ïŒ | |
| - `config` ã¯ç¬èªã®ã¢ãã«ããã¬ãŒãã³ã°ãŸãã¯ãã¡ã€ã³ãã¥ãŒãã³ã°ããå Žåã®ã¢ãã«èšå®ãžã®ãã¹ã空çœã®å Žåã¯huggingfaceããããã©ã«ãèšå®ãèªåããŠã³ããŒã | |
| éèŠïŒãªã¢ã«ã¿ã€ã é³å£°å€æã«ã¯GPUã®äœ¿çšãåŒ·ãæšå¥šããŸãã | |
| NVIDIA RTX 3060ããŒãããœã³ã³GPUã§ããã€ãã®ããã©ãŒãã³ã¹ãã¹ããè¡ããçµæãšæšå¥šãã©ã¡ãŒã¿èšå®ã以äžã«ç€ºããŸãïŒ | |
| | ã¢ãã«æ§æ | æ¡æ£ã¹ããã | æšè«CFGã¬ãŒã | æå€§ããã³ããé· | ãããã¯æé (ç§) | ã¯ãã¹ãã§ãŒãé· (ç§) | 远å ã³ã³ããã¹ã (å·Š) (ç§) | 远å ã³ã³ããã¹ã (å³) (ç§) | ã¬ã€ãã³ã· (ããªç§) | ãã£ã³ã¯ãããã®æšè«æé (ããªç§) | | |
| |---------------------------------|-----------------|--------------------|-------------------|----------------|----------------------|--------------------------|---------------------------|--------------|-------------------------------| | |
| | seed-uvit-xlsr-tiny | 10 | 0.7 | 3.0 | 0.18 | 0.04 | 2.5 | 0.02 | 430 | 150 | | |
| GUIã§ãã©ã¡ãŒã¿ãèªèº«ã®ããã€ã¹ã®ããã©ãŒãã³ã¹ã«åãããŠèª¿æŽã§ããŸããæšè«æéããããã¯æéããçããã°ãé³å£°å€æã¹ããªãŒã ã¯æ£åžžã«åäœããã¯ãã§ãã | |
| ä»ã®GPUéçŽåã¿ã¹ã¯ïŒã²ãŒã ãåç»èŠèŽãªã©ïŒãå®è¡ããŠããå Žåãæšè«é床ãäœäžããå¯èœæ§ãããããšã«æ³šæããŠãã ããã | |
| ãªã¢ã«ã¿ã€ã é³å£°å€æGUIã®ãã©ã¡ãŒã¿èª¬æïŒ | |
| - `Diffusion Steps` ã¯æ¡æ£ã¹ãããæ°ããªã¢ã«ã¿ã€ã 倿ã®å Žåã¯éåžž4~10ã§æéæšè« | |
| - `Inference CFG Rate` ã¯åºåã«åŸ®åŠãªéãããããããããã©ã«ãã¯0.7ã0.0ã«èšå®ãããš1.5åã®æšè«é床ãåäž | |
| - `Max Prompt Length` ã¯æå€§ããã³ããé·ãèšå®ãäœããããšæšè«é床ãéããªãããæç€ºé³å£°ãšã®é¡äŒŒæ§ãäœäžããå¯èœæ§ããã | |
| - `Block Time` ã¯æšè«ã®åãªãŒãã£ãª ãã£ã³ã¯ã®æéé·ã§ããå€ã倧ããã»ã©ã¬ã€ãã³ã·ãé·ããªããŸãããã®å€ã¯ãããã¯ãããã®æšè«æéãããé·ãããå¿ èŠãããããšã«æ³šæããŠãã ãããããŒããŠã§ã¢ã®ç¶æ ã«å¿ããŠèšå®ããŸãã | |
| - `Crossfade Length` ã¯ã¯ãã¹ãã§ãŒãé·ãéåžžã¯å€æŽããªã | |
| - `Extra context (left)` ã¯æšè«ã®ããã®è¿œå å±¥æŽã³ã³ããã¹ãã®æéé·ã§ããå€ãé«ãã»ã©æšè«æéã¯é·ããªããŸãããå®å®æ§ã¯åäžããŸãã | |
| - `Extra context (right)` ã¯æšè«ã®ããã®è¿œå æªæ¥ã³ã³ããã¹ãã®æéé·ã§ããå€ãé«ãã»ã©æšè«æéãšã¬ã€ãã³ã·ã¯é·ããªããŸãããå®å®æ§ã¯åäžããŸãã | |
| ã¢ã«ãŽãªãºã ã¬ã€ãã³ã·ãŒã¯`Block Time * 2 + Extra context (right)`ã§ãããã€ã¹åŽã¬ã€ãã³ã·ãŒã¯éåžž100msçšåºŠã§ããå šäœã®é 延㯠2 ã€ã®åèšã§ãã | |
| [VB-CABLE](https://vb-audio.com/Cable/)ã䜿çšããŠãGUIåºåã¹ããªãŒã ãä»®æ³ãã€ã¯ã«ã«ãŒãã£ã³ã°ããããšãã§ããŸãã | |
| *ïŒGUIãšãªãŒãã£ãªãã£ã³ãã³ã°ã®ããžãã¯ã¯[RVC](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)ããä¿®æ£ãããŠããŸããçŽ æŽãããå®è£ ã«æè¬ããŸãïŒïŒ* | |
| ## ãã¬ãŒãã³ã°ðïž | |
| ã«ã¹ã¿ã ããŒã¿ã§ã®ãã¡ã€ã³ãã¥ãŒãã³ã°ã«ãããããæ£ç¢ºã«å£°ãã¯ããŒãã³ã°ããããšãã§ããŸããç¹å®ã®è©±è ã«å¯Ÿãã話è é¡äŒŒæ§ãå€§å¹ ã«åäžããŸãããWERãè¥å¹²äžæããå¯èœæ§ããããŸãã | |
| 以äžã®Colabãã¥ãŒããªã¢ã«ã§æé ã確èªã§ããŸãïŒ[](https://colab.research.google.com/drive/1R1BJTqMsTXZzYAVx3j1BiemFXog9pbQG?usp=sharing) | |
| 1. ç¬èªã®ããŒã¿ã»ãããæºåããŸãã以äžã®æ¡ä»¶ãæºããå¿ èŠããããŸãïŒ | |
| - ãã¡ã€ã«æ§é ã¯åããŸãã | |
| - åé³å£°ãã¡ã€ã«ã¯1ã30ç§ã®ç¯å²ã§ããå¿ èŠãããããã以å€ã¯ç¡èŠãããŸã | |
| - ãã¹ãŠã®é³å£°ãã¡ã€ã«ã¯ä»¥äžã®ããããã®åœ¢åŒã§ããå¿ èŠããããŸãïŒ`.wav` `.flac` `.mp3` `.m4a` `.opus` `.ogg` | |
| - 話è ã©ãã«ã¯å¿ é ã§ã¯ãããŸããããå話è ã«å°ãªããšã1ã€ã®çºè©±ãããããšã確èªããŠãã ãã | |
| - ãã¡ãããããŒã¿ãå€ãã»ã©ã¢ãã«ã®ããã©ãŒãã³ã¹ã¯åäžããŸã | |
| - ãã¬ãŒãã³ã°ããŒã¿ã¯ã§ããã ãã¯ãªãŒã³ã§ããå¿ èŠããããBGMããã€ãºã¯æãŸãããããŸãã | |
| 2. ãã¡ã€ã³ãã¥ãŒãã³ã°çšã«`configs/presets/`ããã¢ãã«èšå®ãã¡ã€ã«ãéžæãããããŒããããã¬ãŒãã³ã°ããããã®ç¬èªã®èšå®ãäœæããŸãã | |
| - ãã¡ã€ã³ãã¥ãŒãã³ã°ã®å Žåã¯ã以äžã®ãããããéžæããŸãïŒ | |
| - `./configs/presets/config_dit_mel_seed_uvit_xlsr_tiny.yml` ãªã¢ã«ã¿ã€ã é³å£°å€æçš | |
| - `./configs/presets/config_dit_mel_seed_uvit_whisper_small_wavenet.yml` ãªãã©ã€ã³é³å£°å€æçš | |
| - `./configs/presets/config_dit_mel_seed_uvit_whisper_base_f0_44k.yml` æå£°å€æçš | |
| 3. 以äžã®ã³ãã³ãã§ãã¬ãŒãã³ã°ãéå§ããŸãïŒ | |
| ```bash | |
| python train.py | |
| --config <path-to-config> | |
| --dataset-dir <path-to-data> | |
| --run-name <run-name> | |
| --batch-size 2 | |
| --max-steps 1000 | |
| --max-epochs 1000 | |
| --save-every 500 | |
| --num-workers 0 | |
| ``` | |
| åãã©ã¡ãŒã¿ã®èª¬æïŒ | |
| - `config` ã¯ã¢ãã«èšå®ãžã®ãã¹ããã¡ã€ã³ãã¥ãŒãã³ã°çšã«äžèšã®ãããããéžæãããããŒããããã¬ãŒãã³ã°ããå Žåã¯ç¬èªã®èšå®ãäœæ | |
| - `dataset-dir` ã¯ããŒã¿ã»ãããã£ã¬ã¯ããªãžã®ãã¹ããã¹ãŠã®é³å£°ãã¡ã€ã«ãå«ããã©ã«ãã§ããå¿ èŠããããŸã | |
| - `run-name` ã¯å®è¡åã§ãã¢ãã«ãã§ãã¯ãã€ã³ããšãã°ã®ä¿åã«äœ¿çšãããŸã | |
| - `batch-size` ã¯ãã¬ãŒãã³ã°çšã®ããããµã€ãºã§ãGPUã¡ã¢ãªã«å¿ããŠéžæããŸã | |
| - `max-steps` ã¯æå€§ãã¬ãŒãã³ã°ã¹ãããæ°ã§ãããŒã¿ã»ãããµã€ãºãšãã¬ãŒãã³ã°æéã«å¿ããŠéžæããŸã | |
| - `max-epochs` ã¯æå€§ãšããã¯æ°ã§ãããŒã¿ã»ãããµã€ãºãšãã¬ãŒãã³ã°æéã«å¿ããŠéžæããŸã | |
| - `save-every` ã¯ã¢ãã«ãã§ãã¯ãã€ã³ããä¿åããã¹ãããéé | |
| - `num-workers` ã¯ããŒã¿èªã¿èŸŒã¿ã®ã¯ãŒã«ãŒæ°ãWindowsã®å Žåã¯0ã«èšå® | |
| 4. ãã¬ãŒãã³ã°ãäºæãã忢ããå Žåãåãã³ãã³ããå床å®è¡ããããšã§ãæåŸã®ãã§ãã¯ãã€ã³ãããåéã§ããŸãïŒææ°ã®ãã§ãã¯ãã€ã³ããèŠã€ããããããã«ã`run-name`ãš`config`åŒæ°ãåãã§ããããšã確èªããŠãã ããïŒã | |
| 5. ãã¬ãŒãã³ã°åŸããã§ãã¯ãã€ã³ããšèšå®ãã¡ã€ã«ã®ãã¹ãæå®ããããšã§ããã¬ãŒãã³ã°ããã¢ãã«ãæšè«ã«äœ¿çšã§ããŸãã | |
| - ãããã¯`./runs/<run-name>/`ã®äžã«ããããã§ãã¯ãã€ã³ãã¯`ft_model.pth`ãšããååã§ãèšå®ãã¡ã€ã«ã¯ãã¬ãŒãã³ã°èšå®ãã¡ã€ã«ãšåãååã§ãã | |
| - æšè«æã«ã¯ããŒãã·ã§ããäœ¿çšæãšåæ§ã«ã䜿çšããã話è ã®åç §é³å£°ãã¡ã€ã«ãæå®ããå¿ èŠããããŸãã | |
| ## TODOð | |
| - [x] ã³ãŒãã®ãªãªãŒã¹ | |
| - [x] äºååŠç¿æžã¿ã¢ãã«ã®ãªãªãŒã¹ïŒ[](https://huggingface.co/Plachta/Seed-VC) | |
| - [x] Huggingfaceã¹ããŒã¹ãã¢ïŒ[](https://huggingface.co/spaces/Plachta/Seed-VC) | |
| - [x] HTMLãã¢ããŒãžïŒ[Demo](https://plachtaa.github.io/seed-vc/) | |
| - [x] ã¹ããªãŒãã³ã°æšè« | |
| - [x] ã¹ããªãŒãã³ã°æšè«ã®ã¬ã€ãã³ã·ãŒåæž | |
| - [x] ãªã¢ã«ã¿ã€ã é³å£°å€æã®ãã¢åç» | |
| - [x] æå£°å€æ | |
| - [x] ãœãŒã¹é³å£°ã®ãã€ãºèæ§ | |
| - [ ] ã¢ãŒããã¯ãã£ã®æœåšçãªæ¹å | |
| - [x] U-ViTã¹ã¿ã€ã«ã®ã¹ãããæ¥ç¶ | |
| - [x] OpenAI Whisperãžã®å ¥åå€æŽ | |
| - [x] Time as Token | |
| - [x] ã«ã¹ã¿ã ããŒã¿ã§ã®ãã¬ãŒãã³ã°ã³ãŒã | |
| - [x] ãã¥ãŒã·ã§ãã/ã¯ã³ã·ã§ãã話è ãã¡ã€ã³ãã¥ãŒãã³ã° | |
| - [x] æå£°ãã³ãŒãã£ã³ã°çšã«NVIDIAã®BigVGANã«å€æŽ | |
| - [x] æå£°å€æçšã®WhisperããŒãžã§ã³ã¢ãã« | |
| - [x] æå£°å€æã®RVC/SoVITSãšã®å®¢èгçè©äŸ¡ãšæ¯èŒ | |
| - [x] é³å£°å質ã®åäž | |
| - [ ] ããè¯ãæå£°å€æã®ããã®NSFãã³ãŒã | |
| - [x] éçºè©±æã®ãªã¢ã«ã¿ã€ã é³å£°å€æã¢ãŒãã£ãã¡ã¯ãã®ä¿®æ£ïŒVADã¢ãã«ã®è¿œå ã«ãã察å¿ïŒ | |
| - [x] ãã¡ã€ã³ãã¥ãŒãã³ã°äŸã®ColabããŒããã㯠| |
| - [ ] Whisperãããé«åºŠãªæå³æœåºåšã«çœ®ãæãã | |
| - [ ] ä»åŸè¿œå äºå® | |
| ## æŽæ°å±¥æŽðïž | |
| - 2024-11-26: | |
| - ãªã¢ã«ã¿ã€ã é³å£°å€æçšã«æé©åãããv1.0 tinyããŒãžã§ã³ã®äºååŠç¿æžã¿ã¢ãã«ãæŽæ° | |
| - ã¯ã³ã·ã§ãã/ãã¥ãŒã·ã§ããã®åäž/è€æ°è©±è ãã¡ã€ã³ãã¥ãŒãã³ã°ããµããŒã | |
| - webUIããã³ãªã¢ã«ã¿ã€ã GUIã§ã«ã¹ã¿ã ãã§ãã¯ãã€ã³ãã®äœ¿çšããµããŒã | |
| - 2024-11-19: | |
| - arXivè«æå ¬é | |
| - 2024-10-28: | |
| - ããè¯ãé³å£°å質ã®ãã¡ã€ã³ãã¥ãŒãã³ã°ããã44kæå£°å€æã¢ãã«ãæŽæ° | |
| - 2024-10-27: | |
| - ãªã¢ã«ã¿ã€ã é³å£°å€æGUIã远å | |
| - 2024-10-25: | |
| - æå£°å€æã®RVCv2ãšã®å æ¬çãªè©äŸ¡çµæãšæ¯èŒã远å | |
| - 2024-10-24: | |
| - é³å£°ã³ã³ãã³ãå ¥åãšããŠOpenAI Whisperã䜿çšãã44kHzæå£°å€æã¢ãã«ãæŽæ° | |
| - 2024-10-07: | |
| - é³å£°ã³ã³ãã³ããšã³ã³ãŒããOpenAI Whisperã«å€æŽããv0.3äºååŠç¿æžã¿ã¢ãã«ãæŽæ° | |
| - v0.3äºååŠç¿æžã¿ã¢ãã«ã®å®¢èгçè©äŸ¡çµæã远å | |
| - 2024-09-22: | |
| - NVIDIAã®BigVGANã䜿çšããæå£°å€æã¢ãã«ãæŽæ°ããé«é³åã®æå£°ãå€§å¹ ã«æ¹å | |
| - Web UIã§é·ãé³å£°ãã¡ã€ã«ã®ãã£ã³ãã³ã°ãšã¹ããªãŒãã³ã°åºåããµããŒã | |
| - 2024-09-18: | |
| - æå£°å€æçšã®f0æ¡ä»¶ä»ãã¢ãã«ãæŽæ° | |
| - 2024-09-14: | |
| - åãå質ãéæããããã®ãµã€ãºçž®å°ã𿡿£ã¹ãããæ°ã®åæžãããã³ãããœãã£ä¿æã®å¶åŸ¡èœåã远å ããv0.2äºååŠç¿æžã¿ã¢ãã«ãæŽæ° | |
| - ã³ãã³ãã©ã€ã³æšè«ã¹ã¯ãªããã远å | |
| - ã€ã³ã¹ããŒã«ãšäœ¿ç𿹿³ã®èª¬æã远å |