Ultimate RVC Maker ⚡
If you liked this HF Space you can give me a ❤️
Try Ultimate RVC Maker WebUI using Colab here
Convert Audio
Convert audio using a trained voice model
Enter the path to the audio file
Extracting pitch using the ONNX model can help improve speed
Unlock all pitch extraction methods
Combination of two or more different types of extracts
Converted audio
Convert Text to Speech
Convert text to speech and read aloud using the trained voice model
Extracting pitch using the ONNX model can help improve speed
Unlock all pitch extraction methods
Combination of two or more different types of extracts
Unconverted and converted audio
Convert Audio With Whisper
Convert audio using a trained speech model with a Whisper model for speech recognition
Whisper will recognize different voices then cut the individual voices and use the RVC model to convert those segments
The Whisper model may not work properly which may cause strange output
Enter the path to the audio file
Extracting pitch using the ONNX model can help improve speed
Unlock all pitch extraction methods
Combination of two or more different types of extracts
Audio input, output
Download Model
Download voice models, pre-trained models, and embedding models
Choose a pre-trained model to download
Model sample rate
Create Dataset training from YouTube
Process and create training datasets using YouTube links
Train Model
Train and build a voice model with a set of voice data
Extracting pitch using the ONNX model can help improve speed
Unlock all pitch extraction methods
Custom dataset folder for training data
Check for overtraining during model training
Only enable if you need to retrain the model from scratch.
Store the model in GPU cache memory
Save only the latest D and G models
Save all models after each epoch
Do not use pre-trained models
Customize pre-training settings
When enabled, highly deterministic algorithms are used, ensuring that each run of the same input data will yield the same results.
When disabled, more optimal algorithms may be selected but may not be fully deterministic, resulting in different training results between runs.
When enabled, it will test and select the most optimized algorithm for the specific hardware and size. This can help speed up training.
When disabled, it will not perform this algorithm optimization, which can reduce speed but ensures that each run uses the same algorithm, which is useful if you want to reproduce exactly.
Fushion Two Models
Combine two voice models into a single model
Read Model Information
Retrieve recorded information within the model
Converting PYTORCH Model to ONNX Model
Convert RVC model from pytorch to onnx to optimize audio conversion
Music Separation
A simple music separation system can separate into 4 parts: Instruments, Vocals, Main vocals, Backup vocals
Separated output
Editing Soundtrack Using Audioldm2 Model
Editing the soundtrack using Audioldm2 model can help change the type of instrument inside the soundtrack
Enter the path to the audio file
Audio output
Add Additional Audio Effects
Add effects to audio
Enter the path to the audio file
Enter the path to the audio file
Create a continuous echo effect when this mode is enabled
Audio output
Pitch Extraction
F0 pitch extraction is intended for use in audio conversion inference
Extracting pitch using the ONNX model can help improve speed
Additional Settings
Customize additional features of the project
The display language in the project (When changing the language, the system will automatically restart after 15 seconds to update)
Theme type displayed in the interface (When changing the theme, the system will automatically restart after 15 seconds to update)
Please do not use the project for any unethical, illegal, or harmful purposes to individuals or organizations...
In cases where users do not comply with the terms or violate them, I will not be responsible for any claims, damages, or liabilities, whether in contract, negligence, or other causes arising from, outside of, or related to the software, its use, or other transactions associated with it.