๐ค Miipher-2 Speech Enhancement
How it works
- Upload a noisy or degraded audio file
 - Process using Miipher-2 model
 - Download the enhanced audio
 
Model Details
- SSL Backbone: mHuBERT-147 (Multilingual)
 - Adapter: Parallel adapters at layer 6
 - Vocoder: HiFi-GAN trained on SSL features
 - Input: Any sample rate (automatically resampled to 16kHz)
 - Output: 22.05kHz high-quality audio
 
Tips
- Works best with speech audio
 - Supports various noise types (background noise, reverb, etc.)
 - Processing time depends on audio length and hardware