VibeVoice ASR

Speech recognition service supporting Chinese and English transcription with speaker identification. Based on Whisper architecture.

API 使用方式
使用 model ID:vibevoice-asr