VietTTS is an open-source toolkit providing the community with a powerful Vietnamese TTS model, capable of natural voice synthesis and robust voice cloning. Designed for effective experimentation, ...
Real-time voice artificial intelligence startup Deepgram Inc. has reasons to think the new year could be a good one after ...
Abstract: Deep biasing (DB) enhances the performance of end-to-end automatic speech recognition (E2E-ASR) models for rare words or contextual phrases using a bias list. However, most existing methods ...
Abstract: The fast growth of internet and communications networks has drastically enhanced data transport, allowing tasks like Speech Emotion Recognition (SER), an essential aspect of human-computer ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results