NVIDIA’s AI Transcription Tool Produces 60 Minutes of Text in 1 Second
NVIDIA has released a new version of its Parakeet transcription tool, boasting the lowest error rate of any of its competitors. In addition, the company made the code public on GitHub.
Parakeet TDT 0.6B is a 600-million-parameter automatic speech recognition model. It can transcribe 60 minutes of audio per second, Hugging Face data scientist Vaibhav Srivastav said on X on May 5.
The model is recommended for, but not limited to, “conversational AI, voice assistants, transcription services,