DeepL, known for text translation, now wants to translate your voice

DeepL, a translation company known for its text tools, has released a voice-to-voice translation suite that covers meetings, mobile and web conversations, and group conversations. The suite includes add-ons for platforms like Zoom and Microsoft Teams, allowing listeners to hear real-time translation or follow translated text on screen. The technology can also learn and adapt to custom vocabulary, such as industry-specific terms.

The company is releasing an API that lets outside developers and businesses build on top of DeepL's tech for customized use cases, including call centers. The voice-to-voice stack currently converts speech to text, applies translation, then converts back to speech, but DeepL aims to develop an end-to-end model that skips the text step entirely.

The product is currently under early access and available by invitation only. Organizations can join a waitlist to be notified when it becomes available. DeepL claims to have an edge in translation quality due to its experience with text translation.

DeepL faces competition from several startups, including Sanas, Camb.AI, Amazon Web Services, and Palabra, which are building similar technologies for real-time speech translation.