But audio outputs like the clip above demonstrate the potential for a commercial system later down the line. Then the automatic voice translator will quickly and accurately recognize the voice input by the user, translate directly into the language you set, and read the translation result aloud through the text-to-voice feature. During testing, the researchers trialed the system only with Spanish-to-English translation, which already took a lot of carefully curated training data. Danish to Nepali translator allows users to speak and translate voice to text (voice typing). Translatotron is currently a proof of concept. Not only does this approach produce more nuanced translations by retaining important nonverbal cues, but in theory it should also minimize translation error, because it reduces the task to fewer steps. The third component can then layer the original speaker’s vocal characteristics back into the final audio output. The second converts the spectrogram into an audio wave that can be played. The first component uses a neural network trained to map the audio spectrogram in the input language to the audio spectrogram in the output language. The new system, dubbed the Translatotron, has three components, all of which look at the speaker’s audio spectrogram-a visual snapshot of the frequencies used when the sound is playing, often called a voiceprint.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |