Wednesday, November 15, 2023
HomeSocial MediaMeta Showcases New ‘Voicebox’ Speech-to-Textual content Translation Device

Meta Showcases New ‘Voicebox’ Speech-to-Textual content Translation Device


On the floor not less than, Meta’s newest AI development doesn’t appear to be a significant step.

Immediately, Meta has revealed an outline of its new ‘Voicebox’ AI system, which can allow customers to translate textual content to audio, in a spread of kinds and voices.

As offered on this overview clip, the Voicebox system can take textual content inputs and translate them into audio, with completely different voice choices, enabling extra superior text-to-audio translation, however with decreased studying and processing necessities than different, comparable choices.

Although, on the floor not less than, it’s not a heap completely different from the text-to-audio instruments that we’re now accustomed to – whether or not we like them or not – on TikTok and different apps.

The Voicebox translations sound fairly comparable – and I’m prepared to wager Meta gained’t let me use the voice of Rocket Raccoon or a Transformer in these new translations.

However the Voicebox system can be greater than only a direct text-to-speech translation device.

As defined by Meta:

Voicebox can produce top quality audio clips and edit pre-recorded audio – like eradicating automobile horns or a canine barking – all whereas preserving the content material and magnificence of the audio. The mannequin can be multilingual and might produce speech in six languages. Sooner or later, multipurpose generative AI fashions like Voicebox may give natural-sounding voices to digital assistants and non-player-characters within the metaverse. They might enable visually impaired individuals to listen to written messages from buddies learn by AI of their voices, give creators new instruments to simply create and edit audio tracks for movies, and rather more.”

As Meta notes, Voicebox additionally lets you use fashions of voice for translation, so you need to use an audio clip of one other individual with a view to make your text-to-speech translation sound like that individual is talking, through simply seconds of audio enter.

Which is able to undoubtedly result in a brand new raft of deepfakes – although once more, comparable instruments do exist already. They’re simply not the identical, and Meta says not pretty much as good, as this new course of.

The true advantage of Voicebox, in a broad-reaching sense, will probably be in translation, and enabling simplified, native-sounding variations of your textual content inputs in several languages. That would open up new, cross-market alternatives, whereas the superior modeling of the system may also facilitate broader use instances and course of, which may present different key advantages.

However Meta can be conscious of the dangers.

At this stage, Meta isn’t releasing the supply code or app to the general public, citing ‘the potential dangers of misuse’. It’s hoping to seek out extra sensible, precious use instances for the expertise over time – so its announcement right this moment is extra of an FYI than a launch, as such.

You’ll be able to learn extra about Meta’s Voicebox mission right here.



RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Most Popular

Recent Comments