Universal translators are tantalizing close as Facebook’s Meta reveals its tech can translate between 101 languages


Back in August 2023, Meta revealed an ‘all-in-one’ AI translation model capable of understanding close to 100 different languages.
Dubbed SeamlessM4T (Massively Multilingual and Multimodal Machine Translation), this is Meta’s attempt at creating a ‘universal translator’ akin to the Babel Fish in Douglas Adams’ classic sci-fi series The Hitchhiker’s Guide to the Galaxy.
The team behind the SeamlessM4T tool has now detailed its work in a piece in the journal Nature, revealing the advanced system delivers an all-in-one solution for text-to-text, speech-to-text, speech-to-speech, and text-to-speech translations across an impressive, and growing, array of languages.
Over 400 years of raw audio
SeamlessM4T, which, among other things, is being used to automatically dub videos on Facebook and Instagram, currently supports speech-to-speech translation from 101 to 36 languages, speech-to-text translation for from 101 to 96 languages, text-to-text translation for 96 languages, text-to-speech translation from 96 to 36 languages, and automatic speech recognition for 96 languages. This unified approach overcomes the limitations of traditional cascaded systems, which often require separate subsystems for speech recognition, translation, and text-to-speech synthesis.
By streamlining these processes, Meta says SeamlessM4T outperforms existing models, achieving up to 23% higher BLEU (Bilingual Evaluation Understudy) scores in translation accuracy and demonstrating impressive resilience to background noise and speaker variations.
To create SeamlessM4T, Meta started with 4 million hours (over 400 years) of multilingual raw audio originating from a publicly available repository of crawled web data. The team developed SeamlessAlign, a multimodal corpus containing over 470,000 hours of aligned speech and combined the dataset with cutting-edge machine learning techniques, including SONAR (Sentence-level Multimodal and Language-Agnostic Representations) embeddings, which enable multilingual and modality-agnostic encoding for text and speech.
Meta says that by addressing social and ethical challenges through the use of safeguards, SeamlessM4T can be a valuable tool for global communication. These safeguards reduce gender bias – errors in grammatical gender determination – and mitigate the problem of added toxicity – where offensive words appear in translations but not in the original source.
Sign up to the TechRadar Pro newsletter to get all the top news, opinion, features and guidance your business needs to succeed!
You might also like
Back in August 2023, Meta revealed an ‘all-in-one’ AI translation model capable of understanding close to 100 different languages. Dubbed SeamlessM4T (Massively Multilingual and Multimodal Machine Translation), this is Meta’s attempt at creating a ‘universal translator’ akin to the Babel Fish in Douglas Adams’ classic sci-fi series The Hitchhiker’s Guide…
Recent Posts
- H&R Block Coupons and Deals: $50 Off Tax Prep in 2025
- Elon Musk says Grok 2 is going open source as he rolls out Grok 3 for Premium+ X subscribers only
- FTC Chair praises Justice Thomas as ‘the most important judge of the last 100 years’ for Black History Month
- HP acquires Humane AI assets and the AI pin will suffer a humane death
- HP acquires Humane AI assets and the AI pin may suffer a humane death
Archives
- February 2025
- January 2025
- December 2024
- November 2024
- October 2024
- September 2024
- August 2024
- July 2024
- June 2024
- May 2024
- April 2024
- March 2024
- February 2024
- January 2024
- December 2023
- November 2023
- October 2023
- September 2023
- August 2023
- July 2023
- June 2023
- May 2023
- April 2023
- March 2023
- February 2023
- January 2023
- December 2022
- November 2022
- October 2022
- September 2022
- August 2022
- July 2022
- June 2022
- May 2022
- April 2022
- March 2022
- February 2022
- January 2022
- December 2021
- November 2021
- October 2021
- September 2021
- August 2021
- July 2021
- June 2021
- May 2021
- April 2021
- March 2021
- February 2021
- January 2021
- December 2020
- November 2020
- October 2020
- September 2020
- August 2020
- July 2020
- June 2020
- May 2020
- April 2020
- March 2020
- February 2020
- January 2020
- December 2019
- November 2019
- September 2018
- October 2017
- December 2011
- August 2010