With the hope of building a universal language translator, Meta on August 22 released a new AI model called SeamlessM4T that can perform translation and transcription services in dozens of languages. The model currently supports: Speech recognition for nearly 100 languages. Speech-to-text translation for nearly 100 input and output languages. Speech-to-speech translation, supporting nearly 100 input languages and 36 output languages. Text-to-text translation for nearly 100 languages. Text-to-speech translation, supporting nearly 100 input languages and 36 output languages. You can try out a demo of the AI model here or get an idea of how it works from the video below: [video width="1920" height="1080" mp4="https://www.medianama.com/wp-content/uploads/2023/08/01_Seamless-Multitasking-video.mp4"][/video] The model is available publicly under a research license for researchers and developers. It cannot be used for commercial reasons under this license. Meta also released the metadata of SeamlessAlign, a multimodal translation dataset totalling 270,000 hours of mined speech and text alignments. For more on the technical details of the model, check out the post by Meta here. Why does this matter: Translation and transcription tech that are currently available publicly are either limited in features (such as only allowing text-to-text, speech-to-text, etc.) or limited in the number of languages they support. SeamlessM4T is multimodal in that it supports various transcription and translation services (as listed above) in a single model and also supports a large number of languages already. Separately, as pointed out by Meta in its announcement, "The world we live in has never been more interconnected, giving people access to…
- RBI increases UPI and recurring transaction limits for certain categories December 8, 2023
- The Quantum Hub discusses concerns on age verification and children’s data processing under India’s data protection law December 8, 2023
- India’s central bank to set up cloud facility for financial sector December 8, 2023
- Navigating the Complexities of Open Source AI: Insights from Carnegie India Summit December 8, 2023
- Supreme Court Dismisses Plea by Foundation for Media Professionals on Internet Shutdowns December 8, 2023
MediaNama’s mission is to help build a digital ecosystem which is open, fair, global and competitive.
Factors like Indus not charging developers any commission for in-app payments and antitrust orders issued by India's competition regulator against Google could contribute to...
Is open-sourcing of AI, and the use cases that come with it, a good starting point to discuss the responsibility and liability of AI?...
RBI Deputy Governor Rabi Shankar called for self-regulation in the fintech sector, but here's why we disagree with his stance.
Straw man fallacy: IT Ministers’ defence of government exemptions in data protection law misses the point
Both the IT Minister and the IT Minister of State have chosen to avoid the actual concerns raised, and have instead defended against lesser...
The Central Board of Film Certification found power outside the Cinematograph Act and came to be known as the Censor Board. Are OTT self-regulating...
Please subscribe to MediaNama. Don't share prints and PDFs.
You May Also Like
Google has released a Google Travel Trends Report which states that branded budget hotel search queries grew 179% year over year (YOY) in India, in...
135 job openings in over 60 companies are listed at our free Digital and Mobile Job Board: If you’re looking for a job, or...
Twitter takes down tweets from MP, MLA, editor criticising handling of pandemic upon government request
By Aroon Deep and Aditya Chunduru You’re reading it here first: Twitter has complied with government requests to censor 52 tweets that mostly criticised...