Meta has developed an AI translator for an unwritten language

资讯 2024-09-22 04:33:12 5336

Meta has developed an artificially intelligent translation system that can convert an oral language — Hokkien — to spoken English. It's another step closer to finally making Star Trek's Universal Translator a reality.

CEO Mark Zuckerberg shared a video demonstrating the tech alongside software engineer Peng-Jen Chen in a Facebook post on Wednesday. In it, the two converse in English and Hokkien respectively, with Meta's AI system audibly translating. The demonstration appears fairly impressive, though like Meta's VR legs, it's highly likely the video was edited for illustrative purposes and the current product isn't quite as smooth.

Translation AI is typically trained on text, with researchers feeding their systems reams of written words for them to learn from. However, there are over 3,000 languages that are primarily spoken and have no widely used written system, making them difficult to incorporate into such training.

Hokkien is one such language. Used by over 45 million people across mainland China, Taiwan, Malaysia, Singapore, and the Philippines, Hokkien is an oral language without an official, standard written system.

As such, Hokkien speakers who need to write down information tend to do so phonetically, resulting in significant variation depending on the writer. There is very little recorded data translating Hokkien to English either, and professional human translators are scarce.

Mashable Light SpeedWant more out-of-this world tech, space and science stories?Sign up for Mashable's weekly Light Speed newsletter.By signing up you agree to our Terms of Use and Privacy Policy.Thanks for signing up!
SEE ALSO:'Squid Game' and the ‘untranslatable’: The debate around subtitles explained

To work around this, Meta used written Mandarin as an intermediary between English and Hokkien when training its AI. 

"Our team first translated English or Hokkien speech to Mandarin text, and then translated it to Hokkien or English — both with human annotators and automatically," Meta researcher Juan Pino said. "They then added the paired sentences to the data used to train the AI model."

Of course, filtering a sentence through multiple languages can sometimes distort its meaning — as anyone who has ever played around with Google Translate knows. Meta also worked with Hokkien speakers to check translations, and is releasing its models, data, and research as open-source information for other researchers to utilise.


Related Stories
  • Meta's VR legs aren't real and can't hurt you
  • Anne Hathaway and Jimmy Fallon's Google translated songs are catchier than the originals
  • Meta has made thousands of dollars from hate speech ads, report finds
  • Google Meet can now translate speech and turn it into captions on the fly
  • Meta employees aren't thrilled about Mark Zuckerberg's metaverse chase

Mashable has reached out to Meta for comment.

We're still quite a way off from having a fully functioning TARDIS Translation Matrix. Meta's oral translation system can currently only handle one sentence at a time, and only works with translations between Hokkien and English. Even so, it's promising progress toward the real-time oral translation Meta is aiming for.

Meta's Hokkien-English oral translation project is part of Meta's ongoing Universal Speech Translator program, which aims to be able to translate speech between two languages as they're being spoken.

本文地址:http://x.zzzogryeb.bond/html/19c399581.html
版权声明

本文仅代表作者观点,不代表本站立场。
本文系作者授权发表,未经许可,不得转载。

全站热门

广东超一半的北运淡水鱼来自这里!全链发力打响“南海鱼”金字招牌

[Graphic News] Air passengers drop to record low in March amid virus woes

茶旅融合发展 名山茶树育种园受游客青睐

老人路上突发病 好心人接力相助

SpaceX Polaris Dawn mission: How to watch the launch

Uber will make you explain why you're giving a driver a bad rating

North Korea ramps up public executions

陆路内联外通 水上通江达海

友情链接