Saturday, February 22, 2020
Home Machine Learning Amazon researchers trained an AI model in multiple languages to improve product...

Amazon researchers trained an AI model in multiple languages to improve product searches

Advertisement

Amazon operates in 14 countries around the world, nine of which are eligible for its Prime yearly subscription service. It goes without saying that the company has a real desire to make available its shopping experience in any number of languages, particularly where customers who speak different dialects are searching for the same products.

In pursuit of an efficient means of translating multiple languages, Amazon researchers devised a shopping model called a multitask model, in which the functions overlap across tasks and tend to reinforce each other. They say that their AI, which was trained on data from several different languages at once, delivered better results using any of those languages.

As Amazon applied scientist Nikhil Rao explained in a blog post, the reason for the improvement is that a corpus in one language is able to fill gaps in that of another language. For instance, phrases easily confused in French might not look much like their equivalents in German, so multilingual training could help sharpen the distinctions among several product queries.

Amazon AI shopping model

These images depict embeddings of queries and product descriptions in Italian and English. At left are the embeddings that result from separate training of four monolingual models; queries (orange) and product descriptions (blue) in Italian and in English (green and yellow) cluster in four distinct regions of the space. At right are the embeddings that result from simultaneously training the multitask model on English and Italian data.

- Affiliate - Website Builder 250x250

The team’s system maps queries relating to a product and product description into the same region of a representational space regardless of the language, principally to help the model generalize what it learns in one language to other languages. For example, the searches “school shoes boys” and “scarpe ragazzo” end up near each other in one region of the space, while the product names “Kickers Kick Lo Vel Kids’ School Shoes – Black” and “Kickers Kick Lo Infants Bambino Scarpe Nero” end up close in a different region.

The system ingests two inputs — a query and a product title — and it outputs a single bit, indicating whether the product matches the query or not. An encoder component taps Google’s Transformer architecture, which the researchers say scales better than alternative architectures, while the model’s classifier combines query and product encodings.

The team trained the system by picking one of its input languages at random and “teaching” it it to classify query-product pairs in just that language. Then, they trained it end to end over a series of epochs — complete presentation of the data set — on annotated sample queries in each of its input languages. An alignment phase ensured that the outputs tailored to different languages shared a representational space by minimizing the distance between encodings of product titles and queries.

Amazon says that in experiments involving 10 different bilingual models (five models, each of which was paired with the other four), 10 trilingual models, and one pentalingual model, they achieved “strong results” in as few as 15 or 20 epochs. According to F1 score, a common performance measure in AI that factors in false-positive and false-negative rates, a multilingual model trained on both French and German outperformed a monolingual French model by 11% and a monolingual German model by 5%. Separately, a model trained on five languages (including French and German) outperformed the French model by 24% and the German model by 19%.

“The results suggest that multilingual models should deliver more consistently satisfying shopping results to our customers,” said Rao. “In ongoing work, we are continuing to explore the power of multitask learning to improve our customers’ shopping experiences.”

Credit: https://wordpress.com/read/blogs/126020344/posts/2570287

- Advertisement -

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

advertisement

Must Read

video

Making the Web Accessible

Strategies, standards, and supporting resources to help you make the Web more accessible to people with disabilities. Source and...

The Rise of AI

History of AI Credit: https://earlybirdz.co.in/2020/02/08/the-rise-of-ai/ Arguably, Artificial intelligence or AI debuted at a conference at...

Math in Data analytics

Credit Author: Vaish https://myworldofelectronics.wordpress.com/2020/02/05/math-in-data-analytics/ Digital data is growing at a very rapid rate, and changing the way we live....

Python Training: Intro to Python

# this just gets the notebook to print all the output from IPython.core.interactiveshell import InteractiveShell InteractiveShell.ast_node_interactivity = "all" Credit Author: Andrew...

Echo Show devices can now add items to your shopping list by barcode

If you manage your grocery list using Amazon’s Alexa, good news: It just became easier to add items in need of restocking....

Interested in learning more? Check out this selection of books.

advertisement