Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
MACHINE LEARNING APPROACH TO CROSS-LANGUAGE TRANSLATION AND SEARCH
Document Type and Number:
WIPO Patent Application WO/2020/144491
Kind Code:
A3
Abstract:
Techniques are disclosed relating to implementing a machine learning approach to cross-language translation and search. In certain embodiments, a method may include receiving a plurality of characters of a first language that are unsegmented and grouping the plurality of character into multiple groups. The method also includes determining a set of word tokens based on one or more transliterations of the multiple groups and one or more translations of the multiple groups to a second language. Further, the method includes generating one or more word token solution sets by querying an index file using the one or more word tokens. The method also includes determining whether the index file references an entity name corresponding to the plurality of characters of the first language based on comparing the one or more token solution sets with the index file.

Inventors:
UPADHYAY RUSHIK (US)
LAKSHMIPATHY DHAMODHARAN (US)
RAMESH NANDHINI (US)
KAULAGI ADITYA (US)
Application Number:
PCT/IB2019/001453
Publication Date:
September 17, 2020
Filing Date:
December 26, 2019
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
PAYPAL INC (US)
International Classes:
G06F40/53; G06F40/55; G06F40/58
Foreign References:
US20130041647A12013-02-14
US20110137636A12011-06-09
US20090150370A12009-06-11
US20030074185A12003-04-17
US20160162545A12016-06-09
Other References:
CHEN ET AL.: "Combining multiple sources for short query translation in Chinese -English cross-language information retrieval", IRAL '00: PROCEEDINGS OF THE FIFTH INTERNATIONAL WORKSHOP ON ON INFORMATION RETRIEVAL WITH ASIAN LANGUAGES, 2000, XP058310829, Retrieved from the Internet [retrieved on 20200717]
Attorney, Agent or Firm:
CHEN, Tom (US)
Download PDF: