Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
METHOD FOR GENERATING TRAINING DATA FOR MACHINE TRANSLATION, METHOD FOR CREATING LEARNABLE MODEL FOR MACHINE TRANSLATION PROCESSING, MACHINE TRANSLATION PROCESSING METHOD, AND DEVICE FOR GENERATING TRAINING DATA FOR MACHINE TRANSLATION
Document Type and Number:
WIPO Patent Application WO/2023/243261
Kind Code:
A1
Abstract:
Provided is a machine translation processing system that can make an accurate machine translation of a text containing a markup language tag for a text to be translated, the machine translation being made while keeping information about the markup language tag without preparing a large number of tagged translations. In a machine translation processing system (1000), a training data generating device (1) performs processing for generating training data, so that a start/end corresponding code is detected in translation data not containing the markup language tag and the detected start/end corresponding code is replaced with an alternative code. Thus, a large amount of data equivalent to translation data with the inserted markup language tag can be easily generated. Moreover, in the machine translation processing system (1000), the translation data acquired by the processing for generating the training data by the training data generating device (1) is used as training data for learning of a machine translation model. Thus, the same effect as learning of the machine translation model can be obtained using the translation data with the markup language tag as training data.

Inventors:
UCHIYAMA MASAO (JP)
Application Number:
PCT/JP2023/017453
Publication Date:
December 21, 2023
Filing Date:
May 09, 2023
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
NAT INST INF & COMM TECH (JP)
International Classes:
G06F40/44
Foreign References:
JP2012185679A2012-09-27
US20100235162A12010-09-16
Other References:
OKADA, KOHEI EL AL.: "Improving translation accuracy of legal summaries by dividing bracket expressions", PROCEEDINGS OF THE 21ST ANNUAL MEETING OF THE ASSOCIATION FOR NATURAL LANGUAGE PROCESSING; MARCH 16TH - 21ST, 2015, vol. 21, 9 March 2015 (2015-03-09) - 21 March 2015 (2015-03-21), pages 541 - 544, XP009551497
Attorney, Agent or Firm:
NAKANISHI Ken et al. (JP)
Download PDF: