


Title:
METHOD FOR GENERATING TRAINING DATA FOR MACHINE TRANSLATION, METHOD FOR CREATING LEARNABLE MODEL FOR MACHINE TRANSLATION PROCESSING, MACHINE TRANSLATION PROCESSING METHOD, AND DEVICE FOR GENERATING TRAINING DATA FOR MACHINE TRANSLATION
Document Type and Number:
Japanese Patent JP2023183618
Kind Code:
A
Abstract:
PROBLEM TO BE SOLVED: To provide a machine translation processing system that can accurately translate an original sentence containing markup language tags while preserving the tag information, without preparing a large number of tagged translations.
SOLUTION: In a machine translation processing system 1000, a training data generating device 1 executes training data generation processing that detects start/end corresponding codes in translation data not containing markup language tags and replaces each detected code with an alternative code, thereby easily generating a large amount of data equivalent to translation data with markup language tags inserted. A machine translation processing device 2 uses the translation data obtained by this processing as training data for a machine translation model, producing the same effect as training the model on translation data that contains markup language tags.
SELECTED DRAWING: Figure 1
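
The replacement step described in the abstract can be illustrated with a minimal sketch. It rests on assumptions the abstract does not confirm: that "start/end corresponding codes" are paired characters such as parentheses and quotation marks, and that the "alternative codes" are numbered placeholder tags. The names `PAIRS`, `OPEN_TAG`, `CLOSE_TAG`, `replace_paired_codes`, and `make_training_pair` are hypothetical, as is the rule of dropping a sentence pair when the two sides disagree on the number of replaced pairs.

```python
# Hypothetical start/end character pairs; the abstract does not list the codes detected.
PAIRS = {"(": ")", "[": "]", "“": "”", "（": "）", "「": "」"}

# Hypothetical alternative codes that stand in for markup language tags.
OPEN_TAG = "<t{}>"
CLOSE_TAG = "</t{}>"


def replace_paired_codes(sentence, pairs=PAIRS):
    """Replace properly matched start/end characters with numbered placeholder tags.

    Returns the rewritten sentence and the number of pairs replaced.
    Unmatched or mismatched characters are left untouched.
    """
    closers = {close: open_ for open_, close in pairs.items()}
    chars = list(sentence)
    stack = []    # (index, opening character) of currently open pairs
    matches = []  # (open index, close index) of matched pairs
    for i, ch in enumerate(chars):
        if ch in pairs:
            stack.append((i, ch))
        elif ch in closers and stack and stack[-1][1] == closers[ch]:
            open_idx, _ = stack.pop()
            matches.append((open_idx, i))
    # Number the pairs in order of their opening position.
    matches.sort()
    for n, (open_idx, close_idx) in enumerate(matches, start=1):
        chars[open_idx] = OPEN_TAG.format(n)
        chars[close_idx] = CLOSE_TAG.format(n)
    return "".join(chars), len(matches)


def make_training_pair(source, target):
    """Build one pseudo-tagged sentence pair, or None if the sides disagree.

    Assumption: a pair is only useful when both sides contain the same
    number of replaced start/end pairs.
    """
    new_source, n_source = replace_paired_codes(source)
    new_target, n_target = replace_paired_codes(target)
    if n_source == 0 or n_source != n_target:
        return None
    return new_source, new_target


if __name__ == "__main__":
    print(make_training_pair("See the manual (page 3).", "マニュアル（3ページ）を参照。"))
```

Under these assumptions, ordinary untagged parallel text yields sentence pairs that look tagged on both sides, which is the kind of data the abstract says can substitute for genuinely tagged translations during training.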

Inventors:
UCHIYAMA MASAO
Application Number:
JP2022097221A
Publication Date:
December 28, 2023
Filing Date:
June 16, 2022
Assignee:
NAT INST INF & COMM TECH
International Classes:
G06F40/44; G06F40/221; G06F40/45
Attorney, Agent or Firm:
Ken Nakanishi
Hiroshi Kitahara