Title:
Word Encoding Device, Analysis Device, Language Model Learning Device, Method, and Program
Document Type and Number:
Japanese Patent JP7143677
Kind Code:
B2
Abstract:
Problem: to generate a vector representing a word that takes into account the importance of the character strings within the word. Solution: a character N-gram vector acquisition unit acquires a vector representing each character N-gram included in a word. A weight calculation unit 40 calculates a weight for each of these vectors on the basis of the vectors representing the character N-grams included in the word. A character N-gram vector combining unit 42 then generates a composite vector by combining the vectors representing the character N-grams according to the weights calculated for them.
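
The three units described in the abstract (acquisition, weight calculation, combination) can be illustrated with a minimal Python sketch. Everything below is an assumption made for illustration and is not taken from the patent text: the boundary markers `^` and `$`, the pseudo-random `embed` lookup standing in for a learned embedding table, the `scoring_vector` parameter, and the choice of a softmax over dot-product scores as the weighting function. The abstract only states that the weights are computed from the N-gram vectors themselves and that the composite vector is built from the vectors and their weights.

```python
import math
import random


def char_ngrams(word, n):
    """Return the character N-grams of a word, with assumed boundary markers."""
    padded = "^" + word + "$"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def embed(ngram, dim=4):
    """Hypothetical stand-in for a learned embedding table.

    Seeding with the N-gram string makes the vector reproducible.
    """
    rng = random.Random(ngram)
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def encode_word(word, n=3, dim=4, scoring_vector=None):
    """Combine character N-gram vectors into one word vector.

    Assumption: each weight is a softmax over the dot product of the
    N-gram vector with a (hypothetical) scoring vector.
    """
    if scoring_vector is None:
        scoring_vector = [1.0] * dim
    ngrams = char_ngrams(word, n)
    if not ngrams:
        raise ValueError("word is too short to contain any N-gram")
    # Acquisition: one vector per character N-gram.
    vectors = [embed(g, dim) for g in ngrams]
    # Weight calculation: weights depend only on the N-gram vectors.
    scores = [sum(q * x for q, x in zip(scoring_vector, v)) for v in vectors]
    weights = softmax(scores)
    # Combination: weighted sum of the N-gram vectors.
    composite = [
        sum(w * v[d] for w, v in zip(weights, vectors)) for d in range(dim)
    ]
    return ngrams, weights, composite


ngrams, weights, composite = encode_word("cat")
print(ngrams)  # the N-grams extracted from "^cat$"
print(weights)
print(composite)
```

In this sketch the weights are scalars, one per N-gram, so N-grams with higher scores contribute more to the composite vector. A per-dimension weighting would follow the same pattern with one softmax per vector component.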

Inventors:
Sho Takase
Masaaki Nagata
Application Number:
JP2018157473A
Publication Date:
September 29, 2022
Filing Date:
August 24, 2018
Assignee:
Nippon Telegraph and Telephone Corporation
International Classes:
G06F40/30; G06N3/04
Foreign References:
WO2018081089A1
Other References:
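Hiroyuki Shindo, Yuji Matsumoto, "Analysis of Multiword Expressions Using Convolutional Neural Networks," IPSJ SIG Technical Report, Natural Language Processing (NL) 2015-NL-223 [online], Japan, Information Processing Society of Japan, September 20, 2015, pp. 1-7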
Attorney, Agent or Firm:
Patent Service Corporation Taiyo International Patent Office