Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
A keyword expansion method, a system, the classification corpus notes method, and a system
Document Type and Number:
Japanese Patent JP6231668
Kind Code:
B2
Abstract:
This invention provides a keyword expansion method and system. The method comprises searching with a predetermined initial keyword to obtain current keywords used as a basis of a next search, performing loop search through keyword iteration; if a keyword error between keywords obtained in the current search and those keywords obtained in a previous search is less than a predetermined threshold, using the keywords obtained in the current search as expanded keywords of the initial keyword. With this method, the problem of manually establishing a thesaurus in the prior art may be solved. This method is a simple, accurate and efficient keyword expansion method. A method and system of automatically annotating a classified corpus is also provided. The method comprises: determining one or more initial core keywords for each class; obtaining expanded keywords for each class through expanding the initial core keywords; searching with the expanded keywords corresponding to a class to select a classified corpus and annotating the classified corpus.

Inventors:
Ye Mao
Turn gee
Shuige Embo
Ray Chao
Gin leaf on
Application Number:
JP2016518124A
Publication Date:
November 15, 2017
Filing Date:
December 05, 2013
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
Peking University Founder Group Company, Limited
Founder Apavi Technology Limited
Peking University
International Classes:
G06F17/30
Domestic Patent References:
JP2008077137A
JP2003058566A
JP2010286888A
JP2012234485A
JP2004029906A
Foreign References:
US20020073079
US20070010804
Other References:
河合 英紀、外4名,ブートストラップ式同位語辞書構築における検索効率の向上,情報処理学会論文誌 論文誌トランザクション 平成20年度(1) [CD-ROM],日本,社団法人情報処理学会,2009年 2月 4日,第1巻,第1号,pp.36-48
Attorney, Agent or Firm:
Atsushi Aoki
Jun Tsuruta
Tomohiro Minamiyama
Endo Riki
Tsuyoshi Miura