Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
Instance classification method
Document Type and Number:
Japanese Patent JP6292322
Kind Code:
B2
Abstract:
A method for classifying a new instance including a text document by using training instances with class including labeled data and zero or more training instances with class including unlabeled data, comprising: estimating a word distribution for each class by using the labeled data and the unlabeled data; estimating a background distribution and a degree of interpolation between the background distribution and the word distribution by using the labeled data and the unlabeled data; calculating two probabilities for that the word generated from the word distribution and the word generated from the background distribution; combining the two probabilities by using the interpolation; combining the resulting probabilities of all words to estimate a document probability for the class that indicates the document is generated from the class; and classifying the new instance as a class for which the document probability is the highest.

Inventors:
Andrade Silva Daniel Georg
Hiroki Mizuguchi
Kai Ishikawa
Application Number:
JP2016571775A
Publication Date:
March 14, 2018
Filing Date:
June 20, 2014
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
NEC
International Classes:
G06F17/30; G06N20/00
Domestic Patent References:
JP2004362584A
JP2010108265A
Foreign References:
US20120078969
Other References:
藤野 昭典,ラベルあり・なしデータの最適な結合に基づくパターン分類,電子情報通信学会技術研究報告,日本,社団法人電子情報通信学会,2005年 2月17日,Vol.104 No.669,19-24ページ
古宮 嘉那子,文書分類のためのNegation Naive Bayes,自然言語処理,日本,言語処理学会,2013年 6月14日,第20巻 第2号,161-182ページ
Attorney, Agent or Firm:
Masahiko Desk
Naoki Shimosaka