Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
日本語文字認識誤り訂正方法及び装置、並びに、誤り訂正プログラムを記録した記録媒体
Document Type and Number:
Japanese Patent JP4066507
Kind Code:
B2
Abstract:
PROBLEM TO BE SOLVED: To precisely present a correction candidate for a short word without using the context by retrieving a word in a dictionary which is similar to a partial character string included in a character matrix according to a character mixing probability and a word appearance frequency and presenting a word string in the increasing order of the product of the concurrency probability of the word string and the character mixing probability of the respective characters. SOLUTION: An unknown word candidate generating means 2 generates a pair of the notation and appearance probability of a word in order to identify an unknown word included in the character matrix. Further, a similar word collating means 3 retrieves the word in the dictionary which is similar to the partial character string included in the character matrix in order to generate the correction candidate for a word whose correct characters are not included in the candidate characters according to the character mixing probability and word appearance probability without using the contexts of the precedent and following parts. A morpheme analyzing means 1 outputs only an arbitrary number of word strings among combinations of words in the dictionary included in the character matrix, unknown word candidates, and similar collatated words in the decreasing order of the probability according to a word division model 7.

Inventors:
Masaaki Nagata
Application Number:
JP12761598A
Publication Date:
March 26, 2008
Filing Date:
May 11, 1998
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
Nippon Telegraph and Telephone Corporation
International Classes:
G06K9/62; G06F17/22; G06K9/72; G06F17/27; G06F17/28
Domestic Patent References:
JP8315078A
JP7271921A
JP9288673A
Attorney, Agent or Firm:
Tadahiko Ito