


Title:
CHARACTER STRING EXTRACTING METHOD AND ITS SYSTEM
Document Type and Number:
Japanese Patent JPH09138801
Kind Code:
A
Abstract:

To enable generation of a dictionary optimized for a given text by registering words extracted from it, so that the extracted words can be used for syntactic analysis and similar processing of the text.

An appropriate consecutive character string is extracted (3) from a text (1) written in natural language. For each character adjacent to that string, the frequency with which the character appears together with the string is examined (4). A numerical value derived from this co-occurrence frequency gives an objective measure of how likely the character is to be used as a unit with the string. When the frequency is high, the string including the adjacent character is recognized as a single word or phrase. Words extracted in this way are registered in a dictionary and used for syntactic analysis and similar processing of the text.
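
The extension step described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation. The abstract only says that a numerical value corresponding to the co-occurrence frequency is used, so several things here are assumptions made for illustration: the ratio (occurrences with the adjacent character divided by all occurrences of the string), the 0.8 threshold, and the function names `find_positions` and `extend_string`. Selection of the initial string is out of scope, so the seed is supplied by the caller.

```python
from collections import Counter


def find_positions(text, s):
    """Return start indices of all (possibly overlapping) occurrences of s."""
    positions, start = [], text.find(s)
    while start != -1:
        positions.append(start)
        start = text.find(s, start + 1)
    return positions


def extend_string(text, seed, threshold=0.8):
    """Grow seed one adjacent character at a time.

    An adjacent character is absorbed when the share of occurrences of the
    current string that carry it meets the threshold. Both the ratio and
    the threshold value are assumptions, not taken from the patent.
    """
    current = seed
    while True:
        positions = find_positions(text, current)
        if not positions:
            return current  # seed does not occur in the text
        total = len(positions)
        right, left = Counter(), Counter()
        for start in positions:
            end = start + len(current)
            if end < len(text):
                right[text[end]] += 1       # character just after the string
            if start > 0:
                left[text[start - 1]] += 1  # character just before the string
        # Each candidate pairs a co-occurrence ratio with the extended string.
        candidates = [(n / total, current + ch) for ch, n in right.items()]
        candidates += [(n / total, ch + current) for ch, n in left.items()]
        if not candidates:
            return current  # the string already spans the whole text
        ratio, best = max(candidates)
        if ratio < threshold:
            return current  # no neighbor is frequent enough; stop growing
        current = best


sample = ("natural language processing, natural language text, "
          "and natural language.")
print(extend_string(sample, "natur"))  # natural language
```

In this sample, every occurrence of "natur" is followed by the same characters up to "natural language", so the string keeps growing. It stops there because the most frequent neighbor on either side, a space, accompanies only two of the three occurrences, which is below the assumed threshold.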


Inventors:
SHIMOHATA SAYORI
SUGIO TOSHIYUKI
NAGATA JUNJI
Application Number:
JP32117995A
Publication Date:
May 27, 1997
Filing Date:
November 15, 1995
Assignee:
OKI ELECTRIC IND CO LTD
International Classes:
G06F17/21; G06F17/27; G06F17/28; G06F17/30; (IPC1-7): G06F17/27; G06F17/28; G06F17/30
Attorney, Agent or Firm:
Yukio Sato (and 1 other)