


Title:
NATURAL-LANGUAGE PROCESSING SYSTEM USING A LARGE CORPUS
Document Type and Number:
WIPO Patent Application WO2001071448
Kind Code:
A3
Abstract:
A computer parsing system that uses vectors (lists) to represent natural-language elements, providing a robust, distributed way to score the grammaticality of an input string, with a large corpus of natural-language text as the source material. The system forms the vectors by recombining asymmetric associations of syntactically similar strings. It uses equivalence lists for subparts of the string to build equivalence lists for longer strings, in an order controlled by the potential parse being scored. The power of recombining vector elements when building longer strings provides a means of representing collocational complexity. Grammaticality scoring is based upon the number and similarity of the vector elements.
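
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: it assumes that two words are "syntactically similar" when they share at least one (left neighbour, right neighbour) context in the corpus, which is a symmetric simplification of the asymmetric associations the abstract describes, and it scores only two-word strings by counting how many recombinations of the two equivalence lists the corpus attests. The names build_contexts, equivalents, and pair_score are hypothetical and chosen for illustration only.

```python
from collections import defaultdict
from itertools import product


def build_contexts(sentences):
    """Collect, for each word, the (left, right) neighbour pairs it occurs in,
    plus the set of all adjacent word pairs (bigrams) seen in the corpus."""
    contexts = defaultdict(set)
    bigrams = set()
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        for i in range(1, len(tokens) - 1):
            contexts[tokens[i]].add((tokens[i - 1], tokens[i + 1]))
        bigrams.update(zip(tokens, tokens[1:]))
    return contexts, bigrams


def equivalents(word, contexts):
    """Assumed equivalence list: every corpus word that shares at least one
    context with `word`. An unseen word yields an empty list."""
    own = contexts.get(word, set())
    return {w for w, ctx in contexts.items() if ctx & own}


def pair_score(left, right, contexts, bigrams):
    """Recombine the two equivalence lists and count the pairs that the
    corpus actually contains. A higher count suggests a more plausible string."""
    candidates = product(equivalents(left, contexts), equivalents(right, contexts))
    return sum(1 for pair in candidates if pair in bigrams)


corpus = ["the cat sleeps", "the dog sleeps", "a cat runs", "a dog runs"]
contexts, bigrams = build_contexts(corpus)
print(pair_score("the", "cat", contexts, bigrams))  # well-formed order
print(pair_score("cat", "the", contexts, bigrams))  # reversed order
```

In this toy corpus the well-formed order scores higher than the reversed one, because more recombinations of the equivalence lists are attested. Extending the sketch to longer strings would mean building an equivalence list for each scored pair and combining it with the next element in the order the candidate parse dictates, which is left out here.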

Inventors:
FREEMAN ROBERT J (NZ)
Application Number:
PCT/IB2001/000678
Publication Date:
April 11, 2002
Filing Date:
March 20, 2001
Assignee:
FREEMAN ROBERT J (NZ)
International Classes:
G06F17/27; (IPC1-7): G06F17/27
Foreign References:
US5418717A  1995-05-23
US5477450A  1995-12-19
US5510981A  1996-04-23
US5680511A  1997-10-21
US5963940A  1999-10-05