Title:
METHOD AND SYSTEM FOR QUERY REFORMATION
Document Type and Number:
WIPO Patent Application WO/2002/027563
Kind Code:
A1
Abstract:
A method (and system) (100) for converting a keyword based search engine (103) coupled to an information source (124) into a natural language enhanced search engine (119). The method includes determining expression based syntax of the keyword (101) based search engine (103). The method then couples a natural language based search engine to the keyword based search engine based upon the expression based syntax by linking the natural language based search engine to the keyword based search engine.
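
The coupling described in the abstract can be pictured roughly as follows. This is only a minimal sketch under stated assumptions, not the applicants' implementation: `ExpressionSyntax`, `FILLER`, `to_keyword_expression`, and `NaturalLanguageFrontEnd` are hypothetical names, the filler list is invented for illustration, and the "expression based syntax" is reduced to a single AND operator.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class ExpressionSyntax:
    """Hypothetical description of a keyword engine's expression syntax."""
    and_op: str = "AND"


# Illustrative filler list only (an assumption, not taken from the application).
FILLER = {"i", "am", "looking", "for", "a", "an", "the",
          "about", "please", "find", "me", "some"}


def to_keyword_expression(natural_query: str, syntax: ExpressionSyntax) -> str:
    """Drop filler words and join the rest with the engine's AND operator."""
    words = [w.strip(".,?!").lower() for w in natural_query.split()]
    kept = [w for w in words if w and w not in FILLER]
    return f" {syntax.and_op} ".join(kept)


class NaturalLanguageFrontEnd:
    """Wraps an existing keyword search callable; the engine itself is unchanged."""

    def __init__(self, keyword_search: Callable[[str], List[str]],
                 syntax: ExpressionSyntax) -> None:
        self.keyword_search = keyword_search
        self.syntax = syntax

    def search(self, natural_query: str) -> List[str]:
        # Translate the natural language query, then delegate to the engine.
        expression = to_keyword_expression(natural_query, self.syntax)
        return self.keyword_search(expression)
```

In this sketch, only the `ExpressionSyntax` value would need to change to target a keyword engine that uses a different operator spelling, which is one plausible reading of coupling "based upon the expression based syntax".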

Inventors:
PUSTEJOVSKY JAMES D
ONEIL JOHN H
Application Number:
PCT/US2001/042165
Publication Date:
April 04, 2002
Filing Date:
September 14, 2001
Assignee:
LINGOMOTORS INC (US)
International Classes:
G06F17/27; G06F17/30; G06F40/00; G06Q10/00; H04M3/493; (IPC1-7): G06F17/30; G06F17/27
Foreign References:
US6263335B12001-07-17
US5963940A1999-10-05
US5953718A1999-09-14
US6026388A2000-02-15
US6246977B12001-06-12
Attorney, Agent or Firm:
Boucher, Patrick (Two Embarcadero Center Eighth Floor, San Francisco CA, US)
Claims:
WHAT IS CLAIMED IS:
1. A method for searching information using a reformulated query expression, the method comprising: entering a query in a form of a natural language expression, the query comprising a plurality of terms; converting the query by identifying one or more interesting terms using semantic and syntactic information for one or more of the terms of the query to derive only interesting terms; and searching an information source of information based upon the interesting terms.
2. The method of claim 1 wherein converting the query comprises converting the query with a type system.
3. The method of claim 1 wherein converting the query comprises using logical expressions to identify one or more noninteresting terms.
4. The method of claim 1 wherein converting the query identifies one or more noninteresting terms, the one or more noninteresting terms being one or more context dependent stop words, each such stop word being defined as a term that is free from processing in subsequent processing operations.
5. The method of claim 4 wherein the one or more stop words is provided using respective one or more logical expressions of the one or more stop words.
6. The method of claim 1 wherein the information source is a database.
7. The method of claim 6 wherein the information source is selected from book information, financial information, news information, email information, legal information, and consumer information.
8. The method of claim 1 wherein the interesting terms are defined as those terms relevant from the query in an index for a specific domain.
9. The method of claim 1 wherein the steps are provided on a networked computer system.
10. A method for forming an enhanced query, the method comprising: entering a query in a form of a natural language expression, the query comprising a plurality of terms; converting the query into a logical form based upon semantic and syntactic information for each of the terms; reformulating the query in the first logical form into an enhanced query based upon one or more fields in a database; and querying a source of information based upon the reformulated query.
11. The method of claim 10 wherein entering the query is provided on a client device.
12. The method of claim 10 wherein converting the query and reformulating the query are provided on a server device.
13. The method of claim 10 wherein converting the query, reformulating the query, and querying the source of information are provided on a server device.
14. The method of claim 10 wherein reformulating the query comprises filtering the query to ignore nonessential terms.
15. The method of claim 10 wherein reformulating the query comprises expanding one or more terms in the query using a type system.
16. The method of claim 10 wherein reformulating the query comprises identifying field terms in the query.
17. A method for operating a searching method by a user, the method comprising: entering a query in a form of a natural language expression, the query comprising a plurality of terms; converting the query into a logical form based upon semantic and syntactic information for one or more of the terms; reformulating the query in the logical form into an enhanced query based upon one or more fields in a database; querying a source of information based upon the reformulated query; and repeating entering, converting, reformulating, and querying for one or more other queries without permanently storing all of the enhanced queries into memory.
18. A system for forming an enhanced query, the system comprising: a receiving module for receiving a query in a form of a natural language expression, the query comprising a plurality of terms; a natural language engine for converting the query into a logical form based upon semantic and syntactic information for each of the terms; and a reformulating module for reformulating the query from the first logical form into an enhanced query based upon one or more fields in a database.
19. A system for forming query reformulation, the system comprising: a receiving module for receiving a query in a form of a natural language expression in a logical form; a query reformulation engine coupled to the receiving module, the query reformulation engine being adapted to receive the natural language expression in the logical form and to form a reformulated query from the natural language expression; and an information retrieval engine coupled to the query reformulation engine to receive the reformulated query, the reformulated query being adapted to be received by the information retrieval engine by the query reformulation engine.
20. The system of claim 19 wherein the query reformulation module comprises a normalization module to normalize the reformulated query to be compatible with the information retrieval engine.
21. A method for retrieving information from an information store, comprising: receiving a user query comprising plural terms; identifying zero or more noninteresting terms based on semantic and syntactic relationships among said terms; and producing a request to access information contained in said information store, said request comprising said terms exclusive of said noninteresting terms, including expressing said request in a language used to access information from said information store.
22. The method of claim 21 wherein said user query is a natural language query.
23. The method of claim 21 wherein said user query is in logical form.
24. The method of claim 21 further including expanding said terms exclusive of said noninteresting terms.
25. The method of claim 21 further including associating one or more of said terms with one or more fields defined in said information store, wherein said producing a request includes incorporating said one or more of said terms into said request.
26. The method of claim 21 wherein identifying zero or more noninteresting terms includes associating a plurality of types with groups of said terms based on said semantic and syntactic relationships, each group comprising a subset of said terms, said identifying being based on said types.
27. The method of claim 21 wherein identifying zero or more noninteresting terms is based on stop words.
28. The method of claim 27 wherein said stop words are identified based on pattern recognition.
29. The method of claim 27 wherein said stop words are identified using tree transduction.
30. The method of claim 21 wherein said information store is a database and said language is an appropriate database language for accessing said information.
31. The method of claim 30 wherein said database language includes operations for reading and updating said information, and inserting new information.
32. A method for retrieving information from an information store, comprising: receiving a user query comprising plural terms; associating one or more of said terms with one or more fields defined in said information store; and producing a search request using a search language suitable for accessing said information store, said search request including said one or more of said terms for targeting said one or more fields.
33. The method of claim 32 wherein said producing includes generating search terms for said one or more fields using said one or more of said terms.
34. The method of claim 32 wherein said user query is a natural language query.
35. The method of claim 32 wherein said user query is in logical form.
36. The method of claim 32 further including identifying zero or more noninteresting terms based on semantic and syntactic relationships among said terms; said request being exclusive of said noninteresting terms.
37. The method of claim 36 wherein said identifying is based on stop words.
38. The method of claim 37 wherein said stop words are identified based on pattern recognition.
39. The method of claim 37 wherein said stop words are identified using tree transduction.
40. The method of claim 36 further including expanding said terms exclusive of said noninteresting terms.
41. The method of claim 32 further including selecting a subset of said terms based on semantic and syntactic relationships among said terms, said request including one or more terms contained in said subset.
42. A method for retrieving information from a database, comprising: receiving a natural language query comprising plural terms; converting said natural language query to logical form; identifying noninteresting terms; and reformulating said logical form to produce an enhanced query in terms of the query language of said database, said enhanced query being exclusive of said noninteresting terms, said reformulating including identifying said terms that are associated with a plurality of predefined database fields contained in said database and database field-filling said associated terms.
43. The method of claim 42 wherein said identifying noninteresting terms is based on the context in which said terms occur.
44. The method of claim 42 wherein said identifying noninteresting terms includes pattern matching.
45. A method for converting a keyword based search engine coupled to an information source into a natural language enhanced search engine, the method comprising: determining expression based syntax of the keyword based search engine; and coupling a natural language based search engine to the keyword based search engine based upon the expression based syntax by linking the natural language based search engine to the keyword based search engine.
46. The method of claim 45 wherein the expression based syntax is selected from a Boolean logic based rule, a not to exceed rule, and a within a number of characters rule.
47. The method of claim 45 further comprising determining a corpus of a database coupled to the keyword based search engine.
48. The method of claim 45 further comprising determining one or more database fields in the database and coupling the one or more database fields into the natural language based search engine to target a natural language query to the one or more of the database fields.
49. The method of claim 45 wherein the natural language based search engine uses semantic and syntax information of one or more of the terms of the natural language query.
50. The method of claim 45 further comprising training the natural language based search engine with a corpus of the information source.
51. The method of claim 45 further comprising identifying selected noninteresting terms.
52. The method of claim 45 wherein the natural language based search engine comprises a query reformulation module.
53. The method of claim 52 wherein the query reformulation module comprises a normalization module to provide the expression based syntax.
54. The method of claim 45 further comprising expanding a size of a text box for a graphical user interface coupled to the natural language based search engine.
55. A method for converting an information retrieval search engine coupled to an information source into a natural language enhanced search engine, the method comprising: determining an expression based syntax of the information retrieval search engine, the information retrieval system comprising a graphical user interface coupled to a client device; and coupling a query reformulation module to the information retrieval search engine, the query reformulation module being adapted to couple a natural language engine to the information retrieval search engine.
56. A system for forming query reformulation, the system comprising: a receiving module for receiving a query in a form of a natural language expression in a logical form; a query reformulation engine coupled to the receiving module, the query reformulation engine being adapted to receive the natural language expression in the logical form and to form a reformulated query from the natural language expression; and a keyword based search engine coupled to the query reformulation engine to receive the reformulated query.
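
Several of the claims above recite dropping noninteresting (context dependent stop word) terms and associating the remaining terms with database fields. The following is only an illustrative sketch of that general idea, assuming a simple cue-word heuristic in place of the semantic and syntactic analysis the claims recite. The word lists, the field names (`author`, `subject`, `title`), the `field:"value"` output syntax, and the function `reformulate` are all hypothetical and do not come from the application.

```python
import re
from typing import Dict, List

# Hypothetical cue words that signal which database field the next terms target.
FIELD_CUES = {"by": "author", "about": "subject", "titled": "title"}

# Words treated as noninteresting in every domain (illustrative list only).
GENERAL_STOP_WORDS = {"a", "an", "the", "i", "want", "find", "me",
                      "please", "any", "some"}

# Context dependent stop words: noninteresting only within a given domain.
DOMAIN_STOP_WORDS = {"books": {"book", "books", "novel", "novels"}}


def reformulate(query: str, domain: str = "books") -> str:
    """Drop noninteresting terms and group the rest under targeted fields."""
    stop = GENERAL_STOP_WORDS | DOMAIN_STOP_WORDS.get(domain, set())
    fields: Dict[str, List[str]] = {}
    free_terms: List[str] = []
    current_field = None
    for token in re.findall(r"[A-Za-z0-9']+", query):
        lowered = token.lower()
        if lowered in FIELD_CUES:
            # A cue word switches the field that following terms fill.
            current_field = FIELD_CUES[lowered]
        elif lowered in stop:
            continue
        elif current_field is not None:
            fields.setdefault(current_field, []).append(token)
        else:
            free_terms.append(token)
    clauses = []
    for name, values in fields.items():
        joined = " ".join(values)
        clauses.append(f'{name}:"{joined}"')
    clauses.extend(free_terms)
    return " AND ".join(clauses)
```

With these assumed word lists, `reformulate("Find me books by Herman Melville about whales")` yields `author:"Herman Melville" AND subject:"whales"`. The same word "books" is kept as a search term when `domain="news"`, which is the sense in which the stop words here are context dependent.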