Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
FULL-TEXT FUZZY RETRIEVAL METHOD FOR SIMILAR CHINESE CHARACTERS IN CIPHERTEXT DOMAIN
Document Type and Number:
WIPO Patent Application WO/2019/153813
Kind Code:
A1
Abstract:
A full-text fuzzy retrieval method for similar Chinese characters in a ciphertext domain. The method realizes a fuzzy search on a Chinese ciphertext domain on the basis of a symmetric searchable encryption scheme and an inverted index structure, supports a fuzzy search on Chinese characters having similar glyphs in ciphertext status, ensures that the searching result is ordered, and supports a multi-keyword logical connection fuzzy search. According to the method, a distributed searching engine Lucene and a Chinese word segmenter IKAnalyzer are used for full-text word segmentation on a document, and a plaintext inverted index comprising similar Chinese characters is constructed by means of the established similar character library of 3,755 commonly used Chinese characters. Considering the security of the inverted index structure, all the keywords in the plaintext inverted index and document numbers corresponding thereto are constructed in an encrypted chain form, and a B+ tree structure is used for accelerating searching. The method realizes a full-text fuzzy search on a Chinese ciphertext domain in a semi-trusted cloud server without false detection and missed detection.

Inventors:
TANG SHAOHUA (CN)
ZHAO BOWEN (CN)
WU YIMING (CN)
Application Number:
PCT/CN2018/114868
Publication Date:
August 15, 2019
Filing Date:
November 09, 2018
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV SOUTH CHINA TECH (CN)
International Classes:
G06F17/27
Foreign References:
CN108334612A2018-07-27
CN103955537A2014-07-30
CN106997384A2017-08-01
US5706497A1998-01-06
Other References:
See also references of EP 3674928A4
Attorney, Agent or Firm:
GUANGZHOU HUAXUE INTELLECTUAL PROPERTY AGENCY CO., LTD. (CN)
Download PDF: