Title:
TEXT INFORMATION EXTRACTION METHOD AND APPARATUS, AND COMPUTER DEVICE AND STORAGE MEDIUM
Document Type and Number:
WIPO Patent Application WO/2021/072848
Kind Code:
A1
Abstract:
A text information extraction method and a related device are disclosed. The method comprises: obtaining a first-language labeled corpus set, a first-language unlabeled corpus set, a second-language labeled corpus set, and a second-language unlabeled corpus set from a first-language corpus text and a second-language corpus text; collaboratively training a first-language classifier and a second-language classifier using these corpus sets; using the first-language classifier to classify a first-language target entity pair obtained from a mixed statement; using the second-language classifier to classify a second-language target entity pair obtained from the same mixed statement; and obtaining the entity relationship of a mixed entity pair of the mixed statement from the classification results of the first-language and second-language target entity pairs. The method enables accurate extraction of entity relationships from texts in two different languages.
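
The collaborative training and result-combination steps in the abstract can be illustrated with a minimal sketch. It is not the implementation claimed in the application. It makes several assumptions the abstract does not state: the unlabeled sets are paired (item i in each language describes the same entity pair), each classifier passes its most confident pseudo-labels to the other, and the two classification results are merged by multiplying class probabilities. The names `CentroidClassifier`, `co_train`, and `classify_mixed_pair` are hypothetical, and the nearest-centroid model is a toy stand-in for whatever classifier is actually used.

```python
import math


class CentroidClassifier:
    """Toy per-language relation classifier (hypothetical stand-in)."""

    def fit(self, X, y):
        sums, counts = {}, {}
        for x, label in zip(X, y):
            acc = sums.setdefault(label, [0.0] * len(x))
            for i, v in enumerate(x):
                acc[i] += v
            counts[label] = counts.get(label, 0) + 1
        # One mean feature vector (centroid) per relation label.
        self.centroids = {c: [v / counts[c] for v in s] for c, s in sums.items()}
        return self

    def predict_proba(self, x):
        # Softmax over negative squared distances to each class centroid.
        scores = {c: -sum((a - b) ** 2 for a, b in zip(x, m))
                  for c, m in self.centroids.items()}
        top = max(scores.values())
        exps = {c: math.exp(s - top) for c, s in scores.items()}
        total = sum(exps.values())
        return {c: e / total for c, e in exps.items()}


def co_train(lab1, lab2, unl1, unl2, rounds=3, per_round=1):
    """lab1/lab2: lists of (features, label); unl1/unl2: paired unlabeled features."""
    lab1, lab2 = list(lab1), list(lab2)
    pool = list(zip(unl1, unl2))  # assumption: item i is the same entity pair in both languages
    clf1, clf2 = CentroidClassifier(), CentroidClassifier()
    for _ in range(rounds):
        clf1.fit(*zip(*lab1))
        clf2.fit(*zip(*lab2))
        # Each classifier pseudo-labels its most confident pooled items
        # and hands them to the other language's labeled set.
        for teacher, view, student_set in ((clf1, 0, lab2), (clf2, 1, lab1)):
            scored = []
            for pair in pool:
                probs = teacher.predict_proba(pair[view])
                label = max(probs, key=probs.get)
                scored.append((probs[label], label, pair))
            scored.sort(key=lambda s: s[0], reverse=True)
            for _, label, pair in scored[:per_round]:
                student_set.append((pair[1 - view], label))
                pool.remove(pair)
    # Final fit so pseudo-labels added in the last round are used.
    clf1.fit(*zip(*lab1))
    clf2.fit(*zip(*lab2))
    return clf1, clf2


def classify_mixed_pair(clf1, clf2, x1, x2):
    """Combine both classifiers' results (assumed rule: product of probabilities)."""
    p1, p2 = clf1.predict_proba(x1), clf2.predict_proba(x2)
    joint = {c: p1[c] * p2.get(c, 0.0) for c in p1}
    return max(joint, key=joint.get)
```

Here `x1` and `x2` stand for feature vectors of the first-language and second-language target entity pairs derived from one mixed statement. How those features are built is outside the scope of the sketch.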

Inventors:
YANG DONGYAN (CN)
WANG ZHIHAO (CN)
Application Number:
PCT/CN2019/117231
Publication Date:
April 22, 2021
Filing Date:
November 11, 2019
Assignee:
PING AN TECH SHENZHEN CO LTD (CN)
International Classes:
G06F40/279; G06F40/30
Foreign References:
CN103559181A    2014-02-05
CN109902303A    2019-06-18
US20180189269A1    2018-07-05
US20180314756A1    2018-11-01
Attorney, Agent or Firm:
SHENZHEN SCIENBIZIP INTELLECTUAL PROPERTY AGENCY CO.,LTD. (CN)