Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
INDUSTRY PROFESSIONAL TEXT AUTOMATIC LABELING METHOD AND APPARATUS, TERMINAL, AND STORAGE MEDIUM
Document Type and Number:
WIPO Patent Application WO/2023/178903
Kind Code:
A1
Abstract:
Provided are an industry professional text automatic labeling method and apparatus, a terminal, and a storage medium. A semi-supervised entity recognition algorithm is used, and an external professional knowledge text library is combined, such that the labor cost of text entity labeling is reduced to the maximum extent, and the quality and efficiency of modeling in a text entity recognition process are improved. In addition, a universal entity recognition algorithm is also used, and a specific bag of words high-dimensional vectorization similarity calculation technique is combined, such that the early stage of the text entity recognition process can be carried out automatically, and high-quality entity information extraction can be achieved in a plurality of different professional fields. In addition, on the basis of a data augmentation technique, by means of noise entity feature interpolation and noise statement feature extrapolation techniques, the problem of an insufficient generalization capability of a traditional unsupervised automatic labeling algorithm is mitigated, such that a labeling model can implement semi-supervised automatic labeling on various different industry professional texts.

Inventors:
SHEN HAO (CN)
WU YOU (CN)
Application Number:
PCT/CN2022/109617
Publication Date:
September 28, 2023
Filing Date:
August 02, 2022
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
SHANGHAI FLAGINFO INFORMATION INCORPORATED TECH CO LTD (CN)
International Classes:
G06F40/295
Foreign References:
CN114386424A2022-04-22
CN113065341A2021-07-02
CN112131366A2020-12-25
CN113343695A2021-09-03
US20150205780A12015-07-23
Attorney, Agent or Firm:
BEIJING TRUST OF ZHIGUO IP AGENCY CO., LTD. (CN)
Download PDF: