Title:
TEXT ERROR CORRECTION METHOD AND SYSTEM, AND DEVICE AND STORAGE MEDIUM
Document Type and Number:
WIPO Patent Application WO/2023/193542
Kind Code:
A1
Abstract:
A text error correction method and system, and a device and a storage medium. The method comprises: segmenting text produced by automatic speech recognition into short sentences (S110); inputting the short sentences into a trained error correction model, which outputs error-corrected short sentences once it has completed error correction of the short sentences (S120); determining a first degree of confusion and a second degree of confusion for the same short sentence (S130); determining the correct text of the short sentence by comparing the first degree of confusion with the second degree of confusion (S140); and sequentially merging the correct text of all the short sentences into the correct text (S150). The error correction model comprises a phoneme extractor (11), a phoneme feature encoder (12), a language feature encoder (13), a feature merging module (14) and a decoder (15), whose parameters are updated synchronously during training. The phoneme extractor (11) acquires phoneme information, which the phoneme feature encoder (12) converts into a phoneme feature; the language feature encoder (13) obtains language features; the feature merging module (14) merges the phoneme feature with the language features to obtain a merged feature; and the decoder (15) decodes the merged feature to perform the error correction. Because the various levels of text processing are integrated in one error correction model, the parameters of those levels are updated synchronously during training, so that an error in an upper-layer structure is corrected during downstream training, thereby avoiding error accumulation.
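
The control flow of steps S110 to S150 can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `split_short_sentences`, `correct_text`, `toy_correct` and `toy_confusion` are hypothetical, and the trained error correction model is replaced by a placeholder callable. The abstract does not state which text each degree of confusion is computed on or how the comparison is decided, so the sketch assumes that the first score belongs to the recognized short sentence, the second to its corrected version, and that the lower score wins.

```python
import re
from typing import Callable, List

# Assumption: short sentences end at common English or Chinese punctuation.
_BOUNDARY = r"(?<=[,.!?;，。！？；])"


def split_short_sentences(text: str) -> List[str]:
    """S110: segment recognized text into short sentences (lossless split)."""
    return [part for part in re.split(_BOUNDARY, text) if part]


def correct_text(
    text: str,
    correct: Callable[[str], str],       # stand-in for the trained model
    confusion: Callable[[str], float],   # stand-in for the confusion score
) -> str:
    chosen = []
    for sentence in split_short_sentences(text):
        candidate = correct(sentence)          # S120
        first = confusion(sentence)            # S130, assumed: original
        second = confusion(candidate)          # S130, assumed: corrected
        # S140, assumed rule: keep whichever version scores lower.
        chosen.append(candidate if second < first else sentence)
    return "".join(chosen)                     # S150


# Toy stand-ins, for illustration only.
VOCAB = {"the", "weather", "is", "nice", "today"}


def toy_correct(sentence: str) -> str:
    return sentence.replace("whether", "weather")


def toy_confusion(sentence: str) -> float:
    words = re.findall(r"[a-z]+", sentence.lower())
    return float(sum(word not in VOCAB for word in words))


if __name__ == "__main__":
    print(correct_text("the whether is nice, today.", toy_correct, toy_confusion))
```

Passing the model and the scoring function as callables keeps the sketch independent of any particular model or scoring method, neither of which the abstract specifies.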

Inventors:
LYU ZHAOBIAO (CN)
XU CHENGCHONG (CN)
LI JIANFENG (CN)
XIAO QING (CN)
ZHOU LIPING (CN)
Application Number:
PCT/CN2023/078708
Publication Date:
October 12, 2023
Filing Date:
February 28, 2023
Assignee:
CHINA UNICOM GUANGDONG IND INTERNET CO LTD (CN)
International Classes:
G10L15/06; G10L15/02; G10L15/26
Domestic Patent References:
WO2018120889A1    2018-07-05
Foreign References:
CN114495910A    2022-05-13
CN113129865A    2021-07-16
CN111523306A    2020-08-11
JP2014077882A    2014-05-01
CN114091437A    2022-02-25
CN114282523A    2022-04-05
Attorney, Agent or Firm:
GUANGZHOU RUNHE INTELLECTUAL PROPERTY AGENCY (GENERAL PARTNERSHIP) (CN)