Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
EFFICIENT ANNOTATION OF LARGE SAMPLE GROUP
Document Type and Number:
WIPO Patent Application WO/2018/157410
Kind Code:
A1
Abstract:
A method for annotating a batch of original samples is provided. A first subset of original samples, selected from the batch and determined by minimizing an entropy-mean difference between the first subset and the batch, is used for human annotation to yield human-annotated samples. The human-annotated samples are used as training data to configure an annotation process for annotating an input sample to yield an annotated output sample, and a check process for verifying annotation accuracy of the annotated output sample. Remaining original samples in the batch are processed by the annotation process to yield machine-annotated samples, whose accuracy is verified by the check process. In one embodiment, part of the original samples corresponding to erroneous machine-annotated samples are selected for human annotation. Resultant additional human-annotated samples are used to update the two processes. The remaining original samples not yet annotated are then processed by the two processes.

Inventors:
LIU YANG (CN)
FENG CHAO (CN)
GAN ZHENGMAIRUO (CN)
LEI ZHI BIN (CN)
XIANG YI (CN)
Application Number:
PCT/CN2017/075796
Publication Date:
September 07, 2018
Filing Date:
March 06, 2017
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
HONG KONG APPLIED SCIENCE & TECH RESEARCH INST CO LTD (CN)
International Classes:
G06F17/30; G06N20/00
Foreign References:
CN104462614A2015-03-25
US20160307113A12016-10-20
US20130097103A12013-04-18
Attorney, Agent or Firm:
CHINA PAT INTELLECTUAL PROPERTY OFFICE (CN)
Download PDF: