Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
METHOD AND APPARATUS FOR DIVIDING RANDOMLY SAMPLED DATA SUB-BLOCKS OF BIG DATA
Document Type and Number:
WIPO Patent Application WO/2019/169619
Kind Code:
A1
Abstract:
A method for dividing randomly sampled data sub-blocks of big data, which is applicable to the technical field of big data processing, and comprises: cutting one big data block to obtain P original data sub-blocks (S101); randomly extracting several pieces of data from each of the original data sub-blocks among the P original data sub-blocks, combining the several pieces of data extracted from each of the original data sub-blocks, and generating one new randomly sampled data sub-block; and repeating the extracting and combining operations for a total of K times to obtain K randomly sampled data sub-blocks (S102). The described division method may ensure that the obtained randomly sampled data sub-blocks are random samples of the entire big data block; and when each randomly sampled data sub-block is obtained, it is not necessary to traverse the whole big data block, thereby greatly improving efficiency.

Inventors:
HUANG ZHEXUE (CN)
HE YULIN (CN)
ZHANG XIAOLIANG (CN)
WEI CHENGHAO (CN)
ZHU HUFEI (CN)
Application Number:
PCT/CN2018/078509
Publication Date:
September 12, 2019
Filing Date:
March 09, 2018
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV SHENZHEN (CN)
International Classes:
G06F17/30
Foreign References:
CN102750309A2012-10-24
CN105303456A2016-02-03
CN103473255A2013-12-25
CN103336844A2013-10-02
CN106021567A2016-10-12
US5613091A1997-03-18
Attorney, Agent or Firm:
HENSEN INTELLECTUAL PROPERTY FIRM (CN)
Download PDF: