Title:
WEB PAGE CRAWLING CONFIGURATION METHOD, APPLICATION SERVER AND COMPUTER READABLE STORAGE MEDIUM
Document Type and Number:
WIPO Patent Application WO/2019/153603
Kind Code:
A1
Abstract:
The present application discloses a web page crawling configuration method, an application server, and a computer readable storage medium. The method comprises: receiving a website to be crawled, as input by a user; configuring the type of information to be extracted; configuring a crawling task processing node; following a link on a web page of the website to reach the crawling task processing node; and extracting the corresponding information at the crawling task processing node according to the configured information type. The method, application server, and storage medium allow flexible control of the crawling depth and classify data during web page crawling, improving the overall effectiveness of data crawling and of the use of the crawled data.
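
The steps listed in the abstract can be illustrated with a minimal sketch. The abstract does not specify any data structures or APIs, so everything here is an assumption made for illustration: the names `NodeConfig`, `CrawlConfig`, `crawl`, and `fake_fetch` are hypothetical, link routing and extraction use plain regular expressions, and an in-memory dictionary stands in for real HTTP fetching.

```python
import re
from dataclasses import dataclass

@dataclass
class NodeConfig:
    """Hypothetical 'crawling task processing node' configuration."""
    name: str
    link_pattern: str  # regex a link must match to be routed to this node
    extract: dict      # information type -> regex with one capture group

@dataclass
class CrawlConfig:
    """Hypothetical top-level configuration: start website plus nodes."""
    start_url: str
    nodes: list

HREF_RE = re.compile(r'href="([^"]+)"')

def crawl(config, fetch):
    """Follow links on the start page to matching nodes and extract data."""
    results = []
    start_html = fetch(config.start_url)
    for link in HREF_RE.findall(start_html):
        for node in config.nodes:
            if re.search(node.link_pattern, link):
                html = fetch(link)
                record = {"node": node.name, "url": link}
                # Extract each configured information type on this node.
                for info_type, pattern in node.extract.items():
                    match = re.search(pattern, html)
                    record[info_type] = match.group(1) if match else None
                results.append(record)
                break  # a link is handled by the first matching node only
    return results

# In-memory pages standing in for a real website (assumed test data).
PAGES = {
    "http://example.test/": (
        '<a href="http://example.test/news/1">News</a> '
        '<a href="http://example.test/about">About</a>'
    ),
    "http://example.test/news/1": (
        '<h1>Hello</h1><span class="date">2018-06-03</span>'
    ),
    "http://example.test/about": "<h1>About</h1>",
}

def fake_fetch(url):
    """Return the stored HTML for a URL instead of making a request."""
    return PAGES[url]

CONFIG = CrawlConfig(
    start_url="http://example.test/",
    nodes=[
        NodeConfig(
            name="news",
            link_pattern=r"/news/\d+",
            extract={
                "title": r"<h1>(.*?)</h1>",
                "date": r'class="date">(.*?)<',
            },
        )
    ],
)

results = crawl(CONFIG, fake_fetch)
```

In this sketch only links that match a configured node are followed, so the "about" page is never fetched. Tagging each record with its node name is one simple way to keep crawled data classified by type; deeper crawls would need nodes that themselves route onward to further nodes, which this one-hop example does not attempt.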

Inventors:
CAI JUN (CN)
Application Number:
PCT/CN2018/089706
Publication Date:
August 15, 2019
Filing Date:
June 03, 2018
Assignee:
PING AN TECH SHENZHEN CO LTD (CN)
International Classes:
G06F17/30
Foreign References:
CN107622125A    2018-01-23
CN103970788A    2014-08-06
Attorney, Agent or Firm:
SHENZHEN WORLD INTELLECTUAL PROPERTY AGENCY (GENERAL PARTNERSHIP) (CN)