Title:
OBJECT-LEVEL IDENTIFICATION OF DUPLICATE DATA IN A STORAGE SYSTEM
Document Type and Number:
WIPO Patent Application WO/2012/173859
Kind Code:
A3
Abstract:
The technique introduced here includes a system and method for identification of duplicate data directly at a data-object level. The technique illustratively utilizes a hierarchical tree of fingerprints for each data object to compare data objects and identify duplicate data blocks referenced by the data objects. The hierarchical fingerprint trees are constructed in such a manner that a top-level fingerprint (or object-level fingerprint) of the hierarchical tree is representative of all data blocks referenced by a storage system. In embodiments, inline techniques are utilized to generate hierarchical fingerprints for new data objects as they are created, and an object-level fingerprint of the new data object is compared against preexisting object-level fingerprints in the storage system to identify exact or close matches. While exact matches result in complete deduplication of data blocks referenced by the data object, hierarchical comparison methods are used for identifying and mapping duplicate data blocks referenced by closely related data objects.
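
The hierarchical comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not name a hash function, block size, or tree fan-out, so SHA-256, `BLOCK_SIZE`, `FANOUT`, and the names `Node`, `build_tree`, and `shared_blocks` are all assumptions made for illustration. The sketch also assumes both objects use the same block size and fan-out and that duplicate blocks sit at the same offsets in both objects.

```python
import hashlib
from dataclasses import dataclass, field
from typing import List

BLOCK_SIZE = 4   # assumed, tiny on purpose so the example is easy to follow
FANOUT = 2       # assumed number of children per tree node


@dataclass
class Node:
    fingerprint: str
    children: List["Node"] = field(default_factory=list)
    block_index: int = -1  # set only on leaves


def _fp(data: bytes) -> str:
    # SHA-256 is an assumption; the abstract does not specify the fingerprint function.
    return hashlib.sha256(data).hexdigest()


def build_tree(data: bytes) -> Node:
    """Build a fingerprint tree; the root holds the object-level fingerprint."""
    blocks = [data[i:i + BLOCK_SIZE] for i in range(0, len(data), BLOCK_SIZE)] or [b""]
    level = [Node(_fp(b), block_index=i) for i, b in enumerate(blocks)]
    while len(level) > 1:
        groups = [level[i:i + FANOUT] for i in range(0, len(level), FANOUT)]
        # A parent fingerprint is the hash of its children's fingerprints.
        level = [
            Node(_fp("".join(c.fingerprint for c in g).encode()), children=g)
            for g in groups
        ]
    return level[0]


def leaf_indices(node: Node) -> List[int]:
    """Collect the block indices covered by a subtree."""
    if not node.children:
        return [node.block_index]
    return [i for child in node.children for i in leaf_indices(child)]


def shared_blocks(a: Node, b: Node) -> List[int]:
    """Return block indices of `a` that duplicate the aligned blocks of `b`."""
    if a.fingerprint == b.fingerprint:
        return leaf_indices(a)      # whole subtree matches, no need to descend
    if not a.children or not b.children:
        return []                   # differing leaves: not a duplicate
    matches: List[int] = []
    for child_a, child_b in zip(a.children, b.children):
        matches.extend(shared_blocks(child_a, child_b))
    return matches


old = build_tree(b"AAAABBBBCCCCDDDD")
new = build_tree(b"AAAABBBBCCCCXXXX")
print(old.fingerprint == new.fingerprint)  # False: close match, not exact
print(shared_blocks(new, old))             # [0, 1, 2]
```

When the two root fingerprints are equal, the objects are exact duplicates and no descent is needed. When they differ, the comparison descends only into subtrees whose fingerprints differ, so matching subtrees are mapped as duplicates without visiting their individual blocks.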

Inventors:
YASA GIRIDHAR APPAJI NAG (IN)
CHANDRASEKARASASTRY NAGESH PANYAM (IN)
Application Number:
PCT/US2012/041301
Publication Date:
April 25, 2013
Filing Date:
June 07, 2012
Assignee:
NETAPP INC (US)
YASA GIRIDHAR APPAJI NAG (IN)
CHANDRASEKARASASTRY NAGESH PANYAM (IN)
International Classes:
G06F12/00; G06F11/00; G06F15/16
Foreign References:
KR100985169B1    2010-10-05
US20100088296A1    2010-04-08
US20110016152A1    2011-01-20
US7739317B2    2010-06-15
US7062493B1    2006-06-13
Other References:
See also references of EP 2721495A4
Attorney, Agent or Firm:
BECKER, Jordan M. et al. (P.O. Box 1208, Seattle, Washington, US)