Title:
OBJECT-LEVEL IDENTIFICATION OF DUPLICATE DATA IN A STORAGE SYSTEM
Document Type and Number:
WIPO Patent Application WO/2012/173859
Kind Code:
A3
Abstract:
The technique introduced here includes a system and method for identification of duplicate data directly at a data-object level. The technique illustratively utilizes a hierarchical tree of fingerprints for each data object to compare data objects and identify duplicate data blocks referenced by the data objects. The hierarchical fingerprint trees are constructed in such a manner that a top-level fingerprint (or object-level fingerprint) of the hierarchical tree is representative of all data blocks referenced by the data object. In embodiments, inline techniques are utilized to generate hierarchical fingerprints for new data objects as they are created, and an object-level fingerprint of the new data object is compared against preexisting object-level fingerprints in the storage system to identify exact or close matches. While exact matches result in complete deduplication of data blocks referenced by the data object, hierarchical comparison methods are used for identifying and mapping duplicate data blocks referenced by closely related data objects.
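The comparison described in the abstract can be illustrated with a minimal sketch. It is not the claimed implementation: the SHA-256 hash, the fan-out of 2, and the names `build_tree`, `index_tree`, and `duplicate_blocks` are all assumptions made for illustration. The sketch builds a fingerprint tree per data object, checks the object-level fingerprint first, and descends into child fingerprints only where a match is not found.

```python
import hashlib

FANOUT = 2  # assumption: number of child fingerprints folded into each parent


def fingerprint(data: bytes) -> str:
    # Assumption: SHA-256 stands in for whatever fingerprint function is used.
    return hashlib.sha256(data).hexdigest()


def build_tree(blocks, fanout=FANOUT):
    """Return fingerprint levels, leaves first; the last level holds the
    single object-level fingerprint."""
    if not blocks:
        raise ValueError("a data object needs at least one block")
    level = [fingerprint(b) for b in blocks]
    levels = [level]
    while len(level) > 1:
        level = [
            fingerprint("".join(level[i:i + fanout]).encode())
            for i in range(0, len(level), fanout)
        ]
        levels.append(level)
    return levels


def index_tree(levels):
    """Collect (depth, fingerprint) pairs of an existing object for lookup."""
    return {(d, fp) for d, level in enumerate(levels) for fp in level}


def duplicate_blocks(new_levels, known, fanout=FANOUT):
    """Return indices of blocks in the new object already covered by a
    known fingerprint, walking the tree from the top down."""
    n = len(new_levels[0])
    dupes = []
    stack = [(len(new_levels) - 1, 0)]  # start at the object-level fingerprint
    while stack:
        depth, i = stack.pop()
        if (depth, new_levels[depth][i]) in known:
            # Whole subtree matches: every block under it is a duplicate.
            span = fanout ** depth
            dupes.extend(range(i * span, min((i + 1) * span, n)))
        elif depth > 0:
            # No match at this level: compare the child fingerprints instead.
            first = i * fanout
            last = min(first + fanout, len(new_levels[depth - 1]))
            stack.extend((depth - 1, c) for c in range(first, last))
    return sorted(dupes)


old = [b"A", b"B", b"C", b"D"]
new = [b"A", b"B", b"X", b"D"]  # closely related object: one block differs
known = index_tree(build_tree(old))
print(duplicate_blocks(build_tree(new), known))
print(duplicate_blocks(build_tree(old), known))
```

In this sketch an exact object-level match marks every block as a duplicate after a single lookup, while a close match costs only as many comparisons as it takes to isolate the differing subtrees.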
Inventors:
YASA GIRIDHAR APPAJI NAG (IN)
CHANDRASEKARASASTRY NAGESH PANYAM (IN)
Application Number:
PCT/US2012/041301
Publication Date:
April 25, 2013
Filing Date:
June 07, 2012
Assignee:
NETAPP INC (US)
YASA GIRIDHAR APPAJI NAG (IN)
CHANDRASEKARASASTRY NAGESH PANYAM (IN)
International Classes:
G06F12/00; G06F11/00; G06F15/16
Foreign References:
KR100985169B1 | 2010-10-05
US20100088296A1 | 2010-04-08
US20110016152A1 | 2011-01-20
US7739317B2 | 2010-06-15
US7062493B1 | 2006-06-13
Other References:
See also references of EP 2721495A4
Attorney, Agent or Firm:
BECKER, Jordan M. et al. (P.O. Box 1208, Seattle, Washington, US)