Deduplication is the process of identifying duplicate files during the discovery process and removing them from further processing and analysis. Deduplication is a necessary step in managing the volume of data that must be analyzed.

A duplicate file is an exact copy of another file. Deduplication is necessary in many situations involving electronic documents because multiple identical documents are a typical feature of large record sets. For example, in electronic discovery sets containing e-mail archives for an organization, it is not uncommon for multiple e-mail accounts to contain the exact same widely distributed e-mail or file attachment.  

CloudNine™ LAW identifies duplicate files by comparing hashes of files. A hash is a numerical representation of a file whose value is based on the file contents or other attributes. In essence, the file is run through a hashing algorithm (not an encryption process) that yields a value that is, for practical purposes, unique to that input. An exact copy of a file will yield the same hash value. In the case of electronic documents, the file is hashed. For e-mail, metadata fields are hashed. You can select the type of hash used in the deduplication settings.
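The hash comparison described above can be illustrated with a minimal Python sketch. This is not LAW's implementation: the function names, the choice of SHA-256 as the default, and the metadata field names are all assumptions made for illustration only.

```python
import hashlib
from pathlib import Path


def file_hash(path: Path, algorithm: str = "sha256") -> str:
    """Hash a file's bytes in chunks so large files need not fit in memory."""
    digest = hashlib.new(algorithm)
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def email_hash(metadata: dict[str, str], fields: tuple[str, ...],
               algorithm: str = "sha256") -> str:
    """Hash selected metadata fields of an e-mail (field names are hypothetical)."""
    digest = hashlib.new(algorithm)
    for name in fields:
        encoded = metadata.get(name, "").encode("utf-8")
        # Length-prefix each value so field boundaries stay unambiguous.
        digest.update(len(encoded).to_bytes(8, "big"))
        digest.update(encoded)
    return digest.hexdigest()


def find_duplicates(hashes: dict[str, str]) -> dict[str, list[str]]:
    """Group item names by hash and keep only groups with more than one item."""
    groups: dict[str, list[str]] = {}
    for name, value in hashes.items():
        groups.setdefault(value, []).append(name)
    return {value: names for value, names in groups.items() if len(names) > 1}
```

Two files with identical bytes produce the same `file_hash` value, and two e-mails with the same values in the selected fields produce the same `email_hash` value regardless of any other metadata they carry. Which fields are selected therefore decides what counts as a duplicate e-mail.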

The scope of the project will determine whether or not deduplication will be performed and which methods will be used.

 

Note: In addition to deduplicating prior to the import process, LAW also allows you to deduplicate at these other times in a pre-discovery workflow:

After the import against other records in the case by using the Deduplication Utility.

After the import against other records in the case and other LAW cases by using Inter-Case Deduplication.
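Conceptually, deduplicating against existing records means checking each record's hash against an index of hashes that have already been seen. The sketch below is a hypothetical illustration of that idea, not a description of how the Deduplication Utility or Inter-Case Deduplication works internally; the function name, record IDs, and hash values are invented for the example.

```python
def mark_duplicates(records: list[tuple[str, str]],
                    known: dict[str, str]) -> dict[str, str]:
    """Map each duplicate record ID to the ID of the record it duplicates.

    records: (record ID, hash) pairs in processing order.
    known:   hash -> record ID for records already seen; updated in place.
    """
    duplicates: dict[str, str] = {}
    for record_id, value in records:
        if value in known:
            # An earlier record has the same hash, so this one is a duplicate.
            duplicates[record_id] = known[value]
        else:
            # First time this hash is seen; later copies will point here.
            known[value] = record_id
    return duplicates
```

Starting with an empty `known` index corresponds to comparing records within a single case. Preloading `known` with hashes from other cases corresponds to comparing across cases.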

 

 

To configure deduplication

 

Need additional help? E-mail the CloudNine™ LAW Technical Support team at lawsupport@cloudnine.com, or call a support representative at 713-462-6464 (Ext. 12 for CloudNine™ LAW, Ext. 13 for CloudNine™ Explore). The Technical Support team is available from 9:00 A.M. to 7:00 P.M. Eastern Time, Monday through Friday.

Copyright © 2024 CloudNine™. All rights reserved.