Integrated near-duplicate detection has the potential of reducing review costs by up to 70%.[1]
Industry studies estimate up to 60% of documents in legal reviews may be near-duplicates.[2]
One Decision® Document Review Accelerator from OrangeLT™ simplifies reviews with advanced near-duplicate identification to deliver more consistent, less costly and higher quality results.
Orange Legal Technologies’ proprietary One Decision Document Review Accelerator leverages advanced near-duplicate identification technology to enhance reviewer efficiency by grouping similar documents and allowing them to be considered together during legal document reviews as part of the eDiscovery process. This grouping and ability to propagate coding decisions throughout all near-duplicate documents results in more efficient and economical document reviews. Advantages to One Decision from OrangeLT include:
Reduced review redundancy by focusing reviewers on appropriate content.
Reduced number of documents to be reviewed without sacrificing review quality.
Reduced risk of multiple reviewers reviewing similar documents.
Reduced dollar and hour resources required for a complete review.
Reduced inconsistent tagging decisions on similar documents after first-pass review.
Fully integrated into the OneO® Discovery Platform, One Decision enhances the review capabilities of OneO to deliver a more consistent, less costly and higher quality approach to technology enhanced review. One Decision does this by empowering reviewers to select ONE DOCUMENT, make ONE DECISION and have that decision propagated throughout all near-duplicates for ONE RESULT.
The One Decision Review Accelerator allows users to experience and benefit from the advanced near-duplication identification technology developed by Orange Legal Technologies by providing them a simple, three step near-duplicate approach that is accessible directly from the OneO Discovery Platform Review Module. This One Decision Near-Duplicate Review Accelerator approach consists of:
1) Reviewers selecting ONE DOCUMENT from the review module Document List Grid for near-duplicate evaluation.
2) Reviewers applying ONE DECISION on resemblance[3] and containment[4] thresholds as set in the Near Duplication Setting Wizard to establish a targeted set of near-duplicate documents.
3) Reviewers making coding decisions on ONE DOCUMENT and having those decisions propagated throughout associated near-duplicates for ONE RESULT for all documents.

Three Step One Decision Review Discovery Accelerator Approach
This highly intuitive, fully integrated approach simplifies and expedites document reviews while delivering more consistent, less costly and higher quality document reviews.
What is Near-Duplicate Detection of electronically stored information (ESI)?
Near-Duplicate Detection is a technology enabled feature of advanced eDiscovery platforms that allows for the identification and grouping together of documents that have strong similarities in regard to content and context, yet are not completely identical.
Common academic definitions of near-duplicate detection include:
Why is Near-Duplicate Detection important in eDiscovery?
When one considers that vendor estimates show that the number of near-duplicate documents in complex reviews may range from 20% to 60%[7] of reviewable documents, Near-Duplicate Detection technology is important in the field of eDiscovery as it can significantly increase the speed of legal document reviews by decreasing the number of documents a review team must consider to gain a complete understanding of available information.
How is Near-Duplicate Detection executed within the OneO Discovery Platform?
Fully integrated into the architecture of the OneO Discovery Platform, One Decision Near-Duplicate Identification (NDI) capability is enabled by two wizard driven interfaces allowing user input and adjustment into NDI process. These wizards consist of the Near-Duplicate Settings Wizard and the NDI Wizard.
Figure 1 – Example of Near-Duplicate Settings Wizard (Pre-processing Setting).
Figure 2 – Example of Near-Duplicate Settings Wizard – Viewing Filter (Post-processing Setting).
How is Orange Legal Technologies’ delivery of Near-Duplicate Detection technology different that other approaches?
Differentiation factors for One Decision from OrangeLT include:
User visibility and access to near-duplicate identification algorithm parameters.
User adjustable thresholds for near-duplicate resemblance and containment
Fully integrated into OneO and presented as an optional part of a comprehensive eDiscovery workflow.
Developed by OrangeLT and delivered without the third party licensing cost constraints typically associated with near duplication technology.
To learn more about how OrangeLT can augment your eDiscovery capabilities with the One Decision Review Accelerator, contact us today to schedule an introductory briefing and demonstration.
Formed in 2008 as an eDiscovery-centric outgrowth of the 1995 founded Litigation Document Group, Orange Legal Technologies is a leading provider of electronic discovery litigation, audit, and investigation services for law firms and corporations. Having served over 1,000 clients since inception and with over 250 clients leveraging the OneO® Discovery Platform since its introduction, OrangeLT™ has worked with some of the world’s most well known corporations and law firms and has been recognized in leading analyst and media publications from such organizations as Gartner, IDC, 451 Research and Forbes.
OrangeLT™ is a leading provider of electronic discovery litigation, audit, and investigation services for law firms and corporations. OrangeLT offers a complete suite of electronic discovery technology and services, including collection, analysis, processing, review and production of digital and paper-based information. OrangeLT is enabled by the OneO® Discovery Platform – an integrated, web-accessible electronic discovery platform that provides online analysis, processing, and review of unstructured data from the security of a hosted centralized repository.
[1] Orange Legal Technologies’ Service and Technology Pricing. Orange Legal Technologies. January 2013.
[2] Where The Money Goes: Understanding Litigant Expenditures for Producing Electronic Discovery. Nicholas M. Pace and Laura Zakaras. RAND. April 2012. 53.
[3] Resemblance (or Jaccard coefficient) of two documents is defined as the size of the intersection of their shingle sets divided by the size of the union of shingle sets. (A Shingle is defined as a contiguous sequence of words.)
[4] Containment of two documents is defined as the size of the intersection of shingle sets divided by the size of the shingle of the initial shingle set.
[5] The Grossman-Cormack Glossary of Technology Assisted Review. Version 1.03. By Maura Grossman and Gordon Cormack. December 2012.
[6] See Nicholas M. Pace and Laura Zakaras. RAND. April 2012. 51.
[7] See Nicholas M. Pace and Laura Zakaras. RAND. April 2012. 53.