Statistics for eDiscovery: Random Sample Generation and eDiscovery
How do I generate a random sample?
In election polling, it is quite a challenge to generate a truly unbiased random sample. Not everyone who will vote has a telephone, not everyone is willing to answer, and not everyone is truthful. Sampling documents is a little easier, because documents have no choice about whether they respond. The most practical way to generate a random sample is to use a random number generator. Spreadsheet programs, such as Excel, have random number generators built in. If you have, say, ten thousand documents from which you want a sample of 400, you could generate 400 random numbers in the range 1 to 10,000 and review the documents with those numbers. You need to be careful to prevent the same number from appearing twice, since a duplicate would select the same document twice and leave you with fewer than 400 distinct documents.
Technically, the numbers produced by a random number generator are pseudo-random numbers, but for our purposes they are good enough. They are pseudo-random in that eventually the sequence will repeat, but unless you are dealing with extremely large samples, the repetition is unlikely to be encountered. These pseudo-random number generators are used in practically all slot machines, for example.
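As an illustration only, here is a minimal sketch of the same idea in Python rather than a spreadsheet. It assumes the documents are numbered 1 through 10,000; the function name draw_sample and the seed value are hypothetical and are not taken from the source. The standard library's random.sample draws without replacement, so no number can appear twice.

```python
import random


def draw_sample(population_size, sample_size, seed=None):
    """Return sample_size distinct document numbers from 1..population_size."""
    # A private generator; passing a seed makes the draw repeatable.
    rng = random.Random(seed)
    # random.sample selects without replacement, so duplicates cannot occur.
    picked = rng.sample(range(1, population_size + 1), sample_size)
    # Sorting is only for convenience when pulling the documents.
    return sorted(picked)


# Hypothetical example: 400 documents out of 10,000.
sample_ids = draw_sample(10_000, 400, seed=2012)
print(len(sample_ids), "distinct document numbers drawn")
```

Recording the seed is one way to let someone else regenerate the same sample later. Whether that is needed depends on how the sample will be used.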
Source: Statistics For eDiscovery (PDF) - Used by Permission of Herbert L. Roitblat, Ph.D.
Series: Statistics for eDiscovery – Blog Posts – Orange Legal Technologies.
This entry was posted on Friday, February 24th, 2012 at 12:29 pm. It is filed under metadata, resources and tagged with resources, services.
