Even after using a variety of techniques to cull those that it found, it was still left with over 2.1 million that needed consideration. Further keyword searching might have resulted in more reductions but the team wasn’t comfortable with what that process might miss.
Realizing they had neither the time nor money to review all 2.1 million documents, the client and counsel turned to OpenText™ Insight Predict, a technology-assisted review (TAR) engine based on continuous active learning (CAL).
Because Insight Predict has no limit on the amount of training seeds it can handle, review managers used 50,000 documents tagged for an earlier matter to jumpstart the review. Almost immediately, relevant documents from the larger collection were pushed to the front of the line.
Insight Predict’s algorithm provided batches made up primarily of the documents it most highly ranked. This ensured the review team was productive immediately, as it was focused on the documents that were most likely relevant. It turn, the trial team quickly got its hands on the most important documents needed to sharpen their analysis of the case.
The review batches included a mix of documents chosen based on their “contextual diversity.” This unique feature is designed to solve the problem of “you don’t know what you don’t know.” Specifically, the contextual diversity algorithm chooses documents for review that are different than those already reviewed. The algorithm clusters unseen documents by their common themes. It then pulls the most relevant examples from each cluster and presents them to reviewers as part of the batch. If the reviewer tags an example as relevant, the ranking engine is cued to promote similar documents. If the example is not relevant, the ranking engine learns that this cluster is of lesser interest.
The continuous active learning protocol within Insight Predict feeds reviewer judgments back to the system to improve the training and thereby the responsiveness rate for subsequent review assignments. As the reviewers release their batches, Insight Predict adds their judgments to further its training. The net result is that the algorithm gets smarter and smarter about finding and promoting relevant documents for review.
Furthermore, Insight Predict’s ability to use flexible inputs allowed the review team to take a multimodal approach to finding responsive documents. As the review progressed, Insight Predict identified promising search terms as well as custodians who held the most likely relevant documents. This enabled the trial team to independently run searches with these key terms and then tag the relevant documents found through searches.