Network-based filtering for large email collections in E-Discovery

Authors
Citation
H.helsen, Network-based filtering for large email collections in E-Discovery, Artificial intelligence and law, 18(4), 2010, pp. 413-430
ISSN journal
09248463
Volume
18
Issue
4
Year of publication
2010
Pages
413 - 430
Database
ACNP
Abstract
The information overload in E-Discovery proceedings makes review expensive and increases the risk of failing to produce results on time and consistently. New interactive techniques have been introduced to increase reviewer productivity. In contrast, this article presents an alternative method that aims to reduce the volume of information during culling, so that less information needs to be reviewed. The proposed method first maps the email collection universe using straightforward statistical methods based on keyword filtering combined with date-time and custodian identities. Subsequently, a social network is constructed from the email collection and analyzed by filtering on date-time and keywords. By using the network context, we expect to provide a better understanding of the keyword hits and the ability to discard certain parts of the collection.
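
The two steps described in the abstract can be illustrated with a minimal sketch. It is not the article's implementation: the toy collection, the field names (`sender`, `recipients`, `sent`, `body`), the function names, and the choice to treat the sender as the custodian are all assumptions made for illustration. The first function counts keyword hits per custodian and month; the second builds a weighted sender-to-recipient network from the emails that pass the same keyword and date-time filter.

```python
from collections import Counter
from datetime import datetime

# Hypothetical toy collection; the field names are assumptions, not from the article.
EMAILS = [
    {"sender": "alice", "recipients": ["bob"],
     "sent": datetime(2001, 5, 3), "body": "The merger draft is attached."},
    {"sender": "bob", "recipients": ["alice", "carol"],
     "sent": datetime(2001, 5, 9), "body": "Re: merger timeline"},
    {"sender": "carol", "recipients": ["dave"],
     "sent": datetime(2001, 7, 1), "body": "Lunch on Friday?"},
    {"sender": "alice", "recipients": ["bob"],
     "sent": datetime(2001, 8, 2), "body": "Merger signed."},
]


def matches(email, keywords, start, end):
    """True if the email falls in [start, end] and contains any keyword."""
    text = email["body"].lower()
    in_range = start <= email["sent"] <= end
    return in_range and any(k.lower() in text for k in keywords)


def hits_per_custodian_month(emails, keywords, start, end):
    """Count keyword hits per (custodian, year-month).

    Assumption: the sender stands in for the custodian.
    """
    counts = Counter()
    for e in emails:
        if matches(e, keywords, start, end):
            counts[(e["sender"], e["sent"].strftime("%Y-%m"))] += 1
    return counts


def build_network(emails, keywords, start, end):
    """Build weighted directed edges (sender, recipient) from matching emails."""
    edges = Counter()
    for e in emails:
        if matches(e, keywords, start, end):
            for r in e["recipients"]:
                edges[(e["sender"], r)] += 1
    return edges
```

Calling `build_network(EMAILS, ["merger"], datetime(2001, 5, 1), datetime(2001, 5, 31))` on this toy data yields edges among alice, bob, and carol only. Under the assumptions above, a participant such as dave, who never appears in the filtered network, would be a candidate for the kind of discarding the abstract mentions.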