E-Discovery Tip Sheet Redaction Exposed When we talk about reserving material from document production, we generally think about two main strategies: 1. withholding documents under a claim of privilege, or 2. redacting specific portions of produced documents. Volumes can and have been written about privilege claims, privilege logs, and clawbacks of inadvertently-produced privileged material. In the past year alone, we have seen Samsung chastised for posting Apple secrets to its employees, and Magistrate Judge Andrew Peck lecturing all who would listen that the absence of a 502(d) Order (specifying return of inadvertently produced privileged material) under the Federal Rules of Evidence should be grounds for legal malpractice. Let us confine our discussion today to redactions. What are we talking about when we talk about redaction? Let’s first turn back to the days of photocopies of black-marked photocopies. Redactions obliterate a privileged or otherwise sensitive portion of a document so that it may be produced at least in part. Sometimes, in very small part: George W. Bush’s military service record, for example, revealed little more than his name and a visit to the dentist. As long as the information was on paper, or exportable to paper, markers, scissors and tape ruled. The mechanics of redaction only started to change when the focus of discovery shifted to electromagnetic bits. Back in the early-to-mid aughts, electronic documents became the principal subjects of interest as digital document creation and storage was acknowledged to be more prevalent than paper document creation. E-Discovery, the {00033337;1 } June 2015 E-Discovery Tip Sheet Page 2 collection, review and production of electronic mail and computer-generated documents, was the focus of study (e.g., The Sedona Conference), decisions (e.g., Zubulake v. Warburg and its sequels) and law (the December 2006 amendments to the Federal Rules of Civil Procedure). It came to pass that one had to redact electronic documents. And it was likewise true that one could not do so, at least without denaturing the original document. If one deleted text from a Word document (most preferably a copy), the metadata and the shape of the document changed, so arguably it was not the same document, but an adulterated version. Obscuring text also involved adulterating the document, with the bonus of being ineffective: a search, or a font color change, or a manipulation of an overlaid graphic would still turn up the text. Enter Adobe Adobe’s Portable Document Format (PDF) seemed to be the ideal solution to the problem of producing redacted documents. As a digital print medium, the PDF file could be seen as an a print-out where metadata could be elided and text obscured. The difference between obscuring and redacting text became apparent when deleted material prepared by Merck & Co. in 2005 for publication was found in the document’s metadata, accidentally disclosing information which linked Vioxx to an increased risk of heart disease.<1> Adobe addressed the issue in a 2006 technical note<2> which cautioned that “[p]roblems can arise when editors use an improper method such as trying to obscure information rather than deleting it, or if they are unaware of sensitive metadata in a document.” The paper goes on to recommend using the third-party Redax tool from Appligent (which is still offered by Appligent Document Solutions); otherwise, “every effort should be made to redact in the authoring application before converting to PDF.” The technique described involves: copying a document, in the copy, deleting the redactable content (or replacing it with XXXX characters or a blank graphic to maintain the document’s spacing), creating a new blank document, {00033337;1 } June 2015 E-Discovery Tip Sheet Page 3 copying the content of the marked up copy and pasting it into the new blank document, setting the PDF conversion parameters (if the Adobe plug-in is present) to confirm that Convert document information and Attach source file to Adobe PDF are turned off, and saving the new document. If redacting a PDF document directly, Adobe in this paper recommends doing black markups, exporting the page images to TIFF (tagged image file format), then building a fresh, image-only PDF from the TIFFs. Apparently, the Transportation Security Administration (TSA) did not get the memo. In December 2009, a TSA contract employee posted a PDF format ‘redacted’ security manual to a publicly-available Federal procurements website. [See footnote <1>] The text of the redacted content in the manual magically appeared when the black boxes were selected, copied, and pasted into Notepad. A betanews commentary published following the debacle<3> noted that Adobe had long since released Acrobat Professional 8, its first release to include true built-in redaction. The included Mark for Redaction / Apply Redactions tool passed the publication’s tests with flying colors. Now redaction, with selection of text and scrubbing of underlying content, is a standard feature of Acrobat Professional, Nuance PDF Converter, and a host of other tools. Redaction in Review Document review software has long addressed redaction of images (and more recently, near-native views) of documents during review. Summation redacted; Opticon / Concordance Image redacted, allowed redactions to be toggled and text selected. These days translucent redaction, which renders the redacted text visible while still under review until the production burn-in, is often a default view. Redaction in review software remained a pretty manual process: instead of a black marker, you had a pointer-applied tool, but it remained that every redaction required individual judgment and application. More vexingly, while data have grown more portable, redactions have not. XML should allow the specification of redaction coordinates for each document and each page, but EDRM XML transfer has yet to sweep away de facto standards such as {00033337;1 } June 2015 E-Discovery Tip Sheet Page 4 Concordance DAT or OPT or IPRO LFP formats. Converting Summation iBlaze data means leaving behind all those .ann files in the markup folder; at least you have the Bates numbers of the affected pages. Concordance provides the –Redlines database for generating a report and parameters. That is not the same as seeing your REDACTED box in the same place on the same page without further intervention. One other wrinkle is the handling of extracted or OCR text for a redacted document. Early applications were not cognizant of a redaction – in which case the text could be inadvertently produced; later applications might acknowledge the redaction by suppressing all text or native production for that document. In either case, an extra OCR step for redacted document images is required, whether handled by your principal review application or not. Be sure you know how your application handles redactions before you send out your first production. Redaction: The Next Generation While, as noted, redaction requires eyeballs, some parts of the process which lend themselves to programming rules have surfaced in various applications. Some features to look for: Auto-redaction – This is the ability to find and auto-redact identifiable strings of text, such as Social Security numbers, credit card numbers, telephone numbers or business tax ID numbers. It uses a standard coding system, called regular expressions, which allows users to build “masks” of general or specific numbers, uppercase or lowercase characters, and other type objects, to identify instances of the item sought. Those are then redacted automatically under the rule set. Repetitive redaction – If a document has several consecutive pages which are to be redacted in their entirety, some systems will allow the coordinates to be defined and set to repeat. Negative redaction – Sometimes the bulk of the content needs to be secured, but a few tidbits may be exposed (see the Bush example above). Some review tools allow the exposed spaces to be defined in each page, with the remainder defaulting to redacted. As with public scandals, it is less the fact than the concealment that can complicate matters and trip up the unwary. Know your objectives, know your {00033337;1 } June 2015 E-Discovery Tip Sheet Page 5 software, be vigilant, and verify that you are, like a good fashion designer, in firm control of what is to be concealed and what is to be exposed. REFERENCES: <1> COMPUTERWORLD -- TSA Gaffe Shows Pitfalls of Redaction by Jaikumar Vijayan [Jan 4, 2010] (http://www.computerworld.com/article/2550522/security0/tsa-gaffe-showspitfalls-of-redaction.html) <2> Redaction of Confidential Information in Electronic Documents [Adobe Technical Note - 2006] (http://partners.adobe.com/public/developer/en/acrobat/Redaction.pdf) <3> betanews – The PDF redaction problem: TSA may have been using old software by Scott M. Fulton, III [12/10/2009] (http://betanews.com/2009/12/10/the-pdf-redactionproblem-tsa-may-have-been-using-old-software/) -- Andy Kass [email protected] 917-512-7503 The views expressed in this E-Discovery Tip Sheet are solely the views of the author, and do not necessarily represent the opinion of U.S. Legal Support, Inc. U.S. LEGAL SUPPORT, INC. ESI & Litigation Services PROVIDING EXPERT SOLUTIONS FROM DISCOVERY TO VERDICT • • • • • • e-Discovery Document Collection & Review Litigation Management Litigation Software Training Meet & Confer Advice Court Reporting Services • • • • • • At Trial Electronic Evidence Presentation Trial Consulting Demonstrative Graphics Courtroom & War Room Equipment Deposition & Case Management Services Record Retrieval www.uslegalsupport.com Copyright © 2015 U.S. Legal Support, Inc., 90 Broad Street, New York NY 10004 (800) 824-9055. All rights reserved. To update your e-mail address or unsubscribe from these mailings, please reply to this email with CANCEL in the subject line. {00033337;1 }
© Copyright 2024 Paperzz