Paper Discovery
Document Scanning
This process converts paper documents into a digital representation of the documents. The electronic images will be used for the client to view the documents based upon database search criteria results. The scanner will create a black and white image for all pages using 300 dpi resolution under the industry standard CCITT group IV TIFF. Documents are expected to be primarily 8.5” by 11” pages, though some oversized documents may be present within the population. Documents that can be auto fed into a scanner will be so scanned; documents, which cannot be auto fed into a scanner will be scanned on the glass or other appropriate method.
Document Coding
Coding is the process of capturing information contained within a document. This process typically applies to paper based documents. Coding consists of manually capturing such information as document type, date, author, recipients, title or subject, and source. The documents are typically logically unitized during the coding process.
OCR
OCR will be performed on each of the documents in the population using OCR technology. The OCR process will be run on a single pass basis using the following settings: Recognition: checking subsystem enabled, correction enabled, trade off accurate, suspect level – level 1; image pre-processing: auto zoning enabled, auto rotation enabled, and de-skew enabled. Output format: remove extra spaces selected, and remove blank lines enabled. Quality of the OCR data will depend upon the quality of the original documents. QA-Scanning, as detailed above, will ensure that the best possible digital images are obtained for the document population so as to maximize the accuracy of the OCR process.

