The DocSight OCR 4.2.0 release (4.2.0.20024) includes new features and bug fixes.


Software Requirements

The following software is required to successfully use DocSight OCR.

  • Windows Server® 2008 R2, 2012 R2 or 2016
  • Microsoft® .NET Framework 4.6.2 (installed automatically if it is not detected)
  • Supported browsers (for DocSight Verifier): Chrome and Firefox
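
To confirm the .NET Framework prerequisite before running the installer, a check along the following lines may help. This is only a sketch: it relies on the Release registry value that Microsoft documents for .NET Framework 4.5 and later, where 394802 is the minimum value for 4.6.2. How the DocSight OCR installer performs its own detection is not described here, and the function names below are hypothetical.

```python
import sys
from typing import Optional

# Microsoft's documented minimum "Release" value for .NET Framework 4.6.2.
NET_462_MIN_RELEASE = 394802


def is_net_462_or_later(release: Optional[int]) -> bool:
    """Return True if the given Release value means 4.6.2 or later is installed."""
    return release is not None and release >= NET_462_MIN_RELEASE


def read_release_from_registry() -> Optional[int]:
    """Read the Release value from the registry; return None if it is unavailable."""
    if sys.platform != "win32":
        return None  # The registry exists only on Windows.
    import winreg

    key_path = r"SOFTWARE\Microsoft\NET Framework Setup\NDP\v4\Full"
    try:
        with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, key_path) as key:
            value, _ = winreg.QueryValueEx(key, "Release")
            return int(value)
    except OSError:
        return None  # Key or value missing: .NET Framework 4.5+ is not installed.


if __name__ == "__main__":
    print(is_net_462_or_later(read_release_from_registry()))
```

Run on the target server, the script prints True when .NET Framework 4.6.2 or later is present and False otherwise.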

Hardware Requirements

Pentium 1.6 GHz or faster processor (Intel Core or better is recommended).


Development Environment

  • 4 GB minimum RAM; 6 GB recommended for grayscale or color images, and more for multithreaded applications
  • 1 GB of free hard disk space

Runtime Environment

  • 2 GB minimum RAM; 4 GB recommended
  • 600 MB of free hard disk space

Note: If you are installing an ActivePDF product on a Windows Server 2012 R2 machine for the first time, you must first download and install two Microsoft updates for Windows Server 2012 R2. The updates resolve issues with the Microsoft Visual C++ Redistributable runtime components. For links and step-by-step instructions, see the ActivePDF Knowledge Base article Installing Products on Windows 2012 R2 Servers.


New Features

DocSight OCR 4.2.0 has new features available through the Configuration Manager.

  • Character Filter: Specify which characters are searchable in the output PDF by using the Character Filter option in the OCR Profiles General tab for the Searchable PDF (Image over Text) OCR Type. The options are all characters (the default), numbers only, case-sensitive words, or punctuation.
  • Auto Detect Language: OCR can automatically detect the language used for word recognition. Select the Auto Detect Language check box in the OCR Profiles Character Recognition tab to have OCR recognize the language of your input document.
    Note: Install the corresponding language font locally for auto detect to work. For example, if OCR detects the document's language as Japanese, OCR requires a Japanese font to correctly process the document characters.
  • File Mask: Create a filter to ignore a file during processing. Enter a file name, such as Thumbs.db, and OCR ignores that file during conversion, but processes all other files in the Input folder.
    Note: The text box accepts one specific file name only. Wildcard syntax is not supported; for example, *.txt does not mask all .txt files.
  • Document Confidence Level: When Debug is enabled, the log now displays the confidence level for the entire document as a percentage. The log also reports the number of suspicious characters out of the total number of characters in the document.
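
The File Mask behavior described above, where the mask is one literal file name and not a wildcard pattern, can be illustrated with a short Python sketch. This is not DocSight OCR code: the function name is hypothetical, and the exact, case-sensitive comparison is an assumption, since the notes do not say how the product compares names.

```python
from pathlib import PurePath
from typing import List


def files_to_process(file_names: List[str], file_mask: str) -> List[str]:
    """Return the files that are not masked.

    The mask is compared as a literal file name (assumed case-sensitive),
    so a value such as "*.txt" only matches a file literally named "*.txt".
    """
    return [name for name in file_names if PurePath(name).name != file_mask]


input_folder = ["invoice.tif", "Thumbs.db", "notes.txt"]

# A specific file name masks exactly that file.
print(files_to_process(input_folder, "Thumbs.db"))

# Wildcard syntax is treated as a literal name, so nothing is masked.
print(files_to_process(input_folder, "*.txt"))
```

With the mask set to Thumbs.db, only that file is skipped; with *.txt, every file in the list is still processed, which mirrors the note on the File Mask text box.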

Bugs Fixed

ID #     Description
18002    The Arabic letters display correctly.
19326    OCR remote conversion for .NET works as expected.


Installation and Getting Started

For installation and configuration information, see Installing DocSight OCR and Configuring DocSight OCR.


API Reference

API information is available in the DocSight OCR User Guide:

http://documentation.activepdf.com/OCR/OCR_User_Guide/index.htm


OCR Product Page

For more information, go to the DocSight OCR product page:

https://www.activepdf.com/products/ocr


DocSight Verifier

Installation and Getting Started

For installation and configuration information, see Configuring ActivePDF Verifier.

User Guide

The DocSight Verifier User Guide is available here:

https://www.activepdf.com/support/documentlinks