Why is optimizing scanned documents important? Document scanners deliver images of the pages in a document. Cleaning up and enhancing those images before the file is stored is an important part of any document capture system. Benefits of adding image enhancement to the document scanning workflow are:
- Better readability and visual appearance of the stored documents
- Reduce risk of losing information from the original
- Less storage space required due to better image compression
- Improved accuracy of text recognition (OCR)
The most common cleanup operations needing to be done after the document has been scanned are:
- Detect page sizes and orientation
- Remove black borders and punch holes
- Remove blank pages
- Deskew (straighten) pages that were skewed by the scanner's sheet feeder
- Remove spots (speckle) caused by dust and stains on the original
These are all options that may be added to the automated processing. They can be done directly after scanning, and you will see the result immediately. Or they can be added as a queued background process in DocServer™ so you can continue working, for example with scanning more documents.
Sometimes it is also desirable to enhance the scanned images, for example if the originals are very dark, dull, unsharp, have a color cast, etc. Some document scanners may even introduce such unwanted effects during scanning. By adding image enhancement to the scan processing, you can to some degree compensate for this.
- Automatically enhance or adjust contrast
- Adjust brightness
- Adjust color balance
All these operations can also be added to the automated scanning process.
You can add almost any image processing function or document composition operation to the workflow by using a macro. A macro is a set of PixEdit commands stored in a text file. Macros can be used both interactively and as part of the automatic scan process.
Cropping and organizing pages
These automatic functions greatly increase efficiency of document scanning.
Detecting page sizes and blank pages
This enables you to scan a mixture of page sizes and single-sided or double-sided without having to manually crop pages and remove blank back sides afterwards.
Detecting page orientation
This can save scanner operators a lot of time. They no longer have to browse the entire batch to make sure every page is oriented correctly, before they scan it.
The easiest way to scan a booklet to PDF is usually to just remove the stitches and put it in the sheet feeder of your document scanner. But the file you get from the scanner isn't very user friendly. You'll be browsing through double pages in a very confusing order.
You could of course cut the booklet along the fold with a knife and rescan, but then you would destroy the original. There is a simpler way: PixEdit's automatic booklet function helps you with this problem. It will automatically cut the pages digitally and rearrange them into correct reading order.
When scanning thousands of documents, file size is a very important concern. Producing many large files can take up a lot of harddisk space and network bandwidth and make your production scanning a lot more costly. However, when cutting down on file size, it is crucial to retain the image quality and readability of documents. PixEdit® has two unique and powerful features to help you with this.
ACRO™ - Automatic Color and Resolution Optimization
This technology exploits the fact that, in most cases, a large document consists of only a few pages with color images on them and many pages with just black-and-white text. If you scan the whole document in full color, you will get a huge file. However, if you run it through the ACRO function, all pages with little or no color content will be converted to black-and-white. And black-and-white images takes up roughly one-twentieth of hard disk space compared to full color images!
In cases where you have some color content that must be preserved on a major part of a large document, this technology is a better solution than ACRO. It will reduce the number of colors on each page so that they can be compressed much more than the images that came from the scanner. This also reduces the file size many times and can save a lot of space and network load.
Interactive and Automatic Batch Processing
This tool is perfect for enhancing, optimizing and converting existing archives of scanned files. An easy user interface will guide you through the required steps. Take a look at this video to see it in action:
This unique technology exploits the great potential of modern multi-processor computers. It lets you set up several scan processing workflows that runs in parallel and in the background. For example, you can let DocServer automatically take over the job of cleaning up, optimizing and storing a document right after it has been scanned. You may then continue working, scanning new documents or editing them. Here are some videos that will show a little more