Skip to main content

The Basics: Software

Image editing software

Which software you use depends on what you are digitizing. If you are digitizing images, Photoshop or something similar is vital. The GIMP is a similar program that is free.

Whenever possible you want to avoid having to edit the images at all. You want your scan to have good lighting, good white balance, and a good crop box before you touch it with a program. It saves time and saves the image from unnecessary tampering.

All image editing software works off of different algorithms. This is why the same function will work differently in different programs. The more expensive and popular software (like Photoshop) has good algorithms that may produce better results.

The basic functions you want in an image editing software:
· color correction
· cropping
· brightness/contrast correction


Text image processing software

If you are scanning mostly text, then you need a totally different kind of program. If you are doing text, then you are likely doing books, newspapers (stuff that has a lot of pages). You will probably also want OCR (Optical Character Recognition ), so you can make the text searchable.

In order to make Text from a scan readable, the scan has to be as clear as possible, as level as possible, and as clean as possible. This means that when you are looking at text image processing, you want these basic functions:

· Batch processing (many images in a batch without human interaction)
· Crop
· Conversion to bi-tonal (black and white) or grayscale
· De-skew (leveling the image based on lines of text)- There is actually a plugin for deskewing for GIMP


Many companies that sell digitization equipment will have some piece of software that takes care of these issues. Check around, and ask.

In addition to a program that can to the above, you need a separate program to OCR. The program I’ve heard used most often is AABBY Finereader. It creates a PDF with searchable text.

That's the basic software.

Comments

Popular posts from this blog

Atiz scanner and Kirtas scanner aren’t playing nice with eachother

I love the Atiz scanner for it's simplicity, good design, and utility. I love the Kirtas scanners for their speed and their "wow" factor when people see the things work. The only problem I have at the moment is taking our current Kirtas workflow (using Kirtas's software Bookscan Editor, Superbatch, and OCR manager), and finding a way to make the Atiz scanner workflow work with it. The Atiz machine came with a hefty batch editing program that does a great job of cleaning up the images and making them wonderfully presentable. The machine even came with a PDF maker, but it doesn't OCR on its own, and it doesn't give you the options that Kirtas' OCR manager do. So, I want to process the Atiz scanner finished images using Kirtas’s OCR manager. However, that seems to be more difficult than I had first expected. For the next month, I’ll be trying to figure out how to make this marriage of Atiz and Kirtas systems work. If it ends up failing, then I may have t...

Ex Libris Digital Preservation system

Today I attended a webinar from Sun Microsystems about the new Ex Libris Digital Preservation system. You can view the webinar here . The talking points are they handle all the hardware and they can handle the software. They claim it’s secure and built with redundancy. The major problem is that they say you can’t provide access to the files without getting Primo (Ex Libris’s new Amazon-like catalog toy-which is looking fun). They won’t convert the files for you when the formats out of style, but they make it so that you can maintain and upgrade the files. All and all, I like the idea of a comprehensive digital preservation system being handled by people who know hardware. I Just think it is going to be too expensive for most libraries. Time will tell how many libraries pick this up.

Microfilm and Microfiche scanners

I have been researching high speed microfiche and microfilm scanners for the last year. There are four major companies that produce microform scanners. Mekel (a Crowley Company), Wicks and Wilson , nextScan ,and Sunrise . They each have their advantages and disadvantages. Both nextScan and Sunrise have 3-in-1 or 2-in-1 models, where you have one machine (~$100,000) that comes with one attachment, and you buy other attachments for different types of microform (Microfilm, Microfiche, and Aperture card). Each attachment costs extra. I never figured out the cost for the attachments. nextScan also has a dedicated roll film scanner , that I’ve heard good reviews from the Newspaper Digitization Project in Australia . In general, I have heard that the 3-in-1 or 2-in-1 machines are fine, but they tend to go slower than dedicated machines. They really are built for versatility and marketed toward libraries who can only afford one machine that can do all types (Paying $100,000+ for one...