Explain that stuff
Sponsored links

You are here: Home page > Computers > Scanners
Advertisement
A feed-through scanner based on a Canon BJC-80 printer

Scanners

by Chris Woodford. Last updated: August 29, 2011.

If you've got a printed document and you want to get it into your computer, who you gonna call? Chances are, you're going to "call" your scanner! A scanner is a brilliantly useful piece of equipment that has some things in common with a photocopier and other things in common with a digital camera. Let's take a closer look at how it works.

Photo: A feed-through scanner (seen from directly above) looks much like an ordinary computer printer. Feed the document to be scanned in and it travels through the scanner from the back to the front. The scanner head (the small mauve square in the center) moves from left to right and back across the page, like the print head in a printer.

What is scanning?

They say a picture is worth a thousand words. What that really means is that pictures contain a huge amount of information. To describe a complex picture, you might need far more than a thousand words! Now just imagine if you're a computer and you want to make sense of a picture like the Mona Lisa. Where do you begin? People find this job easy because a huge part of our brain (something like 25 percent) is devoted to processing the things we can see with our eyes. But if you're a computer, you can't process a whole picture at once like a person can; instead, you have to go about trying to understand the information the picture contains in a much more systematic way.

A scanner's job is to turn a picture a human can understand into a digital picture that a computer can store and process—an image coded in the form of millions of numbers (zeros and ones) called binary code. A scanner does this by moving a scanning head backwards and forwards across the picture to recreate the image, one line at a time, inside the computer. This process is called scanning and that's how a scanner gets its name. (Televisions also use a process called scanning to build up the pictures on their screens, but they do it so quickly that we can't (usually) see it happening.)

Using a flatbed scanner

Photo: A flatbed scanner is more like a photocopier: you put your paper face down on a flat piece of plastic or glass. The paper stays still while the scanning head, inside the scanner, moves past it.

Different types of scanners

Flatbed scanners

There are several different types of scanner, but they all work in broadly the same way. The most popular ones are flatbed scanners, which look a bit like photocopiers that have been on a diet. You open up a lid, place the document you want to scan face down on a piece of glass, and then close the lid again. When you tell the computer to scan, the scanner moves its scanning head from side to side and progressively down the page until it's covered the entire printed area. The scanning head contains a very bright light and a light-sensing unit called a CCD (charge-coupled device), side by side. As the unit moves back and forth, the bright light shines up onto the page and reflects the pattern of black and white characters back down into the CCD. The CCD is the same sort of light detector that you find in a digital camera (and you can find out how it works in our CCDs article). It detects the pattern of light being reflected into it off the printed page and produces a series of electrical signals. This effectively converts the light pattern on the page into a pattern of numbers that your computer can store.

Feed-through scanners

Another type of scanner looks and works much like a computer printer and you have to feed documents through one page at a time. In a flatbed scanner, the scanner head moves down the page, which remains stationary. But in a feed-through scanner, the page moves through and the scanner head simply moves from side-to-side. My old Canon inkjet printer, pictured in the top photo, came with a scanner cartridge. To turn the printer into a feed-through scanner, I simply have to remove the print cartridge and put the scanning cartridge in instead. My computer figures out what I've done and it's smart enough to know that the printer is now, effectively, a scanner.

Top side of Canon IS12 image scanner cartridge Underside of Canon IS12 image scanner cartridge showing the light scanning tube and CCD lens

Photo: This clever Canon cartridge turns an inkjet printer into a scanner.

Left: You can see what the scanner cartridge looks like from above. On the right of this picture, you can see the gold-coloured connectors where the scanner cartridge sends its data, via the converted printer, to my computer. They're actually made of copper, not gold.

Right: The same cartridge, viewed from underneath, has two slits in its base (circled in red). The white-colored one on the left is the bright light that shines onto the paper. The black slit right next to it is a plastic lens. Underneath it is the CCD that converts the reflected light into a string of digital information.

Black and white or color?

Scanning red, blue, and green

Like a photocopier, a scanner can make black and white images of a page (whether the page is in color or black and white) simply by scanning a light across it. But to make decent color images of a color page, it has to scan the red, green, and blue elements of the page separately. Some scanners do this using separate red, green, and blue lights. Instead of sending the light across the page once, they send it over three times—once to capture each of the three colors that combine to make up a color page. On the photo I've taken here, you can see how my scanner scans each line of the page three times using separate red, green, and blue lights. Scanning a whole page can take some time!

Photo: Each line is scanned three times, once for red information, once for green, and once again for blue.

Processing scanned images

Once you've scanned a page, you end up with an image file on your computer (usually in a graphical format called TIFF, BMP, or JPG). If you've scanned a page from a book, what you have is effectively a photograph of a page from a book. You still don't have the words in your computer in a form that you can paste into a word-processor and edit. Indeed, your computer doesn't even know that the image you scanned is a page: for all it knows (or cares), you might have scanned a photograph.

To turn a scanned image into usable text, you have to use a type of computer program called OCR (optical character recognition). This works its way through the image file in a systematic way looking for recognizable patterns of letters and characters. When it finds them, it strings them together, in sequence, to build up a simple text file. After the OCR program has done its job, your original image of the printed page will have been turned into something much more useful: a digital file you can edit however you want. Find out more about this process in our main article on how OCR works.

Apart from flatbed and feed-through scanners, you can also get pen scanners, which look like fat, electronic pens. They have built in mini-scanning units and OCR programs. You simply run them across a line of printed text in a book or magazine. A tiny CCD in the tip of the pen scans an image of the printed line and built-in OCR software turns it into processable text. When you plug the pen into your computer, the text file is automatically transferred across. Digital pens are very similar.

How to get the most from your scanner: 10 top tips

One of the golden rules of computing is "GIGO"—garbage in, garbage out. Roughly translated in this context, it means that if you scan a document the right way, you'll save yourself endless hours of work later! Here are some tips I've discovered over the years:

  1. Buy the right scanner to start with!: If you're scanning multi-page documents, a feed-through scanner (which will scan a whole pile of papers, one after another, without any intervention from you) is generally going to be much quicker to use than a flatbed scanner (where you'll have to scan and remove every single page in turn). However, if you're scanning books, or awkward objects, you'll need a flatbed. It's worth remembering that if you have a reasonably high-resolution digital camera with a good macro lens, and you just want to make an occasional scan, you may not need a scanner at all: my camera (a 7-megapixel Canon) will produce decent images of paper documents that are plenty good enough to OCR. Experiment with your camera before you waste money on a scanner you may not need.
  2. Clean your scanner before you start: If there's dirt, fingermarks, grease spots, spider legs, jam smears, or worse on your scanner, you can confidently expect them to appear on your finished image. If you've got a flatbed scanner, keep the glass or plastic surface as clean as you can. Using a microfiber cloth is a good way to clean it without scratching it.
  3. Make sure the page (or photo) you're scanning is straight: It sounds blindingly obvious, but it's much easier to work with an image that's straight than one that's crooked. You can straighten images later, but why not get it right first time and save the effort?
  4. Shut the scanner lid or put something over the top to keep the light out: Flatbed scanners are much like photocopiers: if you don't close the lid, you'll let light in and it'll degrade the edges of your image (they're likely to go much blacker than they should).
  5. Pick the right scanning mode: Most scanners have different preset scanning modes, such as "Photo", "Document", and "OCR." Figure out the correct one before you start. With OCR, it's crucial that your computer can tell the difference between the black and white areas of the page, so it needs a crisp two-tone scanned image to work from rather than one in many colors or shades of gray.
  6. Practice first: Not sure which mode to use? If you've got a lot of scanning to do, experiment with scanning modes and figure out the best one before you start. For example, a few years ago, I had to convert a very old, very faded typewritten manuscript into a Word file I could put on the web. I could have scanned the whole thing at very high resolution, but after experimenting it turned out to be far quicker (more accurate) to scan in fax mode, OCR, and then correct the scanned material with a spellchecker. Sometimes it's actually easier to photocopy a bad, faded manuscript (to increase the black and white contrast) and then scan the copy for OCR instead of the original.
  7. Higher resolution is better—but not for the Web: If you're scanning a photo for a website, you'll probably want it to be either a small editorial size (maybe 200–300 pixels wide) for basic decoration or a bit bigger if it's the main content (say, 800 pixels wide or so). Remember that most people browse the Web with screens that are no wider than 1024 pixels. There's no point in scanning a photo at a resolution of 3000 pixels or higher for Web use. It'll be physically too big to fit on the screen and the file size will be much too large for quick and efficient downloading.
  8. Master your photo-editing software: A scanned image is much the same as a digital photo, so you can edit or enhance it in a photo-editing program (such as Photoshop, Google's Picasa, or the GIMP) if you need to. Crop off the sides of the image to remove bits of the scanner mechanism, the unwanted edges of a page, or whatever. Use the "clone" brush to remove dirt spots, page tears, or anything else you don't want in your scanned image.
  9. Scanners make great translators: Online translation has advanced enormously over the last few years and it's now possible to get a reasonable, working translation of a document in just a few minutes. Simply scan the document, OCR it, and then feed the OCR'd text into something like Google's Language Tools (or an alternative online translation service). Bear in mind that GIGO applies doubly to online translation. After you've OCR'd your document, always proofread it and manually correct any scanning errors before feeding it into the translator—unless you want twice as much mistranslation as you're going to get anyway.
  10. Backup your photos: What happens if your home burns down and you lose a lifetime of photographic memories in a matter of minutes? A scanner offers a great way to keep a backup copy. Scan your photos, put them on a CD or DVD, and give them to someone in your family to look after. Alternatively, join a photo-sharing website like Flickr or upload your photos onto a "cloud" storage service like Amazon's S3.

Further Reading

On this website

Books

Sponsored links

Please do NOT copy our articles onto blogs and other websites

Text copyright © Chris Woodford 2007, 2011. All rights reserved. Full copyright and legal notice.

Please read our copyright notes before using any material from this website.

Follow us on Facebook

Rate this page

Please rate or give feedback on this page and I will make a donation to WaterAid.

Share this page

Help other people find this page by bookmarking it with:

Social bookmark icons Delicious   Digg   reddit   Facebook   StumbleUpon   Google   Twitter   Email it to a friend   Google+

Cite this page

Link to this page

If you'd like to link to this page, thank you! Here's some code you can cut and paste:

Can't find what you want? Search our site here!