pctechguide.com

  • Home
  • Guides
  • Tutorials
  • Articles
  • Reviews
  • Glossary
  • Contact

OCR Technology

Pattern

When a page of text is scanned into a PC, it is stored as an electronic file made up of tiny dots, or pixels; it is not seen by the computer as text, but rather, as a picture of text’. Word processors are not capable of editing bitmap images. In order to turn the group of pixels into editable words, the image must go through a complex process known as Optical Character Recognition (OCR).

OCR research began in the late 1950s, and since then, the technology has been continually developed and refined. In the 1970s and early 1980s, OCR software was still very limited – it could only work with certain typefaces and sizes. These days, OCR software is far more intelligent, and can recognise practically all typefaces as well as severely degraded document images.

One of the earliest OCR techniques was something called matrix, or pattern matching. Most text is either in Times, Courier, or Helvetica typefaces in point sizes between 10 and 14. OCR programs which use the pattern matching method have bitmaps stored for every character of each of the different font and type sizes. By comparing a database of stored bitmaps distributed to the bitmaps of the scanned letters the program attempts to recognise the letters. This early system was only really successful using non-proportional fonts like Courier where letters are spaced regularly and are easier to identify. Complex multi-font documents were well beyond its scope and an obvious limitation to this method is that it is only useful for the fonts and sizes stored (learn more info here).

Feature

Feature extraction was the next step in OCR’s development. This attempted to recognise characters by identifying their universal features, the goal being to make OCR typeface-independent. If all characters could be identified using rules defining the way that loops and lines join each other, then individual letters could be identified regardless of their typeface. For example: the letter a is made from a circle, a line on the right side and an arc over the middle. The arc over the middle is optional. So, if a scanned letter had these features it would be correctly identified as the letter a by the OCR program.

In terms of research progress, feature extraction was a step forward from matrix matching, but actual results were badly affected by poor-quality print. Extra marks on the page, or stains in the paper, had a dramatic effect on accuracy. The elimination of such noise became a whole research area in itself, attempting to determine which bits of print were not part of individual letters. Once noise can be identified, the reliable character fragments can then be reconstructed into the most likely letter shapes.

No OCR software ever recognises 100% of the scanned letters. Some OCR programs use the matrix/pattern matching and/or feature extraction methods to recognise as many characters as possible – and complement this by using spell checking on the hitherto unrecognised letters. For example: if the OCR program was unable to recognise the letter e in the word th~ir, by spell checking th~ir the program could determine the missing letter is an e.

Recent OCR technology is far more sophisticated than the early techniques. Instead of just trying to identify individual characters, modern techniques are able to identify whole words. This technology, developed by Caere, is called Predictive Optical Word Recognition (POWR).

Using higher levels of contextual analysis, POWR is able to virtually eliminate the problems caused by noise. It enables the computer to sift through the thousands or millions of different ways that dots in a word can be assembled into characters. Each possible interpretation is then assigned a probability, and the highest one is selected. POWR uses sophisticated mathematical algorithms which allow the computer to hone in on the best interpretation without examining each possible version individually.

When probabilities are assigned to individual words, all kinds of contextual information and evidence is taken into account. The technology makes use of neutral networks and predictive modelling techniques taken from research in Artificial Intelligence (AI). These involve extensive use of experts – algorithms set up to be specialists in various areas of character recognition. One expert might know a great deal about font styles, another about dictionary information, another about the degradation caused by faxes. By combining the use of expert systems with modelling techniques taken from Cognitive Science, a mathematical, probabilistic infrastructure can be created that dynamically investigates all the different possibilities of characters that make up a word in question by querying a variety of experts, and generating an initial set of general hypotheses. The evidence for and against each hypothesis is weighed and a probability assigned to each hypothesis. The investigation continues until a clear and compelling answer emerges. At each stage in the investigation, a new set of experts is selected based on the relevance of their areas of expertise to the particular situation and their histories of success in similar situations.

The net result is that POWR is able to identify words in a way which more closely resembles human visual recognition. In practice, the technique significantly improves the accuracy of word recognition across all document types. All the possible interpretations of a word are assessed by combining all sources of evidence, from low-level pixel-based information to high-level contextual clues. The most probable interpretation is then selected.

Although OCR systems have been around for a long time, their benefits are only just being appreciated. The first offerings were extremely costly, in terms of software and hardware, and they were inaccurate and difficult to use. Consequently many of the early adopters became frustrated with the technology. Over the past few years, however, OCR has been completely transformed. Modern OCR software is highly accurate, easy to use and affordable and for the first time OCR looks set to be adopted in all kinds of work environments on a mass scale.

Unless there is a specific need to preserve colour information from the original document, its best to scan documents for OCR in greyscale. This uses a third of the space of an RGB colour scan. Line art mode makes for even smaller file sizes – but this often loses detail, reducing the accuracy of subsequent OCR processing.

  • Scanner Operation
  • PMT Scanners
  • CCD Scanners
  • CIS Scanners
  • Scan Resolution
  • Scanner Interpolation
  • Color Scanners
  • Bit-Depth Printers
  • Dynamic Range Scanners
  • Scan Modes
  • Scanner File Formats
  • TWAIN Drivers
  • Color Calibration
  • OCR Technology
  • Photo Retouching

Filed Under: Scanners

Latest Articles

Protect Your Business with Online Reputation Management

As a company grows the potential for negative comments online become more common. This type of activity needs to be monitored for potential issues.  Online Reputation Management (ORM) is important, and here’s how you can get started with it. ORM is still a relatively new concept, and one that small … [Read More...]

EIDE Interfaces

In 1993 Western Digital brought EIDE (Enhanced IDE) onto the market. EIDE is a standard designed to overcome the constraints of ATA while at the same time maintaining backward compatibility. EIDE supports faster data transfer rates - with … [Read More...]

Transferring Software To A New PC

In this particular article, we will discuss the various methods used to transfer software to a new computer.  We will also discuss various applications and tools that are commonly used to streamline and simplify these processes for the average user.  Before transferring, always make … [Read More...]

Everything You Need to Know About Sourcing Circuit Boards From U.S. Suppliers

In This Article This article includes: Why Source PCBs From the United States?How to Get a Quote From a U.S.-Based PCB ManufacturerThe Top U.S. … [Read More...]

Top Taplio Alternatives in 2025 : Why MagicPost Leads for LinkedIn Posting ?

LinkedIn has become a strong platform for professionals, creators, and businesses to establish authority, grow networks, and elicit engagement. Simple … [Read More...]

Shocking Cybercrime Statistics for 2025

People all over the world are becoming more concerned about cybercrime than ever. We have recently collected some statistics on this topic and … [Read More...]

Gaming Laptop Security Guide: Protecting Your High-End Hardware Investment in 2025

Since Jacob took over PC Tech Guide, we’ve looked at how tech intersects with personal well-being and digital safety. Gaming laptops are now … [Read More...]

20 Cool Creative Commons Photographs About the Future of AI

AI technology is starting to have a huge impact on our lives. The market value for AI is estimated to have been worth $279.22 billion in 2024 and it … [Read More...]

13 Impressive Stats on the Future of AI

AI technology is starting to become much more important in our everyday lives. Many businesses are using it as well. While he has created a lot of … [Read More...]

Guides

  • Computer Communications
  • Mobile Computing
  • PC Components
  • PC Data Storage
  • PC Input-Output
  • PC Multimedia
  • Processors (CPUs)

Recent Posts

6 Impressive Ways Cloud Phone Systems Lower Costs

Technological developments have played a very big role in improving the cost-effectiveness of businesses. One of the biggest cost savers is a new … [Read More...]

Floppy Installation Mounting

As will have been seen earlier, the system case we're using has a 3.5in bay assembly capable of holding two 3.5in devices. This may be removed by … [Read More...]

USB Flash Drives

These are not "drives" at all - the misnomer was gained as they were intended to replace old portable drive technology. Offering massive storage and … [Read More...]

[footer_backtotop]

Copyright © 2026 About | Privacy | Contact Information | Wrtie For Us | Disclaimer | Copyright License | Authors