How does Optical Character Recognition (OCR) works?

You know, it is pretty easy to take words on your computer screen and put them on a physical sheet of paper. Just click print and boom, you will have your data on paper. But going in the opposite direction, scanning dead tree information into your PC, is actually quite a bit trickier.

I mean sure flatbed scanners are not all that difficult to operate, but many of them are basically just taking a picture of the document and saving it on to your PC, meaning not only it will probably not look very crisp due to file compression and little bit of dust in your scanner.

But you can not edit a clean copy of your document in your favourite word processor, because the scanner won’t recognize each individual character.



Fortunately, there are a number of devices out there, that enable Optical character recognition or OCR. Where each character on a page is scanned individually. So, your papers will be uploaded as actual text documents instead of messy JPEGs.

But, how exactly does that work and how one kind of optical scanner is better than another?

Well, because the whole concept of translating text into electronic signal is pretty broad, there have been lots of different implementations of OCR over the years. In fact, one of the earliest electric OCR devices “the OptoPhone” was invented all the way back in 1914.

This bizarre looking contraption relied on the special behaviour of selenium, which conducts electricity differently in light and darkness.

As it scanned the words on a page, the OptoPhone distinguished between the dark ink of text and lighter blank spaces, generating tones that correspond to different letters making it possible for blind people to read with some practice.

Later in 1931, a machine was developed that could convert printed text to Telegraph code. One of the first technologies to translate printed characters to electrical impulses rather than sounds.

But it wasn’t until the 1960s and 70s that OCR began to take a more familiar modern form with postal services using OCR to read addresses and software that could recognize many different fonts.

So, back to present day, when you scan a document, how exactly does the software know what it is looking at?

Well, the first step is to cut out artefacts. So, your OCR program can concentrate on the text and nothing else. So, it attempts to remove dust and other various graphics, align the text properly and convert any colours or shades of gray in the image to black and white only, Making the words themselves easy to recognize.

The next step is to figure out which characters are on the page. Simpler forms of OCR compare each scanned letter, pixel by pixel to a known database of fonts and decide on the closest match.

Smarter OCR however takes this step farther by breaking down each character down to constituent elements, like curves and corners and looking for matching physical features and actual letters. You can think of the differences between those two approaches similarly to the difference between raster and vector images.

OCR software can also make use of a dictionary, so it won’t accidentally spit out nonsense words due to inaccurate scanning. Giving OCR software situational information, can further cut down on errors such as telling it to only try to match numbers, if it is reading ZIP codes on an envelope.

even with these tricks however OCR obviously is not perfect.Which you have probably seen for yourself, if you have ever used it.

But with greater processing power and machine learning techniques that allow software to recognize more subtle patterns over time, OCR has become versatile enough to recognize harder to read typefaces, inconsistently printed material and even handwriting.

And free OCR cloud processing services like Google Drive, which has a lot more machine learning capability than your Home PC for which I hope our fairly obvious reasons, have made OCR more accessible than ever. Even Google translate has a feature to translate anything by pointing camera on writing.

You can just point your camera on the writing and Google translate will translate it for you, which is a obvious combination of Google Translate and OCR technology. So, this is it with OCR technology for now. What do you think about this OCR technology, let me know in comments.

Latest

Data Breaching and Hacking Preventions

Recently,  number of cases of getting Credit card or bank account hacked has grown too much. So, probably this is a great time to talk…..

What is the largest possible Size of a Single File?

Kids from 2000s, you may not remember a time before a 120 GB hard drives were the bare minimum, but other of you from 90s…..

File Systems: What is FAT and NTFS? Which one is better?

You probably know someone, who is always miss placing stuff, I do personally. Whether they can not find their keys in the morning, they leave…..

App Review: MoveIt – transfer Media Files Automatically

From the title of the blog, you will think that, what is new in that? But well, there are some features in this you will…..

Status Forever: Android App to get Status for every mood

Are you looking for new Status to put on your Social media account? Whether you are feeling sad or true love, you will get all…..

Vivo Y69: Review and Specifications

The Chinese smartphone manufacturer Vivo always try to give you good and better Camera. Which is boring in a way. You just don’t get any…..

Apple iPhone X: Top 5 Features, Dream Project of Apple

If you have followed Apple at all, you have probably heard the old cliche that it does things better, instead of doing them first. Nowhere…..

How AHCI or IDE Modes can affect the Drive Performance?

When you finally decide to address a huge bottleneck in your computer by upgrading to an SSD for your Boot drive. But when you finally…..

Technical Terms that you don’t have an Idea of being Misused

Using the word Bacteria to refer to a single germ, which should be bacterium instead, mixing up Discrete and discreet and saying my Facebook got…..

How Important System32 Folder is for Windows OS?

Warning! Deleting this file can cause damage to Windows or another Program. Odds are pretty high that you have seen a message like that, if…..

Samsung Galaxy Note 8 First Impression: Do Bigger Things

Last year’s Galaxy Note 7 was perhaps “the most famous Samsung smartphone in history”, for all the wrong reasons. The company kicked off its recovery…..

Top 6 Upcoming Smartphones of 2017

2017 has been an incredibly exciting year, when it comes to smartphone launches. But that doesn’t mean, it will not get even better. So, Let’s…..

Mobile Networks: Evolution from 1G to 4G Explained

Most of us uses smartphone with 3G or 4G mobile networks, well mostly 4G after Jio Storm. But do you ever think what does it…..

Lenovo K8 Note: New Killer Note is ready to Kill the Ordinary

Chinese Mobile manufacturer Lenovo made a great success with their Lenovo K series. Especially with Lenovo K3 Note, Lenovo K6 Power and Lenovo K6 Note……

Moto’s Flagship Moto Z2 Force: Review and Specifications

Motorola’s best smartphone for 2017 is a thin, classy-looking handheld with a long-lasting battery. It’s compatible with Moto Mod accessories, and sold at a fair…..

Vivo X9s Plus Overview and Specifications

Chinese Smartphone Manufacturer Vivo has making quite the market in India from two to three years. Vivo has risen to the top in trusted mobile…..

Xiaomi Redmi Note 5: Full Specifications and Leaked Details.

Chinese Smartphone Manufacturing Company Mi created many record with their Redmi Series in India. As you are aware of Xiaomi Redmi 4, which is a…..

How does Optical Character Recognition (OCR) works?

You know, it is pretty easy to take words on your computer screen and put them on a physical sheet of paper. Just click print…..

Micromax Evok Note Review: Really Invincible?

Micromax is literally failing to make their mark on smartphone market again. There was a time when Micromax did shook every smartphone companies with their…..

Wireless 802.11 ac Wave 2 Explained

It seems like the onward march of Wireless technology can not be stopped. I mean we keep flinging more and more stuff through the airwaves,…..

  • Get more stuff like this
    in your inbox

    Subscribe to our mailing list and get interesting stuff and updates to your email inbox.