Error Detection

Introduction

Scientists and engineers have worked very hard to make satellites that travel very far from Earth. Voyager 1 is now 13 billion miles away from Earth. When a satellite captures information about space, it must then transmit this data back to a satellite dish back on Earth. Perhaps this data is a photo of Jupiter's great red spot? Or perhaps it's data about the temperature on Neptune? Or movement data of asteroids? Regardless of what the message is we received, how do we know that the data we've received is accurate?

Unforuntately, our atmosphere gets in the way of our messages. Just like looking through a glass of water distorts the image of what's behind it, the charged particles in our atmosphere might distort some of the signals coming from the satellite.

Hold down to transmit bits back to Earth.

When we receive a message, transmitted as many packets of binary numbers, there is a chance that some of the bits may be incorrect. We need a way to figure out (a) if any bits are incorrect and (b) which ones. Just like if you write a letter to someone and send it in the mail. If some of the words are blurred, you might be able to infer the original message from context -- but with 0s and 1s, we don't have any way to understand the context.

Let's think about some ways we could possibly make sure that we know the correct message.

Perhaps the simplest way to do it would be to send multiple copies of the same message. If we send two copies, then we know that there's an error, but we won't know what the error is because . This would be an Error Detecting Code.

If we send three copies, then we will know that there's an error, and be able to see which message is correct. This would be an Error Correcting Code. In this example to the left, we know that the envelope will be the correct message.

This message-encoding scheme would consume a lot of space! Imagine if you bought a state-of-the-art smartphone with 96 GB of memory which used this type of error correction. of the space would be redundant data. The phone would only be able to hold less than GB of memory!

There must be a better way.

Parity Bit

One very simple and cost-effective way to detect an error in a sequences of numbers is with a parity bit. The parity bit is added to the end of the message to tell us something about the rest of the message.

For example we can add one more bit to the end of a sequence of binary numbers depending on the sum of the first bits.

If the sum is even, the parity bit is 0.

If the sum is odd, the parity bit is 1.

With this scheme, after the parity bit has been appended, the sum of the entire encoded number will be .

Adding one parity bit will not always allow the receiver to detect an error. If an number of bits have flipped, the parity will still be even, and no error will be detected.

This simple method of adding a parity bit gives us a hint into how we can build more complex error detection and correction schemes. Before we do that, let's look at an example that we experience quite often.

Bar Codes

We encounter error-detecting codes every time we go to the supermarket, in the form of bar codes. Bar codes are also used for identifying all sorts of things like driver's licenses or babies in hospitals. And in the last decade, 2-dimensional barcodes (QR codes) have been adapted for ticketing at events like concerts or airplane flights.

Barcodes are used for many things!

Bar codes were invented as a way to make it easier for stores to track which items a customer is buying. Numbers are easy for us humans to read, but it's much harder for computers. For example, 1 and 7 or 6 and 8 might look almost the same to a low-resolution camera. A new system was needed, so that cashiers or nurses didn't have to manually type all these numbers into a computer.

Transition: similar to morse code. Joe Woodland, one of the inventors of Bar Code was sitting on the beach, trying to think of a way to encode information for a light scanner to understand.

I remember I was thinking about dots and dashes when I poked my four fingers into the sand and, for whatever reason--I didn't know--I pulled my hand toward me and I had four lines. I said 'Golly! Now I have four lines and they could be wide lines and narrow lines, instead of dots and dashes.'

Joe Woodland, inventor of the Bar Code

Woodland used his knowledge of Morse Code as a foundation to invent a new type of code. By learning about different types of codes, we can build our own knowledge foundation and perhaps invent our own codes!

The original barcode was invented as a circle, but in the 1970s an IBM engineer figured out that a rectangle would be more compact than a bulls-eye.

An early use of barcodes was for tracking train cars.

So how does it work?

Each bar code is a 12-digit number. See here that we can read the numbers written on the bottom. But how does a computer read it?

There are 95 columns, each of which can be black or white. The computer uses a laser to identify each stripe's color, based on how much light is reflected. These 95 columns are grouped into 15 groups of columns. (continue below)

Bar code values

The guards are on the outside and in the center. The first six digits are on the left. The last six digits are on the right.

The Guards

LEFT	CENTER	RIGHT
101	01010	101

The Digits

why is left and right different?

DIGIT	LEFT SIDE	RIGHT SIDE
0
1
2
3
4
5
6
7
8
9

There are a few interesting patterns we can recognize here. First, notice that the codes on the left side are different from the codes on the right side. This allows the bar code to be read upside down or backwards.

Barcodes in different countries

These barcodes with 12 digits are how they work in the US. In Europe, there are 13 digits. And other countries have different protocols for how barcodes. work.

Digit Value

The first digit represents what type of object it is (0 - Standard; 2 - Weighted item like fruit; 3 - Pharmacy; 5 - Coupon)
Digits 2-6 (highlight) represent the Manufacturer's Code
The 7-11 (highlight) represent the Product Code
The final, 12th digit (highlight) is the Error-Detection digit, also called the Modulo Check Character

Different between Left and Right

Every binary representation of digits on the left begins with a 0 and ends with a 1, and every digit on the right begins with a 1 and ends with a 0. This results in a pattern: there will always be a 10 (on the left side) or a 01 (on the right side) on the edge between digits.

Error Checking.

Just like our case with the satellite signals, there is a chance that the laser reading the barcode can make a mistake. Perhaps there is some dirt, or a scratch, or the laser reads it incorrectly. For that reason, barcodes have a special way of determining whether there's been a mistake.

The 12th digit is dependent on the first 11 digits, so that if any of the digits is wrong, we will know (most of the time). Can you guess how the 12th digit can indicate information about the first 11?

The odd digit places are summed and multiplied by three. Then we calculate the sum of the even places, and sum these two values together. The Modulo Check Character is 10 minus the ones digit of this number.

Calculating the Error-Check Digit

X = 3 * (∑ oddDigits) + ∑ evenDigits

N=10−onesX

Example

X = 3 * (0 + 1 + 0 + 0 + + ) + (5 + 0 + 0 + + )

X = 3 * () + ()

X = + 11

X =

N=10−ones23

N = 10 -

N =

When the computer scans the digits of a barcode, it performs this calculation. If the modulo check doesn't match up, we immediately know that the barcode hasn't been read correctly. In a supermarket, the cash register won't beep, and you can try again -- or, as a last resort, enter the numbers manually.

Pick the barcodes that have a valid error-check character.

ACTIVITY: Look around you for something with a barcode (pretty much anything you can buy at a store). Hide the numbers with paper or tape, and try to decode the numbers. After you've written down your answer, see if you're correct! And then you can try the error detection formula and confirm that it is correct. (PHOTOS also)

Barcode Conclusion

These are only the UPC bar codes. There are many other types of bar codes listed here: http://www.makebarcode.com/specs/barcodechart.html

Hamming Codes

Let's go back to our original problem of how we might detect errors in a message received from a satellite. We don't know the message the satellite intended to send, and unlike with barcodes there is no way to find it, such as a cashier who can look at the number and type it into the computer.

A mathematician named Richard Hamming encountered a very similar problem not with data from satellites, but with mechanical computers.

Computers used to be programmed by creating holes in special punch cards, which were then fed into the computer. These were time-consuming to program, and the calculations could take hours or even days to complete. In 1947, Hamming wrote a program to perform a long and complex series of calculations while he went home over the weekend. When he returned, he discovered that an error had occurred somewhere early in the calculation and his entire result was useless. He felt a need to invent a way to correct when an error had happened.

A blue IBM punched card.

If the computer can tell when an error has occurred, surely there is a way of telling where the error is so that the computer can correct the error itself. - Richard Hamming

How can we detect and correct an error in a messaage that is just a sequence of binary numbers? A Hamming Code can be used to do just this. The solution involves inserting new bits into the message that will tell us information about the values of those original bits. If we know the location of where an error occurred, then we know the original value of that bit because .

Encoding a Hamming Code

Click through the slides to see how to encode a string of bits using a Hamming Code.

Let's say we want to encode this string of 8 bits. We call these 8 bits the data bits.

First we must shift the data bits to the right to make room for the parity bits. The parity bits will encode information about the data bits.

The parity bits will go into any position that is a power of 2. Here we place parity bits at 1, 2, 4, and 8.

We must figure out the values that go into the parity bits. We do this by dividing the sequence into one parity group for each parity bit. We then assign the parity bit whatever value will make the parity of the group even.

Let's start with the first parity group. Start at the parity bit in position 1, then choose every other 1 bit.

This group of bits has an parity, so we give the parity bit value .

Now do the next parity group. Start at the parity bit in position 2, then choose every other 2 bits.

This group of bits has an parity, so we give the parity bit value .

Now do the next parity group. Start at the parity bit in position 4, then choose every other 4 bits.

This group of bits has an parity, so we give the parity bit value .

Now let's do the last parity group. Start at the parity bit in position 8, then choose every other 8 bits.

This group of bits has an parity, so we give the parity bit value .

Here is our final encoded string of bits to send!

Did you notice anything about how the bits for each parity group are selected? The answer has to do with the of each digit position.

Here is how to select the digits to include in a parity group.

Convert the place numbers to binary.
Identify the parity groups. Each will be a power of 2.
For each parity group, select any digit whose binary place number has a **1** in that position.

group	1	2	3	4	5	6	7	8	9	10	11	12
1	1	0	1	0	1	0	1	0	1	0	1	0
2	0	1	1	0	0	1	1	0	0	1	1	0
4	0	0	0	1	1	1	1	0	0	0	0	1
8	0	0	0	0	0	0	0	1	1	1	1	1

The 9th digit in the encoded bit sequence will be part of parity groups

The 6th digit in the encoded bit sequence will be part of parity groups

Some table formatting would be nice.

Decoding a Hamming Code

Now that we have an encoded message, how can we extract the data from it? We must reverse the process.

Encoding

Make room for parity bits
Identify parity groups.
Define parity bits.

Decoding

Check parity bits.
Make corrections.
Remove parity bits.

Click through the slides to see how to decode a string of bits using Hamming Codes.

What if we receive this packet of information? Assume we know that is has been encoded using Hamming encoding. This means that each parity group should have an parity. As we check each group, we can mark which have an incorrect bit.

First let's check parity group 1. The parity is . There is wrong here!

Now let's check parity group 2. The parity is . There is wrong here!

Now let's check parity group 4. The parity is . There is wrong here!

Now let's check parity group 8. The parity is . There is wrong here!

We identified there is something wrong in the 1st and 4th parity groups. We can add these values to determine that there is an error with bit . We flip this bit to a .

Now we can remove the parity bits to fully decode our message.

This is not a perfect system. This type of code will not correctly detect an error when there are more than errors.

Other Error Detection and Correction

Credit cards use a checksum digit calculated with Luhn's algorithm to check for errors.

CDs and DVDs use the Reed-Solomon error-correcting code to allow playback even when the disc has scratches.

QR Codes such as Snapchat profile codes also use a Reed-Solomon error-correction code.

PHOTO? Software applications often include a MD5 or SHA-1 checksum that indicates they came from a valid source.

Sign in to Innings2

Reset Progress

Glossary

Error Detection

Introduction

Parity Bit

Bar Codes

So how does it work?

Bar code values

Barcodes in different countries

Digit Value

Different between Left and Right

Error Checking.

Barcode Conclusion

Hamming Codes

Encoding a Hamming Code

Decoding a Hamming Code

Other Error Detection and Correction