Learning to see : Image Histogram

December 17, 2022

Learning to see

An image is what a camera captures. Technically, an image is a discretized reflectance map of an illuminated scene. Light reflecting off objects is captured by a sensor and converted to a discrete value by an analog-to-digital converter (ADC).

Inside the computer memory the image is stored as a 2-dimensional array of numbers. To see, the computer needs to find the information of interest from the 2-dimensional collection of numbers. For example, for autonomous driving application the information of interest would be the obstacle and free lane information in front of the robot. Another example would be optical character recognition (OCR), where the information of interest is the character present in the image.

Histogram

A histogram is a frequency tally i.e. given a collection of objects the histogram tells how many of a each object type are there in the collection. An example follows:

An image histogram is similarly a tally of image pixels. For grayscale images, each pixel in the image is a number between 0-255 and the histogram essentially represents the distribution of intensities in the image.

The image histogram is a concise yet powerful representation of the image intensity contents. The hills and valleys convey important information about the contents of the image. For example, the hill denoted as 'darker' in the above histogram corresponds to the box shown in the image. For simple images containing a principle foreground against a background this information is enough to extract the foreground object. This is how the green screen background is removed in modern VFX.

Next time we will take a look at an example of object extraction using histogram information. Cheers!

Search This Blog

CotsSense