Image Processing Overview
Explain in detail and extract all the formulas(if any) with derivation.
The PDF you uploaded, titled "Digital Image Processing" by Prof. M.K. Bhuyan from IIT Guwahati,
introduces the fundamental concepts of digital image processing. It covers the imaging process, system
components, types of digital images, and basic image operations. Here's a detailed breakdown of the
key concepts and all the formulas with explanations and derivations where applicable.
📌 1. Mathematical Representation of an Image
A digital image is represented as a 2D function:
f (x, y)
Where f maps spatial coordinates (x, y) to an intensity value.
In practice, f is defined over a rectangular domain, and values are quantized:
f : [a, b] × [c, d] → [0, 1]
This can be scaled to 8-bit (0–255), 10-bit, etc., based on application.
📌 2. Color Image Representation
Color images are represented using three components: Red, Green, and Blue:
r(x, y)
g(x, y)
b(x, y)
This is a vector-valued function of three intensity channels. This forms the basis of 24-bit images (8 bits
per channel).
📌 3. Linear Point Operation
General form:
g(n) = h[f (n)]
Where h is the transformation function. In linear transformation, this becomes:
g(n) = P ⋅ f (n) + K
This can be used for contrast stretching, brightness adjustment, etc.
Printed using ChatGPT to PDF, powered by PDFCrowd HTML to PDF API. 1/7
📌 4. Image Negative
To create a negative image:
g(n) = (L − 1) − f (n)
Where:
L is the number of gray levels (typically 256)
f (n) is the original pixel intensity
Derivation: Inversion is done by subtracting pixel intensity from max level:
g(n) = max intensity − f (n) = (L − 1) − f (n)
📌 5. Arithmetic Operations Between Images
(a) Image Summation:
n
1
f = ∑ fi
n
i=1
Used in averaging multiple noisy images to reduce noise.
(b) Image Differencing:
fdiff = fr − fs
This helps in change detection and motion detection.
📌 6. Geometric Operations
(a) Translation:
g(n) = f (n + a)
Shifts image by vector a.
(b) Zooming / Scaling:
a 1 (n 1 , n 2 ) = n 1 / c
a 2 (n 1 , n 2 ) = n 2 / d
Where c, d are scaling constants.
(c) Rotation:
Printed using ChatGPT to PDF, powered by PDFCrowd HTML to PDF API. 2/7
a1 (n1 , n2 ) = n1 cos θ − n2 sin θ
a2 (n1 , n2 ) = n1 sin θ + n2 cos θ
This rotates the image counterclockwise by angle θ .
(d) Translation with Offset:
a 1 (n 1 , n 2 ) = n 1 − b 1
a 2 (n 1 , n 2 ) = n 2 − b 2
Where b1 , b2 are the offset parameters.
📌 7. Histogram Equalization
While no explicit formula is listed, the process involves:
k
nj
sk = (L − 1) ∑
MN
j=0
Where:
sk is the output intensity
nj is number of pixels with intensity j
MN is total number of pixels
L is total intensity levels
This redistributes the histogram for contrast enhancement.
📌 8. Image Filtering & Edge Detection
Edge detection uses digital differentiation (e.g., Sobel, Prewitt filters), but exact formulas are not
given in this file.
Filtering often involves convolution:
a b
g(x, y) = ∑ ∑ f (x − s, y − t) ⋅ h(s, t)
s=−a t=−b
Where h(s, t) is the filter kernel.
Summary of Extracted Formulas
Operation Formula
Image as function f (x, y) : [a, b] × [c, d] → [0, 1]
Color image f (x, y) = [r(x, y), g(x, y), b(x, y)]T
Printed using ChatGPT to PDF, powered by PDFCrowd HTML to PDF API. 3/7
Operation Formula
Linear point operation g(n) = P ⋅ f (n) + K
Negative transformation g(n) = (L − 1) − f (n)
1 n
Image summation f= n
∑i=1 fi
Image difference fr − fs
Translation g(n) = f (n + a)
Scaling a1 = n1 /c,
a2 = n2 /d
a1 = n1 cos θ − n2 sin θ
Rotation
a2 = n1 sin θ + n2 cos θ
Offset Translation a1 = n1 − b 1 ,
a2 = n2 − b 2
k nj
Histogram Equalization sk = (L − 1) ∑j=0
MN
Filtering (convolution) g(x, y) = ∑s,t f (x − s, y − t)h(s, t)
Would you like me to create LaTeX notes or a summarized document from this with illustrations and
examples?
Can you provide some examples of application for all.
Absolutely! Here's a detailed list of applications for each formula or concept from the digital image
processing lecture, along with real-world use cases so you can see how they’re applied in practice.
🔢 1. Image as a Function
Formula:
f (x, y) : [a, b] × [c, d] → [0, 1]
Application: Basic digital image representation
Example: Storing satellite images or medical scans in a pixel matrix so that image processing algorithms
can access and analyze pixel intensities.
🌈 2. Color Image Representation
Formula:
r(x, y)
f (x, y) = g(x, y)
b(x, y)
Application: Color image processing
Example: Automatic color correction in smartphone cameras, skin tone detection in facial analysis.
Printed using ChatGPT to PDF, powered by PDFCrowd HTML to PDF API. 4/7
➕ 3. Linear Point Operation
Formula:
g(n) = P ⋅ f (n) + K
Application: Brightness and contrast adjustment
Example: Increasing brightness of underexposed images in photo editing apps like Photoshop or
Snapseed.
🌓 4. Negative Image Transformation
Formula:
g(n) = (L − 1) − f (n)
Application: Medical imaging, photographic negatives
Example: Enhancing radiographs (X-rays) for better visibility of bones by using negative transformation
to emphasize lighter structures.
📊 5. Image Averaging (Denoising)
Formula:
n
1
f = ∑ fi
n
i=1
Application: Noise reduction
Example: Stacking multiple astrophotography images to produce a clear image of the night sky by
reducing sensor noise.
🔁 6. Image Differencing
Formula:
fdiff = fr − fs
Application: Motion detection, change detection
Example: Security cameras use frame differencing to detect intruders moving between frames.
🔁 7. Translation
Formula:
g(n) = f (n + a)
Printed using ChatGPT to PDF, powered by PDFCrowd HTML to PDF API. 5/7
Application: Image registration / alignment
Example: Aligning MRI scan slices taken at slightly different positions.
🔍 8. Scaling (Zoom)
Formula:
a1 = n1 /c,
a2 = n2 /d
Application: Zoom in/out on digital images
Example: Zooming into map tiles in Google Maps using pixel scaling and interpolation.
🔄 9. Rotation
Formula:
a1 = n1 cos θ − n2 sin θ
a2 = n1 sin θ + n2 cos θ
Application: Image orientation correction
Example: Auto-rotating smartphone photos to landscape or portrait based on device orientation.
🧭 10. Translation with Offset
Formula:
a1 = n1 − b1 ,
a2 = n2 − b2
Application: Centering images or cropping
Example: Recentering object of interest in automated cropping tools used in photography software.
📈 11. Histogram Equalization
Formula:
k
nj
sk = (L − 1) ∑
j=0
MN
Application: Contrast enhancement
Example: Improving visibility in low-light surveillance footage or satellite images with uneven
illumination.
🎛️ 12. Image Filtering (Convolution)
Formula:
Printed using ChatGPT to PDF, powered by PDFCrowd HTML to PDF API. 6/7
a b
g(x, y) = ∑ ∑ f (x − s, y − t) ⋅ h(s, t)
s=−a t=−b
Application: Edge detection, blurring, sharpening
Examples:
Edge detection in self-driving car cameras using Sobel/Prewitt filters.
Gaussian blur in Instagram filters to soften backgrounds.
Sharpening filters in scanning apps like Adobe Scan to make text crisper.
🚀 Bonus Use Cases (Higher-Level Concepts)
Segmentation: Isolating tumors in CT scans.
Object Recognition: Detecting and tagging faces in Facebook photos.
Image Understanding: An AI understanding whether an image is of a car or a cat using scene
context.
Would you like these compiled into a neat tabular PDF or Markdown document with diagrams and real-
world images?
Printed using ChatGPT to PDF, powered by PDFCrowd HTML to PDF API. 7/7