Scan text from images

Date: November, 22 2020
PLAGIARISM SCAN REPORT
3% 97% 702 4528

Plagiarised Unique Words Characters
Exclude Url : None
Content Checked For Plagiarism

We humans have a very robust visual system, which helps us to identify people and objects, play sports, perform operations, drive vehicles,
read, and so on. Although it might seem that we do not put any special effort into most of these tasks, the human visual system is fairly
complex to replicate and implement. Computer Vision, in the simplest terms, is the automation of such a visual system, so that computers
or machines, in general, can obtain a high level understanding of the environment from digital images and videos. In the manufacturing
sector identifying defective products and ensuring quality and accuracy is of utmost importance. Object detection, will detect instances of
objects of a given class, such as humans, buildings, or cars in digital photos and films. Computer Vision is vital in implementing object
detection from digital images. OBJECTIVE To develop a computer vision based text scanner that will scan through any image (Example: a
Sudoku Puzzle from the Newspaper) to obtain the respective text from it. METHODOLOGY Any computer vision application starts with
Image acquisition (Image acquisition is the digital representation of the visual characteristics of the physical world).Image sensors are used
to detect and capture the information required to make an image. The images acquired are then processed in the next stage. In this step,
the signals in the acquired images are filtered to remove the noise or any irrelevant frequencies. If needed the images are padded and
transformed to a different space, so as to make them ready for the actual analysis. The processed images are then analysed to extract
useful information, this involves pattern identification, colour recognition, object recognition, feature extraction, motion tracking, image
segmentation, etc. Finally, the high dimensional data obtained from all the above steps is used to produce meaningful numerical
information, which leads to making decisions. SCRIPTS main.py This script combines all the scripts given below. christopher.py This script
consists of a Convolutional Neural Network instructed on a custom dataset. basic.py This script is used to take as input the original image,
apply pre-processing, get the corner points of the board, warp the image and separate out the individual smaller grids (tiles) containing the
individual digits/blanks. sud.py This script is used to take the individual tiles, It does a bit of pre-processing and predicts the digits in each
tile. As the dimensions of the grid are 9x9, the no. of tiles are 81. RESULTS 1) Image Processing The image is converted to grayscale and
further Adaptive Thresholding and Dilation are applied to the image to reduce noise and enhance contours. After this happens, the
coordinates of the Sudoku grid in the image are found. 2) Warping Using the coordinates found, we warp the image and form individual
grids on the image. These individual grids will assist in extracting out the smaller tiles which incorporate a single digit or a blank. 3) Digit
Recognition The individual grids are passed into a convolutional neural network (Christopher) which is pre-trained on a custom dataset.
These grids are recognized and returned in the form of a list. CONCLUSION We can scan a sudoku puzzle of an image using our camera
scanner and then convert it into text, using warping & Artificial intelligence. But in future we plan to work on how to scan any image to
successfully obtain respective texts on it. Further improvements and adaptations of this technology can help us, it improves the Searching
ability of our computers, as our data records continue to get bigger and more complex, computers with OCR will make record searching
much easier. Computers with OCR can scan a document and save it in a database, making it easier to retrieve it quickly in the future.
Inclusion of AI, such that it can read text from images or banners etc. can use in-built software to rapidly translate it into your desired
language, enhancing communication. Also, AI-enabled with OCR would be able to examine paper bills and records, analyse complicated
charts, offer suggestions and make commercial enterprise decisions. AI that is capable of recognizing facial expressions can understand
how people around them are feeling. This offers benefits in the hospitality and healthcare sectors. Assembly processing robots with
computer vision enables identifying faulty merchandise or rotten produce and separating them from quality products.
3% Plagiarised
github.com/manpreet1130/RealTime-Sudoku-Solver

Scan text from images

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Scan text from images

Uploaded by

Copyright:

Available Formats

Date: November, 22 2020

PLAGIARISM SCAN REPORT

3% 97% 702 4528

Exclude Url : None

Content Checked For Plagiarism

You might also like