We humans have a very robust visual system, which helps us to identify people and objects, play sports, perform operations, drive vehicles, read, and so on. Although it might seem that we do not put any special effort into most of these tasks, the human visual system is fairly complex to replicate and implement. Computer Vision, in the simplest terms, is the automation of such a visual system, so that computers or machines, in general, can obtain a high level understanding of the environment from digital images and videos. In the manufacturing sector identifying defective products and ensuring quality and accuracy is of utmost importance. Object detection, will detect instances of objects of a given class, such as humans, buildings, or cars in digital photos and films. Computer Vision is vital in implementing object detection from digital images. OBJECTIVE To develop a computer vision based text scanner that will scan through any image (Example: a Sudoku Puzzle from the Newspaper) to obtain the respective text from it. METHODOLOGY Any computer vision application starts with Image acquisition (Image acquisition is the digital representation of the visual characteristics of the physical world).Image sensors are used to detect and capture the information required to make an image. The images acquired are then processed in the next stage. In this step, the signals in the acquired images are filtered to remove the noise or any irrelevant frequencies. If needed the images are padded and transformed to a different space, so as to make them ready for the actual analysis. The processed images are then analysed to extract useful information, this involves pattern identification, colour recognition, object recognition, feature extraction, motion tracking, image segmentation, etc. Finally, the high dimensional data obtained from all the above steps is used to produce meaningful numerical information, which leads to making decisions. SCRIPTS main.py This script combines all the scripts given below. christopher.py This script consists of a Convolutional Neural Network instructed on a custom dataset. basic.py This script is used to take as input the original image, apply pre-processing, get the corner points of the board, warp the image and separate out the individual smaller grids (tiles) containing the individual digits/blanks. sud.py This script is used to take the individual tiles, It does a bit of pre-processing and predicts the digits in each tile. As the dimensions of the grid are 9x9, the no. of tiles are 81. RESULTS 1) Image Processing The image is converted to grayscale and further Adaptive Thresholding and Dilation are applied to the image to reduce noise and enhance contours. After this happens, the coordinates of the Sudoku grid in the image are found. 2) Warping Using the coordinates found, we warp the image and form individual grids on the image. These individual grids will assist in extracting out the smaller tiles which incorporate a single digit or a blank. 3) Digit Recognition The individual grids are passed into a convolutional neural network (Christopher) which is pre-trained on a custom dataset. These grids are recognized and returned in the form of a list. CONCLUSION We can scan a sudoku puzzle of an image using our camera scanner and then convert it into text, using warping & Artificial intelligence. But in future we plan to work on how to scan any image to successfully obtain respective texts on it. Further improvements and adaptations of this technology can help us, it improves the Searching ability of our computers, as our data records continue to get bigger and more complex, computers with OCR will make record searching much easier. Computers with OCR can scan a document and save it in a database, making it easier to retrieve it quickly in the future. Inclusion of AI, such that it can read text from images or banners etc. can use in-built software to rapidly translate it into your desired language, enhancing communication. Also, AI-enabled with OCR would be able to examine paper bills and records, analyse complicated charts, offer suggestions and make commercial enterprise decisions. AI that is capable of recognizing facial expressions can understand how people around them are feeling. This offers benefits in the hospitality and healthcare sectors. Assembly processing robots with computer vision enables identifying faulty merchandise or rotten produce and separating them from quality products. 3% Plagiarised