You are on page 1of 6

SOFTWARE REQUIREMENT SPECIFICATION

DOCUMENT
Project Title Here
Submitted in partial fulfilment of the requirements for the degree
of
Bachelor of Technology in Information Technology

Project Team Members

Vaidhi Kapoor [1903868] Ayush Julka [1903823]

Gurkirat Singh [1903829] Jatin Wadhwa [1903838]

Supervisor
Mrs. Reeta Bhardwaj
Assistant Professor, IT Department

Date of submission: ___________

DEPARTMENT OF INFORMATION
TECHNOLOGY
DAV INSTITUTE OF ENGINEERING &
TECHNOLOGY
Kabir Nagar, Jalandhar - 144001

July 2019
Table of Contents

Section Page No.

1. Introduction
1.1 Purpose
1.2 Scope
1.3 Definitions, Acronyms, and Abbreviations

2. The Overall Description


2.1 Product Functions
2.2 User Characteristics
2.3 Constraints
2.4 Assumptions and Dependencies

3. Specific Requirements
3.1 External interfaces
3.2 Functional Requirements
3.3 Performance Requirements
3.4 Logical Database Requirements
3.5 Design Constraints
3.5.1 Software
3.5.2 Hardware
3.5.3 Acceptance
1. Introduction

1.1. Purpose

The purpose of this document is to present a detailed description of the Handwritten

Text Recognition System. It will explain the purpose and features of the system, the

interfaces of the system, what the system will do, the constraints under which it must

operate and how the system will react to external stimuli. This document is intended

for both the stakeholders and the developers of the system and will be proposed to the

Regional Historical Society for its approval.

1.2. Scope of Project

Handwritten character recognition is process of converting the hand written work over

page to a attractive digital format. There is a growing demand for software applications

that recognise characters in computer when information is scanned from paper

documents, as we all know that there are many historical, mythical, and newspaper

books and newspapers still in print. They deteriorate day by day as a result of climatic

changes or incorrect handling. As a result, there is a high need presently for "saving the

information available in these paper documents in a computer storage disc and then

utilising this information through a searching procedure." Scanning the documents first

is a straightforward technique to get the information from them into a computer

system. When we scan documents using the scanner, the documents are saved in the

computer system as images. These photos contain text that the user cannot change.

However, reading individual contents and searching the contents of these documents

line-by-line and word-by-word is extremely tough for a computer system to achieve.

The challenge arises because the font qualities of characters in paper documents differ

from those of characters in a computer system. As a result, when reading the


characters, the computer is unable to recognise them. Document processing is the

process of saving the contents of paper documents in a computer storage location and

then reading and searching the material. We occasionally need to process information

that is connected to languages other than English in this document processing.

Document Image Analysis is another name for this procedure (DIA). Many techniques to

dealing with DIA have been offered by researchers in recent years; each strategy has its

own advantages and limitations, which will be described in depth in the next portion of

this study.

1.3 Definitions, Acronyms, and Abbreviations

Term Definition

Machine learning Machine learning (ML) is a type of artificial


intelligence (AI) that allows software
applications to become more accurate at
predicting outcomes without being explicitly
programmed to do so.

Handwritten Text A piece of writing that is handwritten is one


that someone has written using a pen or pencil
rather than by typing it.

Text Document A text file is a type of digital, non-executable


file that contains letters, numbers, symbols
and/or a combination. It enables the creation
and storage of text without any special
formatting.

OCR OCR (optical character recognition) is the use


of technology to distinguish printed or
handwritten text characters inside digital
images of physical documents, such as a
scanned paper document.

Dataset A data set (or dataset) is a collection of data. In


the case of tabular data, a data set corresponds
to one or more database tables, where every
column of a table represents a particular
variable, and each row corresponds to a given
record of the data set in question.

Input Image It is the handwritten image which we provide


to the machine to convert to computer-based
format.

Noise Removal Noise reduction is the process of removing


noise from a signal. Noise reduction techniques
exist for audio and images. Noise reduction
algorithms may distort the signal to some
degree.

Segmentation Image segmentation is the process of


partitioning a digital image into multiple image
segments, also known as image
regions or image objects (sets of pixels).

Feature Extraction Feature extraction is a type of dimensionality


reduction where a large number of pixels of
the image are efficiently represented in such a
way that interesting parts of the image are
captured effectively.

2. The Overall Description

The Overall Description section, of this document gives an overview of the


functionality of the product. It describes the informal requirements and is used to
establish a context for the technical requirements specification in the next chapter.
The third chapter, Requirements Specification section, of this document is written
primarily for the developers and describes in technical terms the details of the
functionality of the product. Both sections of the document describe the same
software product in its entirety but are intended for different audiences and thus
use different language.
2.1 Product Functions

Client

Client GUI (Graphical


User Interface)

OCR
(Easy OCR)

Client Editor

Handwritten Text
Recognition

Programmer

You might also like