Abstract—This paper explores the conversion of Devanagari Hindi Braille, first to text, and subsequently to speech. The first part of the implementation is the conversion of Hindi Braille to text, in which two approaches are used for Braille character recognition: a conventional sequence-mapping approach and a deep learning-based method. The second part of the paper deals with the conversion of Hindi text to speech, in which text is converted to speech by concatenating speech samples corresponding to Hindi vowels and consonants. Successful conversion of Hindi Braille to text and, consequently, speech, yielded two forms of output. Generated samples of Hindi Braille letters, as well as extracts from a Hindi Braille textbook, were used to create an image dataset. A Hindi speech corpus was created using speech coefficients extracted from a recorded audio sample. The authors achieved an accuracy of 100 percent using the conventional method of Hindi Braille to text conversion and an accuracy of 96 percent using the deep learning approach. Experts also validated the quality of Hindi speech generated from the text-to-speech model, based on factors such as clarity of speech, pronunciation, sound quality, and speed of speech.

Index Terms—Braille to text, deep learning, AlexNet, text-to-speech (TTS) system

I. INTRODUCTION

Braille is the most popular system used by visually impaired people for reading and writing using tactile means. Developed by Louis Braille in 1824 for the French alphabet, Braille now exists for several languages used by sighted people. Bharati Braille, a unified script based on the English Braille script, is used for communication in various Indian languages; it uses a 6-dot cell system, arranged in 3x2 form as shown in Fig. 1.

Fig. 1. Hindi Braille Characters [1]

Optical Braille recognition (OBR) helps capture Braille characters from documents, convert them to images, and process those images to get their natural language equivalents. This technique is used to preserve documents and reproduce them when required. One challenge faced during OBR is that no ink is used while producing the documents to help differentiate between raised dots and the flat surface. Different image enhancement techniques are required to improve the clarity of the images and the dots, which are more intricate when the Braille is double-sided.

Speech synthesis is the production of human voice or speech by a machine. It is mostly used to convert written information into spoken information for convenience. A text-to-speech (TTS) system performs this function. One form of speech synthesis is concatenative, which involves rearranging voice samples spoken by humans into words and sentences.

The motivation behind this paper was to bridge the gap between sighted people and visually impaired people. This project could help preserve Braille books written by the visually impaired. Many sighted people have begun to prefer audiobooks over physical books, due to their portability and ease of use. A similar system for Braille books could help visually impaired people enjoy books on the go. This system could also help sighted people understand the Braille script without any prior knowledge of Braille.

In this paper, the authors describe a novel methodology to convert the obtained Hindi Braille images to new forms of images using certain image processing techniques for dot enhancement and noise reduction. These images are then converted into Hindi text using deep learning, and the obtained text is later converted to speech using a text-to-speech (TTS) system built on the concepts of concatenative speech synthesis.

Authorized licensed use limited to: BOURNEMOUTH UNIVERSITY. Downloaded on June 19,2021 at 15:33:41 UTC from IEEE Xplore. Restrictions apply.

II. LITERATURE SURVEY

In [2], the authors discuss a Braille to Text conversion system using images from a flatbed scanner. The paper elucidates the image processing techniques to differentiate between a recto dot (protrusion) and a verso dot (depression) based on the illumination of light. In [3], the authors talk about an algorithmic approach to differentiate recto dots and verso dots, albeit from recto-dot-only documents and verso-dot-only documents. In [4], the authors discuss the steps involved in building a character recognition system that translates a standard character to its corresponding alphanumeric character from a single-sided page using a conventional method. In [5], the authors talk about a system devised to convert Cyrillic Braille characters to text using artificial neural networks. The multi-layer perceptron was implemented using modified back-propagation algorithms, reducing convergence time. In [6],
the authors discuss a famous Convolutional Neural Network
(CNN) popularly used for image classification, AlexNet. In
[7], the authors give a brief introduction to Support Vector
Machine (SVM) and how it can be used as a binary classifier
in the case of Optical Braille Recognition (OBR), where the
two classes are presence and absence of dots.
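The dot/no-dot binary classification in [7] can be illustrated with a small sketch (not the authors' code): a linear SVM trained by Pegasos-style subgradient descent on synthetic dot/no-dot patches, which stand in for real Braille cell regions.

```python
import numpy as np

rng = np.random.default_rng(0)

def make_patch(has_dot):
    """Synthetic 16x16 grayscale patch: dark dot on a light background."""
    patch = rng.normal(0.9, 0.05, (16, 16))              # light background
    if has_dot:
        yy, xx = np.mgrid[:16, :16]
        patch[(yy - 8) ** 2 + (xx - 8) ** 2 < 16] = 0.1  # dark central dot
    return patch.ravel()

def train_linear_svm(X, y, lam=0.01, epochs=200):
    """Pegasos-style subgradient descent on the hinge loss; y in {-1, +1}."""
    w = np.zeros(X.shape[1])
    t = 0
    for _ in range(epochs):
        for i in rng.permutation(len(X)):
            t += 1
            eta = 1.0 / (lam * t)
            if y[i] * (w @ X[i]) < 1:                    # margin violated
                w = (1 - eta * lam) * w + eta * y[i] * X[i]
            else:
                w = (1 - eta * lam) * w
    return w

X = np.array([make_patch(i % 2 == 0) for i in range(100)])
y = np.where(np.arange(100) % 2 == 0, 1, -1)
w = train_linear_svm(X[:80], y[:80])
accuracy = np.mean(np.sign(X[80:] @ w) == y[80:])        # held-out accuracy
print(accuracy)
```

Because the two patch classes are linearly separable with a wide margin, a linear kernel suffices here; [7] pairs the SVM with Haar wavelet features instead of raw pixels.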
In [8], the authors talk about the methodology used for
designing and creating a Hindi speech corpus consisting of
sentences and phrases, and their respective annotations. In [9],
the authors discuss a concatenative technique of speech syn-
thesis for the Kannada language, creating a database of only
phonemes extracted from MP3 audio files, and concatenating
certain phonemes to form any word or phrase.
The authors of [10] devised a methodology to convert
Kannada Braille to text or speech using Field Programmable
Gate Arrays (FPGAs). Classifications were made on the basis
of number of dots in each Braille cell, and case statements
were used to determine the character based on the presence
or absence of a dot in each of the six positions. In [11], the authors present the development of a system that speaks from Braille writing, using dynamic thresholding, an adaptive Braille grid, and text-to-speech software.
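The case-statement classification in [10] amounts to a lookup from the six dot positions of a cell to a character. A minimal sketch of that idea follows (not the authors' FPGA code, and with placeholder patterns rather than the real Bharati Braille assignments):

```python
# Each Braille cell is read as a 6-bit string over the 3x2 grid
# (row-major): '1' = dot present, '0' = dot absent.
# NOTE: the pattern-to-letter pairs below are illustrative placeholders,
# not the actual Bharati Braille code.
BRAILLE_MAP = {
    "100000": "अ",
    "110000": "ब",
    "100100": "क",
    "111000": "ल",
}

def decode_cell(dots):
    """dots: iterable of six 0/1 flags, one per cell position."""
    key = "".join(str(int(d)) for d in dots)
    return BRAILLE_MAP.get(key, "?")   # '?' for unknown patterns

print(decode_cell([1, 0, 0, 0, 0, 0]))
```

The same table-lookup structure underlies the "conventional approach" of this paper, where the six flags come from detecting dots in a segmented cell image.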
The authors of this paper found that many papers focused on one of the two processes, Braille to Text or Text to Speech. They also found that the languages worked on were mainly foreign and Indian regional languages. Only one paper used neural networks, another used support vector machines, and the rest used manual methods of classification, mainly character-to-binary mapping methods.

Fig. 2. Flowchart of proposed methodology
III. PROPOSED METHODOLOGY

As stated in Section I, the main objective of this work is to convert Hindi Braille samples to text and later into speech. The algorithm to perform the same is shown in Fig. 2. The entire work was coded in Python 3, using a Windows 10 system.

For conversion of Hindi Braille to text, the Braille images are first preprocessed and segmented to obtain individual Braille characters. Once this is done, Braille character recognition is performed using two approaches, i.e., the Conventional Approach and the Deep Learning Approach. In the first approach, the Braille character is converted to a binary sequence using either the Contouring Approach or the SVM Approach and then mapped to its corresponding Hindi letter. In the second approach, concepts of Deep Learning (namely Convolutional Neural Networks) are used to train a Convolutional Neural Network called AlexNet using a dataset containing images of Braille letters.

For conversion of text to speech, concatenative speech synthesis is implemented, wherein the vowel and consonant combinations in every word are mapped to their corresponding audio files from a speech corpus.

A. Dataset Creation and Preprocessing

The Devanagari Hindi Braille script consists of 57 characters. To create the Braille dataset, images were captured from a Hindi Braille book [12] using a mobile camera or a flatbed scanner, as shown in Fig. 3. The dataset consisted of 34,800 images, 600 images for each of the 58 Hindi Braille characters. The images were created by slightly changing the position and size of dots, while also adding some amount of skewness to them.

Fig. 3. Images captured by various devices: (a) Mobile Camera, (b) Flatbed Scanner

Images captured using a mobile camera or a flatbed scanner tend to have noise content, and some dots in the images might not be visible due to various lighting factors. Therefore, to enhance Braille dots for easy Braille character recognition, various preprocessing methods were employed to obtain noise-free images, as explained below.
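The dataset-generation step described above (jittering dot position and size, then adding slight skew) could be sketched as follows; the cell geometry, jitter ranges, and image size are assumed illustrative values, not the authors' parameters.

```python
import numpy as np

rng = np.random.default_rng(42)

def render_cell(pattern, size=60, radius=6):
    """Render one 3x2 Braille cell (dark dots on white) with random jitter."""
    img = np.full((size, size), 255, dtype=np.uint8)
    # Nominal (row, col) dot centres for the six positions, row-major.
    anchors = [(15, 20), (30, 20), (45, 20), (15, 40), (30, 40), (45, 40)]
    for present, (cy, cx) in zip(pattern, anchors):
        if not present:
            continue
        cy += rng.integers(-2, 3)            # jitter dot position
        cx += rng.integers(-2, 3)
        r = radius + rng.integers(-1, 2)     # jitter dot size
        yy, xx = np.mgrid[:size, :size]
        img[(yy - cy) ** 2 + (xx - cx) ** 2 <= r * r] = 0
    return img

def skew(img, shear=0.1):
    """Apply a small horizontal shear by shifting each row to the right."""
    out = np.full_like(img, 255)
    for row in range(img.shape[0]):
        shift = int(shear * row)
        out[row, shift:] = img[row, :img.shape[1] - shift]
    return out

sample = skew(render_cell([1, 0, 1, 0, 1, 0]))
print(sample.shape)
```

Repeating this with random patterns and shears yields arbitrarily many labelled variants per character, which is how a 600-images-per-character dataset can be populated.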
1) Gaussian Blur

Gaussian Blur is a method where an image is convolved with a Gaussian filter (a low-pass filter) in order to remove the high-frequency noise components, as shown in Fig. 4. In the case of Braille character recognition, the sharpness of Braille dots is essential for edge detection. Therefore, an optimal kernel size had to be found by trial and error to remove noise while retaining the sharpness of the dots.

2) Thresholding

Thresholding is the process of changing pixel values based on a predefined threshold value to convert an image to a binary image. In the case of single-sided Braille, adaptive thresholding was used. In the case of double-sided Braille images, the Threshold to Zero technique was used to differentiate between Recto dots (protrusions), Verso dots (depressions), and the background, as shown in Fig. 5.

Fig. 5. Thresholding (a) Single sided, (b) Double sided

3) Erosion and Dilation

Morphological operations are image processing techniques that depend on the shapes present in an image. Since the shape of the dots has to be maintained, the two main morphological operations used are Erosion and Dilation. Fig. 6 shows the image obtained after applying a few rounds of Erosion and Dilation.

single-sided image. For double-sided images, various methods were employed to differentiate between Recto and Verso dots for the final Braille character segmentation process.

1) Canny Edge Detection

Edge detection is the process of finding boundaries of objects within an image, which is done by finding sudden changes in the color of pixel values. As shown in Fig. 7, an edge is detected when there is a sudden change from the white background to a black dot. Therefore, Canny Edge Detection, one of the most popular algorithms, was used for edge detection.

Fig. 7. Image after application of Canny edge detection

2) Find Contour Method

Contouring is a method where boundaries are drawn around continuous points having the same color or intensity. Since this method detects shapes and objects effectively, it works well in finding Braille dots as well. Contouring is applied to binary images to improve accuracy.

An image moment, defined as the weighted average of image pixel intensities, is used to calculate the centroids of all the contours in the image. These centroid values are then used to draw uniform dots on the image by replacing the existing irregular Braille dots.

3) Differentiation of Recto and Verso Dots

As explained in Section III, techniques like Gaussian Blur and Thresholding are applied on the double-sided image, and later centroid detection is done using Canny Edge Detection and the Find Contour method. These centroid values are then used to differentiate between Recto and Verso dots.

In a flatbed scanner, the reflection of light is captured differently for different types of dots. When a Braille page is scanned, Recto dots have a light region followed by a dark region, and vice-versa for Verso dots. This image can be enhanced using Threshold to Zero.
threshold value is considered. If the distance between the consecutive x-coordinates is greater than the threshold, then that distance is the distance between two horizontally consecutive Braille cells. A line is then drawn using the average of the two consecutive x-coordinates and the y-coordinate span for that x-coordinate. The characters are then cropped out and saved as separate image files. Fig. 10 shows the output after vertical segmentation.
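The gap rule described above can be sketched as follows, operating on sorted dot x-coordinates; the threshold value here is an assumed placeholder, not the one used by the authors.

```python
def split_columns(x_coords, gap_threshold=12):
    """Group dot x-coordinates into cells: a gap wider than the threshold
    marks the boundary between two horizontally adjacent Braille cells, and
    the cut line is placed at the average of the two coordinates."""
    xs = sorted(x_coords)
    cuts = []
    for a, b in zip(xs, xs[1:]):
        if b - a > gap_threshold:
            cuts.append((a + b) / 2)          # boundary between two cells
    # Partition the coordinates into cells using the cut positions.
    cells, current, remaining = [], [], cuts[:]
    for x in xs:
        if remaining and x > remaining[0]:
            cells.append(current)
            current = []
            remaining.pop(0)
        current.append(x)
    cells.append(current)
    return cuts, cells

cuts, cells = split_columns([5, 9, 30, 34, 60])
print(cuts, cells)
```

With the sample input, the two wide gaps (9 to 30 and 34 to 60) produce cut lines at 19.5 and 47.0, splitting the dots into three cells; each cell's span is what gets cropped to a separate image file.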
Fig. 8. Drawing uniform dots using centroid positions (a) Single sided, (b) Double sided
Fig. 14. Text after matra correction
IV. COMPARISON OF RESULTS

The two approaches used by the authors for the conversion of Hindi Braille to text are comparable. The conventional method is a simple approach yielding optimal results, but excessive processing has to be performed on the images to correctly determine the number of contours for making the binary sequence. SVM was found to be an excellent binary classifier for classifying the images as dots or no-dots. Even though the accuracy of SVM is 100%, in comparison to the overall conventional approach, the accuracy of Braille to text conversion is not 100%, since SVM is not the only factor affecting that accuracy. The deep learning approach, which uses AlexNet, on the other hand, is robust but slightly less accurate than the conventional method (Table III). The drop in accuracy could be addressed with a larger dataset. For both approaches, the placement of dots must be correct and close to the original Braille characters for better recognition, since some characters have only a slight change in position that differentiates them from the others.

Comparing the results of the Braille to Text conversion, the authors found that the output of the SVM approach was more accurate compared to that of the CNN AlexNet approach for both kinds of splits. Although the conventional approach's output cannot be quantified, on inspection, the resulting text was very similar.

TABLE III
Comparison of results (macro averaged)

Metric     SVM approach   AlexNet (random split)   AlexNet (stratified split)
Accuracy   100%           96.07%                   96.47%
PPV        100%           93.29%                   93.45%
TPR        100%           94.34%                   94.75%
F1-Score   100%           93.25%                   93.71%

The authors conducted a survey for 50 samples produced by the Hindi Text to Speech converter. The average scores (out of 5) on the basis of clarity (4.142), speed (4.013), and pronunciation (4.081) were determined.

In [4], contouring is employed for Braille character recognition. They get an accuracy of 100% with images that have 0-0.5 degrees of tilt, and accuracy drops to 1% for 1.5 degrees of tilt. However, skewness correction on the images results in high accuracy for images with tilt greater than 1.5 degrees. In [5], an ANN is used for Braille character recognition. For 33 training and 8 testing images per character in Cyrillic, an accuracy of 95% is obtained. In [16], an MLP model is trained to achieve an accuracy of 98%, but skewness and non-uniform dots are not considered. In this proposed work, in contrast, skewness correction results in high accuracy for tilt greater than 1.5 degrees. Furthermore, a dataset of 34,800 images has been considered to obtain an accuracy of 96.47%.

V. CONCLUSION

The authors successfully implemented the conversion of Braille to speech for Hindi. Two approaches were used: a conventional approach, which was implemented using two methods, contouring and SVM, and a deep learning approach. The accuracy was found to be better in the case of the conventional approach, provided the manually set thresholds don't have to be changed for every image. Furthermore, contouring was a simpler and more reliable method for the conventional approach implementation. In terms of robustness, deep learning gave good results as there was no need for manually setting up thresholds. As future work, a real-time application can be implemented.

ACKNOWLEDGMENT

The authors would like to thank PES University for supporting this work.

REFERENCES

[1] M. Clute, "Elephants, and mysore, and hindi braille, oh my!," [Online]. Available: https://istep2013.wordpress.com/2013/07/15/technical-challeges-of-hindi-braille
[2] A. Antonacopoulos and D. Bridson, "A Robust Braille Recognition System," in S. Marinai and A. R. Dengel (eds), Document Analysis Systems VI, Lecture Notes in Computer Science, vol. 3163, Springer, Berlin, Heidelberg, pp. 533-545, 2004.
[3] T. Shreekanth and V. Udayashakara, "An Algorithmic Approach For Double Sided Braille," Int. J. Image. Process. Vis. Commun., vol. 2, no. 4, 2014.
[4] J. Subur, T. A. Sardjono, and R. Mardiyanto, "Braille Character Recognition Using Find Contour Method," International Conference on Electrical Engineering and Informatics (ICEEI), pp. 699-703, 2015.
[5] K. Smelyakov, A. Chupryna, D. Yeremenko, A. Sakhon, and V. Polezhai, "Braille Character Recognition Based On Neural Networks," IEEE Second International Conference on Data Stream Mining and Processing (DSMP), pp. 509-513, 2018.
[6] A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet Classification with Deep Convolutional Neural Networks," Advances in Neural Information Processing Systems, pp. 1097-1105, 2012.
[7] J. Li, X. Yan, and D. Zhang, "Optical braille recognition with haar wavelet features and support-vector machine," IEEE International Conference on Computer, Mechatronics, Control and Electronic Engineering, vol. 5, pp. 64-67, 2010.
[8] D. Magdum, M. S. Dubey, T. Patil, R. Shah, S. Belhe, and M. Kulkarni, "Methodology for designing and creating hindi speech corpus," IEEE International Conference on Signal Processing and Communication Engineering Systems, pp. 336-339, 2015.
[9] M. Dhananjaya, B. N. Krupa, and R. Sushma, "Kannada text to speech conversion: a novel approach," IEEE International Conference on Electrical, Electronics, Communication, Computer and Optimization Techniques (ICEECCOT), pp. 168-172, 2016.
[10] S. R. Rupanagudi, S. Huddar, V. G. Bhat, S. S. Patil, and M. Bhaskar, "Novel methodology for kannada braille to speech translation using image processing on FPGA," IEEE International Conference on Advances in Electrical Engineering (ICAEE), pp. 1-6, 2014.
[11] N. Falcon, C. M. Travieso, J. B. Alonso, and M. A. Ferrer, "Image processing techniques for braille writing recognition," International Conference on Computer Aided Systems Theory, Springer, pp. 379-385, 2005.
[12] S. Barahat, "Aethihaasik Kathaaen," All India Confederation for the Blind (AICB) Printing Press.
[13] J. P. Vert, K. Tsuda, and B. Scholkopf, "A primer on kernel methods," Kernel Methods in Computational Biology, vol. 47, pp. 35-70, 2004.
[14] M. ul Hassan, "AlexNet – ImageNet classification with deep convolutional neural networks," [Online]. Available: https://neurohive.io/en/popular-networks/alexnet-imagenetclassification-with-deep-convolutional-neural-networks/
[15] Vasant, Part 3, Textbook for Class 8, National Council of Educational Research and Training.
[16] B.-M. Hsu, "Braille Recognition for Reducing Asymmetric Communication between the Blind and Non-Blind," Symmetry, vol. 12, no. 7, p. 1069, Jun. 2020.