You are on page 1of 7

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/338116738

Automatic License Plate Recognition for Indian Roads Using Faster-RCNN

Conference Paper · December 2019


DOI: 10.1109/ICoAC48765.2019.246853

CITATIONS READS

12 1,891

2 authors:

Praveen Ravirathinam Arihant Patawari


Birla Institute of Technology and Science Pilani TNeGA
14 PUBLICATIONS   27 CITATIONS    4 PUBLICATIONS   14 CITATIONS   

SEE PROFILE SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Global Alzheimer's Association Interactive Network View project

Automatic License Plate Recognition for Indian Roads Using Faster-RCNN View project

All content following this page was uploaded by Praveen Ravirathinam on 23 December 2019.

The user has requested enhancement of the downloaded file.


Automatic License Plate Recognition for Indian
Roads Using Faster-RCNN
Praveen Ravirathinam Arihant Patawari
Department of Computer Science and Information Systems Centre of Excellence
BITS Pilani, Pilani Campus Tamil Nadu e-Governance Agency (TNeGA)
Pilani, India Chennai, India
praveen.ravirathinam@gmail.com apatawari.tnega@tn.gov.in

Abstract—Use of deep learning in Automatic License Plate own dataset, that is aligned towards the Indian scenario,
Recognition has been a topic of common research, in the past which is explained in detail, in a future section.
few years. Recognition of Indian plates have always been a One of the main contributions of this paper is the
challenge due to their aberrant nature with different fonts, size introduction of a full, end-to-end pipeline for ALPR in India,
of letters, padding and number of lines. In our work, we propose
a fully functional, end to end solution for license plate
using the state-of-the-art Faster Regional Convolutional
recognition in India, considering all irregularities. Our system Neural Network (Faster-RCNN) [2] and Convolutional
uses a series of the state-of-the-art Faster Regional Neural Network (CNN) in a cascaded manner. We used
Convolutional Neural Network to create a pipeline that gives an Faster-RCNN for LPD and CS and we used CNN for CR,
efficient solution to the Indian situation under various each with its own specifically trained weights.
scenarios. Since there is no publicly available dataset for Indian Another important contribution is the introduction of a
License plates, we made a balanced dataset using frames from technique for creating a character recognition network for
videos and photos from handheld devices, taking into India, using data created from CS network to enhance the
consideration all the irregularities. Our pipeline produced an training further. One more contribution is the introduction of
overall 88.5% total correctness and 10% partial correctness
(greater than 5 characters correct) for Indian plates. Inclusion
a technique for accurate CS in Indian Plates. We also
of a novel heuristics system increased total correctness to 91%. introduce a dataset for Indian Plates.
License plate detection had a precision of 94.98% for all types The paper is organised as follows. We touch work related
of vehicles. Our pipeline successfully segments over 99% of to ALPR in Section II. Section III contains the proposed
characters from license plates with a mean average precision of system which discusses in detail, our dataset and the ideology
99.55% and was able to correctly recognise 98.6% of the behind it, training procedures, frameworks of all the three
segmented characters. networks present in pipeline, along with the heuristic ideas
that have been applied. The details of testing process and the
Keywords—Automatic License Plate Recognition, Faster- results obtained are presented in Section IV. Conclusion and
RCNN, Deep Learning, ResNet, VGG, Computer Vision, future plans are reviewed in Section V.
Character Segmentation, Character Recognition
II. RELATED WORK
I. INTRODUCTION In this section, we shall examine some common features
Automatic License Plate Recognition (ALPR) is an present in ALPR as well as problems in India and how to
important component of any smart city initiative or approach them. This section will also show why image
intelligent transport system. ALPR has many practical processing techniques, like the ones used in Bangladesh [3]
applications including smart parking lot, toll systems, police will not work in the Indian scenario.
chases, traffic signal violation detection, identification of A. Challenges in India
stolen/unregistered vehicles and border control. ALPR in There are many problems that could arise in Indian plate
India has always been a challenge due to the many intricacies detection. The first problem is with the vast variety of
present in Indian plates and diversity of vehicles and their vehicles present on a typical Indian road. The vehicle types
types. include cars, motorcycles, scooters, lorries, buses, auto
Most ALPR systems consist of three main tasks, namely rickshaws, SUVs, mini trucks, vans, tractors etc. Each of
License Plate Detection (LPD), Character Segmentation these types have a different format and style of plate. Plates
(CS) and Character Recognition (CR) [1]. The system we also have different number of lines among different types and
propose also has these three stages along with a heuristics sometimes even within the same type. For example, 95% of
logic filter at the end. Each stage has been trained with two wheelers and auto rickshaw vehicles have two
respect to the Indian situation, taking multiple challenging lines(multiline), whereas 90% of the cars have single line
cases for training from each category of problems. The plates. Another problem is the layout of the plates even with
problems with ALPR in India are vast, most of which, with the same number of lines. For example, the layout of plates
the rise of Deep Learning (DL) techniques and proper in buses and motorcycles are different, even though both are
training can be addressed. However, one major factor in any multiline. To add to the problems, there is the usage of
DL process is the availability of a proper dataset. This is a several types of fonts and custom designed plates. The plates
major problem with ALPR in general, and India in particular, are also of different shapes and sizes i.e. all plates are not
because there is no public dataset available in India, which rectangular, some are trapezoid and other irregular shapes.
covers all the problems. So, we had to resort to creating our
B. Deep learning through simple logic statements. Faster-RCNN has a base
As mentioned before, though there are various challenges CNN on which regional proposal and classification occur.
present in India, usage of deep learning techniques can This base CNN can be VGG, ResNet [8], Inception etc. We
address most of these issues with a high level of accuracy. chose to do a comparison between VGG16 and ResNet-50 as
Over the past few years, Convolutional Neural Networks base CNNs. We trained models using both bases individually
have been used in various computer vision tasks. Many and chose the better one among the models at the end.
frameworks of CNNs have also arisen and for each task, a Some ALPR pipelines incorporate the use of vehicle
particular framework can be chosen to produce best results. identification before locating license plate however we chose
We chose the Faster-RCNN approach due to ease of training not to include this due to the sheer number of vehicle types
and ability of identifying small objects at a high speed. Some on a typical Indian road. A summary of our pipeline is given
plates in buses and autos are very small and require a lot of in Fig. 1.
training to locate those plates. Use of deep learning
techniques to solve the ALPR problem has shown to work in A. Dataset
Brazil [4] and in other countries [1],[5]-[6]. The most important factor in any deep learning problem
is a good dataset. A biased or unbalanced dataset can lead to
C. OCR wrong or biased results. It must be made sure that there is no
Many ALPR pipelines employ OCR techniques for common feature among the items in the dataset apart from
Character recognition. Unfortunately, OCR methods did not the features of interest.
work in our testing dataset. This can be attributed to the fact There are many factors to take into account, while
that letters on Indian plates are not of standard size and shape. considering India. We had to make sure that the dataset
Another major problem in the Indian context is that the created was balanced in terms of types of vehicles as well as
characters on the plate are not limited to the traditional layouts of plates. First, we found that the types of vehicles
alphanumeric dataset, but also native and indigenous letters. present in a typical Indian city road consist of cars
This is because there is a lack of standardization of plates in (hatchbacks, sedans and SUVs), two wheelers (motorcycles
India. This creates a huge challenge, as there is no proper and scooters), buses (government and private), lorries, mini
dataset for these characters. We have chosen not to include trucks and auto rickshaws. We also found that within a
such plates in our study. Owing to these factors, we chose to particular type itself, there are many layouts of plates
incorporate a CNN for CR rather than using OCR. possible. For example, majority of cars have single line
plates, but some cars have two-line plates. This brings the
D. OpenALPR next consideration of a balanced number of plates with
One of the most famous open-source ALPR software is respect to the number of lines. Indian plates can be of a single
OpenALPR [7]. It works on the principle of detection line or two lines. We also found that, within vehicle types,
followed by segmentation and then OCR processing. The same number of lines does not mean same layout. For
detector and segmenter can be trained, using your own data example, majority of auto rickshaw and motorcycle plates
which was tried using our custom dataset. Corresponding are two lined, but their layouts are different. The fonts used
files were created and tested using a validation dataset of 200 in both are also different, with some plates in auto rickshaws
images. The resulting model was about 96% accurate in having letters painted instead of printed.
detecting region of plate, only failing in cases where the plate Due to all these factors, no standard data set can be used
is very small or multiple license plate-like objects were for training models to be used in the Indian context. Training
present in the images. However, correct segmentation models, taking these factors into account will cover the
occurred in less than 30% cases. Since OpenALPR operates standard cases, since standard cases have less variance when
on a binary image, many times, a nail was considered a letter compared to the cases we are dealing with. Therefore,
and classified as ‘O’. Moreover, the OCR could not classify keeping all the factors in consideration, we began collecting
many characters. We had to incorporate the European OCR data. We first tried to extract frames from the traffic camera
to get some results. Due to the random features in letters and feed, but the frames extracted were not of great clarity. The
because of the fact that India does not follow a consistent font plates were also not visible and were too far. So, we figured
usage for all plates, OCR failed to give good results. that the best way is to collect the images ourselves. We used
handheld devices and video cameras to capture images and
All these results further encouraged us to use deep
learning approach for ALPR

III. PROPOSED SYSTEM


This section describes in detail, the proposed approach of
ALPR in India and is divided into five subsections, one for
discussing our dataset and ideology, one for each stage of
ALPR in our pipeline (LPD, CS and CR) and finally one
regarding heuristic ideas applied. We are covering each stage
in detail because each stage uses a different model and
sometimes even the CNN used is not the same. Each stage
has its parameters fine-tuned particularly for the task it is
supposed to do. Faster RCNN model was used for LPD and Fig 1. Summary of pipeline showing all 3 networks present giving a
result with confidence of 93.99%.
CS whereas CNN was used for CR. Heuristics were applied
videos directly from the road, after taking the consent of number is engraved or painted on the vehicle instead of
vehicle owners. We also downloaded good quality images having a plate. Taking all these factors into consideration, we
from websites trading in second-hand vehicles. decided that the best way to go about this was to label
Finally, we got a balanced dataset of 2806 images, shown rectangular bounding boxes for all plates, even the oddly
in Fig. 2, consisting of all vehicle types, license plate shapes, shaped ones as shown in Fig. 3. We used Faster-RCNN using
license plate layout and fonts of which 2209 images were both VGG16 and ResNet-50 model separately to train this
used for training and 597 were used for testing the LPD network.
network. We also got 824 plates for CS network of which Over multiple training examples, this method of labelling
711 were used for training and 113 were used for testing. and training worked, and the system was able to identify the
plates. We also found the optimal number of epochs for
Many people did not want to make the images of their
training to be 100 using mAP as a measurement of error. The
vehicles with plate details publicly available, which is why
relationship between epochs and mAP with testing images
the images are small and is just meant to get an idea on the can be found in Table I.
type of images we used in training. For afore stated reasons,
we are not releasing our dataset publicly. C. Character Segmentation
Another point to note is that in India the letters on the This part was very challenging, because this is the stage
plates are not printed in all cases. For example, in autos most where the most randomness is present in the whole pipeline.
of the time the letters are painted i.e. handwritten. So, while Standard methods for character segmentation like the
collecting data we made sure that we took this factor into OpenALPR [7] will not work here because the characters in
account. As a result, in many cases, our final pipeline was Indian plates are not of uniform shape, size, font and even
able to give correct answers for such painted handwritten padding. Unlike most countries, the number of characters per
plates. However, our dataset does not include non- plate in India is also different as well as the number of lines.
alphanumeric characters. This includes characters from Even within the lines, the number of characters per line
varies. For the most part, four wheeled vehicles have single
native languages such as Hindi, Tamil and Sanskrit.
line plates with characters, whereas two wheelers in majority
Normally found in rural areas, we were not able to obtain have multiline plates.
many of such plates. Our model does not work for cursive Number of characters per plate varies from 7 to 10 with
letters as well. However, plates with cursive letters are very varying number of letters and numbers. So, we cannot apply
rare. If a dataset of these plates is obtained the method of a pattern recognition model at this stage. The padding also
training we are using will work. As a result, our model cannot varies and ranges from 1 cm to 0.5 cm between letters.
account for these cases and is thus a limitation to our system. Another feature is that the letters occur in blocks i.e. letters
are clustered together. As observed, plates do not follow a
B. License Plate Detection particular pattern unlike other countries like Brazil [8], so we
Most countries have their plates to be rectangular. labelled our plates in a way to train the network to identify
However, in India, as mentioned before, plates come in all character locations, irrespective of where they are in the
shapes and sizes. For the most part plates, are rectangular in plate.
shape but two-wheeler vehicles also have a trapezoid shape For this, we took 711 plates consisting of balanced
plate in some cases. As for sizes, the mean size of plates number of multiline and single line plates with all the
within different vehicles varies with cars having a mean size
of 500 mm x 120 mm and two wheelers having 200 mm x
100 mm. Auto Rickshaws also have very small plates and are
fixed in random of the vehicle. And in most cases, the license

Fig 3. Few labelled sample images used for training the LPD network.

TABLE I
OPTIMAL EPOCH NUMBER FOR LPD NETWORK

Number of
Number Overall
Network Base CNN Training
of Epochs mAP
Samples
15 88.4%
30 91.7%
License Plate
50 92.5%
Detection ResNet-50 2209
80 93.8%
(LPD)
Fig 2. A few images from the custom dataset created. It contains a mix 100 94.98%
of all factors including multiline, buses, autos, multi-spacing and even 120 94.81%
trapezoid shaped plates.
different fonts. Labelling was done by rectangular bounding represents a number and @ represents a letter. Like this,
box around each character in the plate as shown in Fig. 4. We multiple situations were mapped, and this led to an increase
tried another method where we label parts on plate that could in accuracy of pipeline. An example of this is if the plate was
be mistaken as a character using ‘NotChar’ label. However, identified as ‘DL1SED6345’ the heuristics will correct it to
inclusion of this label class led to confusion, so we chose not ‘DL15ED6345’ replacing the ‘S’ with ‘5’ as the rules state
to include it in the final model. that a number must occur in 4th position of a plate with 10
Once again, we measured mAP of testing plates to find characters. Replacement only occurs with similar feature
out optimal number of epochs and found it to be around 20. characters such as ‘1’ and ‘I’, ‘0’ and ‘O’, ‘A’ and ‘4’, etc.
The relationship can be found in Table II. The vice versa case of a number being replaced with a letter
has also been mapped. However, all scenarios cannot be
D. Character Recognition mapped and there are constraints to this system i.e. not all
The major problem in character recognition is the vast situations can be rectified.
number of fonts used in plates. We figured the best way to One has to note that these heuristics are specific to India
address this is to train a network to identify the letter/number and must be customized according to the country of interest.
from a cropped image of the particular character. This way,
the network can learn features associated with each IV. RESULTS
letter/number, and can thus be used for covering various
fonts, including those used in the painted plates. In this section, we will discuss about the results obtained
For this, we employed a standard CNN for classification for each task and also about the overall accuracy of the
and used cropped letters from plates as training data. pipeline.
Initially, we manually labelled 1000 characters for extraction The measurement of error we used to evaluate our models
from plates and used these characters for training the model. is the concept of Mean Average Precision(mAP). mAP as a
We then used this model to extract more characters from 100 measurement of precision is commonly used in object
more plates and then trained again. We did this recursively detection models and measures the intersection of predicted
for all the 900 plates we had. At the end, we had around 8000 bounding box with the actual bounding box. We kept the
characters for training with around 200-300 characters per threshold of intersection, also known as Intersection over
class. Union (IoU), as 0.5 for all cases.
This method of recursive training worked very well and The first thing we were able to conclude is that ResNet-
gave good results. However, confusion did occur between 50 as a base CNN for Faster-RCNN worked better than
characters with similar features such as O and Q, I and 1, O VGG16. Experimental results at optimal epoch for each base
and 0, S and 5 and even A and 4 in some cases. However, with mAP is given in Table III. We can attribute this to the
most of these issues were solved using heuristic rules. residual property of ResNet as well as a smaller number of
trainable parameters.
E. Heuristics LPD was tested using 597 images, with breakdown and
The idea of a heuristic comes into play when we know of results given in Table IV. Overall mAP came to 0.9498 or
a certain pattern about the sequence of characters in plates. 94.98% which included all types of vehicles. As seen in
This method proved to be successful in research done in Table IV, the mAP of other vehicles is quite low due to the
Brazil [4]. The basic idea is that we can apply some logic fact that, in these vehicles, the plates are very small and
rules to make sure a particular place in a sequence holds a sometimes there are no plates at all. Instead, the characters
particular type of character, i.e. either a number or a letter. are painted on the surface of the vehicles. This commonly
An example of this can be seen in plates with 10 characters. occurs in auto rickshaws and buses. However, cars and two
If the sequence is detected to be 10 characters, we can say wheelers produced high mAPs, failing in only very few
that the plate must be of the form @@##@@#### where # cases. Our model was also able to detect tilted plates and non-
rectangular plates as well. The results and predictions can be
seen in Fig. 5.
CS was tested with 113 plates with results given in Table
V. Unlike LPD testing, here we had to include a confidence
check before including a prediction in the mean. This is
because, since there are many character-like objects in a
plate, our model predicted some parts of the plates to be
characters but with low confidence. We tried resolving this
using more training, but it led to overfitting. We observed
Fig 4. Few labelled sample images used for training the CS network.
that in 99.1% of cases, actual characters were predicted with
confidence of greater than 0.9. So, we kept 0.9 as confidence
TABLE II threshold to be included in the mean average precision and
OPTIMAL EPOCH NUMBER FOR CS NETWORK
TABLE III
Number of BASE CNN COMPARISON AT OPITMAL EPOCH RESNET50 VS. VGG16
Number Overall
Network Base CNN Training
of Epochs mAP
Samples Network Base CNN mAP
5 93.8% ResNet-50 94.98%
Character 10 97.8% License Plate Detection (LPD)
VGG16 91.43%
Segmentation ResNet-50 711 15 98.92% ResNet-50 99.55%
(CS) 20 99.55% Character Segmentation (CS)
VGG16 95.63%
25 99.32%
TABLE IV testing cases can be seen in Fig. 7
LPD NETWORK RESULTS BY IMAGE TYPE
The inclusion of heuristics corrected 5 of the partially
Network Image Type Number of Images mAP correct cases to full correctness thus leading to an overall
Single line 221 98.01% accuracy of 91%. One example of such a case can be seen in
License Plate
Multiline 241 97.97% Fig. 7. However, we are choosing not to focus on heuristics
Detection
Other Vehicles 135 83.24% results and rather focus on results of each model at each stage
(LPD)
Overall 597 94.98%
because, if these models are good, heuristics are not even
TABLE V needed. While comparing with other software, we chose not
CS NETWORK RESULTS BY PLATE TYPE to include heuristics in the pipeline.
Though each stage in the model has a precision of around
Network Image Type Number of Plates mAP
Single line 47 99.42%
95%, the overall results have a lower accuracy. This can be
Character
Segmentation Multiline 66 99.67% attributed to the Intersection over Union (IoU) factor. We set
(CS) Overall 113 99.55% the bar for IoU as 0.5 for all cases. However, in some cases
even if IoU is 0.6, the CR network is not able to recognise
obtained the overall precision to be 99.55%, with IoU as 0.5, the character, because the character is only partially
as shown in Table V and prediction in Fig. 6. As can be segmented out - that is, only part of the character appears and
observed, the model is able to segment multiple cases not the whole letter. This can even lead to confusion between
including different size of letters within plate, different letters such as ‘I’ and ‘T’. We found that, if IoU of the
padding within plate, painted plates, multiline and single line segmented character is above 0.9, the CR network produces
and even tilted multiline plates. a good result. Since a plate generally has over 9 characters,
The CNN used for CR network achieved an accuracy of the probability that one character segmenting out with an IoU
98.6% in identifying a character, only failing in similar of less than 0.9 is very high and this can lead to either missing
feature characters such as ‘1’ and ‘I’. The CR network was or wrong classification of this character leading to error.
also able to identify handwritten/painted characters as well Another potential error occurs in the LPD network where
because of the feature learning technique. This accuracy of the license plate region is wrongly detected. Sometimes in
98.6% is quite high compared to other models and OCR the back of some vehicles there are many plates like banners
techniques used for ALPR in India. Most of the mismatches or posters present. This is common in buses where the back
however were resolved using the heuristic rules. of the vehicle can be scattered with advertisements.
. In the end, we were left with only 200 images for pipeline Sometimes our LPD network identifies these regions as
testing. This testing set was balanced with respect to plates but with low confidence. In very rare cases this was
frequency at which such vehicles occur on the road. Our identified as the only plate region and thus led to wrong
pipeline gave a completely correct answer in 177 cases and results. Another important factor to consider is that, since we
gave a partially correct answer in 20 cases, failing only in 3 are cascading models, a little bit of error is added up at each
cases. A case was considered to be partially correct if five or stage and models, a little bit of error is added up at each stage
more characters were identified correctly. Some of the and in some cases can completely offset the end results
We also tried the pipeline on some Western country
plates to check whether the software works in standard
scenarios. As expected, the pipeline was able to identify
almost all the plates in the standard scenario as well. This can
be attributed to the fact that foreign plates in general have a
standard format and are less aberrant than the ones in India.
Also, there are not many vehicles types in foreign countries.

Fig 6. Some predictions done by CS network. As seen predictions are


working for tilted plates, uneven font sizes and uneven padding

Fig 5. Some predictions done by LPD network with confidence level of Fig 7. Some results of pipeline without incorporation of heuristics
each prediction. As seen majority of predictions have confidence greater because inclusion cannot give a real feel of the accuracy. As seen ‘0’ is
than 95%. confused for ‘O’. This was solved using heuristics.
This led to detection with high confidence in almost every incorporated is handwriting recognition. We feel that training
case. Wherever problems occurred, it was mostly because of a proper dataset using the same method can incorporate this.
the CR network. Since some characters vary in shape, there Finally, our model is trained with majority of images
was some confusion. Also, the heuristic rules cannot be being head-on. So, another improvement we can aim for is
applied here, as the sequence of characters in foreign plates training for angled plates. This can be helpful in reading a
are different from that in India. Extensive study was not done plate from a traffic light and in other uses of ALPR in India.
on this front, because the focus was on the Indian scenario. Our proposed methodology for ALPR achieves promising
Overall, the pipeline has a better accuracy in India when results for the Indian context. We believe the pipeline can be
compared to any open source software and can be used in incorporated ubiquitously as-is, especially in cities with high
traffic cameras and parking systems throughout India. One degree of stochasticity, to improve road safety and overall
goal of this can be to one day incorporate this is in mobile driving experience.
devices as suggested in [9]. This way, traffic police can use
their mobile devices to scan and the results can be directly ACKNOWLEDGMENTS
sent to a government database. We can go further to add
automatic billing for road rules violations to facilitate the The authors would like to thank TensorFlow for
process and streamline the roads for a safer experience. providing a platform to train our models and Dataturks for
providing tools for labelling with ease.
V. CONCLUSION
REFERENCES
In this paper, we have shown how the use of deep learning
techniques can greatly simplify the process of ALPR in India. [1] Laroca, R., Severo, E., Zanlorensi, L. A., Oliveira, L. S., Gonçalves,
G. R., Schwartz, W. R., & Menotti, D. (2018, July). A robust real-time
Use of deep learning also allows the inclusion of all the automatic license plate recognition based on the YOLO detector. In
aberrant features present in plates in India. In this paper, we 2018 International Joint Conference on Neural Networks (IJCNN) (pp.
have presented a full end-to-end pipeline for ALPR in India, 1-10). IEEE.
which can tackle a lot of intricacies present in plates, [2] Ren, S., He, K., Girshick, R., & Sun, J. (2015). Faster r-cnn: Towards
real-time object detection with region proposal networks. In Advances
including multiple lines, irregular padding, different plate in neural information processing systems (pp. 91-99).
shapes, different fonts and non-uniform font size. For LPD [3] Ghosh, A. K., Sharma, S. K., Islam, M. N., Biswas, S., & Akter, S.
and CS, we have proposed a method that works well for India (2011). Automatic license plate recognition (alpr) for bangladeshi
and also introduced a recursive training technique for the CR vehicles. Global Journal of Computer Science and Technology.
network, which caters to the Indian situation. [4] S. Montazzolli and C. R. Jung, “Real-time brazilian license plate
Our LPD network achieved a 94.98% precision with IoU detection and recognition using deep convolutional neural
networks,”in 2017 30th SIBGRAPI Conference on Graphics, Patterns
threshold as 0.5, only being brought down from 98% due to and Images,Oct 2017, pp. 55–62
widespread prevalence of small plates in autos and buses. [5] Li, H., Wang, P., & Shen, C. Towards end-to-end car license plates
Our CS network had a high mean average precision of detection and recognition with deep neural networks. CoRR
99.55%, with IoU as 0.5, including multiline plates and abs/1709.08828 (2017).
predicted 99.1% of characters with a confidence higher than [6] Masood, S. Z., Shu, G., Dehghan, A., & Ortiz, E. G. (2017). License
90%. Our CR network achieved a 98.6% accuracy using the plate detection and recognition using deeply learned convolutional
neural networks. arXiv preprint arXiv:1703.07330.
recursive technique, with misclassification being present
[7] Hill, M. (2016). Openalpr.
only in cases of characters with similar features. Overall the
[8] He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning
pipeline without inclusion of heuristics had an accuracy of for image recognition. In Proceedings of the IEEE conference on
88.5% full correctness and 10% partial correctness in the computer vision and pattern recognition (pp. 770-778).
remainder of total cases. Inclusion of heuristics led to an [9] Kopparapu, S. K. (2015). Mobile phone based vehicle license plate
increase in overall accuracy to 91%. The overall decrease in recognition for road policing. arXiv preprint arXiv:1504.01476..
accuracy when compared to individual stages can be [10] Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only
look once: Unified, real-time object detection. In Proceedings of the
attributed to the cascading effect as well as the low IoU of IEEE conference on computer vision and pattern recognition (pp. 779-
some segmented-out characters in character segmentation. 788).
For further work we want to use the same data to train the [11] Jiao, L., Zhang, F., Liu, F., Yang, S., Li, L., Feng, Z., & Qu, R. (2019).
YOLO framework [10], another network for object detection A Survey of Deep Learning-based Object Detection. arXiv preprint
which has proved very fast for real time object detection and arXiv:1907.09408.
has also been used in other countries for ALPR pipelines. We [12] Ferreira, L. N. (2018, October). Comparing Brazilian Platforms for
Monitoring Roads, Designed to Support Law Enforcement. In 2018
also want to increase our accuracy for LPD network International Carnahan Conference on Security Technology (ICCST)
especially for autos and buses. To do this, more data is (pp. 1-5). IEEE..
required, which we are currently working on. We also feel, [13] Saghaei, H. (2016). Proposal for Automatic License and Number Plate
another reason for lower accuracy is the irregular size and Recognition System for Vehicle Identification. arXiv preprint
arXiv:1610.03341.
quality of images. Resizing affects the low-quality images,
[14] Li, H., & Shen, C. (2016). Reading car license plates using deep
due to which they do not even clear the CS network properly. convolutional neural networks and lstms. arXiv preprint
The detection of plates in low light images can be improved arXiv:1601.05610.
by more training. The CR network can be improved by [15] Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C.,
adding support for native languages. As mentioned before, ... & Ghemawat, S. (2015). TensorFlow: Large-scale machine learning
this is a limitation to our model and with a proper dataset on heterogeneous systems, 2015. Software available from tensorflow.
org, 1(2).
addition of this feature is possible. We wish to add multiple [16] “Best online platform for your ML data annotation needs.” [Online].
languages such as Hindi, Tamil and Sanskrit in the future. Available: https://dataturks.com/index.php.
Another task that our model cannot do but can be

View publication stats

You might also like