Professional Documents
Culture Documents
Low Resolution Face Recognition System Based On ESRGAN Compressed
Low Resolution Face Recognition System Based On ESRGAN Compressed
Zhuj iang He
Chengkun Song Guangdong Ocea n University(s), Zhanjiang, Guangdong,
Guangdong Ocea n University(s), Zhanjiang, Guangdong, 524088, China
524088, China
Zhenni Zhang
2021 3rd International Conference on Applied Machine Learning (ICAML) | 978-1-6654-2125-6/21/$31.00 ©2021 IEEE | DOI: 10.1109/ICAML54311.2021.00024
Abstract- Low- re solution face re cognition is one of the propose a more suitable LRFR algorithm for teaching
re search hotspots of face recognition tuday. It can be widel)' scenanos.
used in face recognition in va r ious scenarios, such as identity With the development of relevant aspects, face
verification at sta tions and clas sroom check-in . The prior art recognition technology has developed rapidly. However,
h as ac hieved better performance in ideal scen a r ios, but in the case of LRFR, the recognition rate will decline,
detecting low -resolution images will make it difficult to making the existing model difficult to be applied in the
recognize low-resolution human face s, an d th e accuraC)' will
actual situation[3]. Therefore, in this paper, we not only
eventually decrease. This is why improving the accurac)' of
improve the existing face target detection algorithm model,
low-resolution face recognition (L RFR) is still challenging.
but also build a LRFR model through Facenet[4]
\ Ve haw fini shed the reasearch aim to solve the problem
about L RFR. The super-resolution GAN (S RGAl'l') and
combined with ESRGAN.
enhanced su per- r esolution GA N (ESRGAN) used in this
II . O VERALL S YSTEM D ESIGN
search. C om pa r ing these methods, we fin all,· obtain a model
that can solve the low-precis ion problem of LRFR. Our In this paper, A training module and a recognition
sys tem use s su per -resolut ion reconstruction as the module with reconfiguration function are the key parts of
preprocessing ste p of the LRFR problem, a nd then uses our system in Fig. 1.
F ace net to recognize the image. T hese data sets are \Vild
Face T ag (LFW), YouTube Face Databa se (YTF) , and
Wider F ace Dataset. The experimental re sults show that the
accuracy of the ESRG AN based on Facenet of the proposed
syst em in the unconstrained natural environment is as high
as 98.78 % • At the same time, incr-ease the number and speed
of face detection, effectively realize the fun ction of multiple
face re cognition, has practical application value and system
robustness.
1. INTRODUCTION
Through our observation of social phenomena, It IS
found that there are frequent phenomena such as lax Figure I. System designed by ourselves
management of middle school style construction and
students' absence from classes on campus, which is mainly LPFR consists of three parts: multi-target recognition,
due to the time-eonsuming and laborious classroom super-resolution face reconstruction, and face feature
attendance of college staff. However, due to the extremely extraction. Multi-target recognition based on technologies
low efficiency of recognition hardware support in most have used in the system such as multi-threading, high
colleges and universities, and expensive attendance concurrency, and high availability for simultaneous
machines are equipped with cameras and other hardware recognition of multiple single targets. Perform super-
for face recognition algorithm[I][2]. Therefore, we resolution face reconstruction with ESRGAN as the
algorithm support on a single low-resolution face image,
Figure 2. The structure ofYOL0v4 Figure 3. Result of detecting faces4 Face Recognition.
77
Authorized licensed use limited to: VIT University. Downloaded on January 20,2024 at 07:08:54 UTC from IEEE Xplore. Restrictions apply.
two distributions, it is difficult to know the specific
distribution expressions, so it is difficult to find a suitable L~a =- lEx, [log(l- DRa(XpXt))]-
measurement method. The idea of GAN is to give this IExr[log (DRa(xt , xr))] (3)
measurement task to a neural network, which is called a
discriminator. Xt is the image that it passes through the generator
ESRGAN15further improves the restored Image after the original low-resolution image. Since the loss of
quality of SRGAN[14]. ESRGAN[15] removes all BN the confrontation includes x, and xf' the generator
layers in the structure of the generator and replaces the benefits from the generated data and actual data in the
original basic block with the Residual-in-Residual Dense confrontation training. The gradient of the data , this
Block (RRDB). It combines combines multi- level residual adjustment will make the network learn clear edges and
network and dense connections as shown in Fig. 4. textures.
Using the perceived loss of need features before and
activation of super-resolution restoration, overcame the
two shortcomings. The features of activation are very few
and scattered, especially in the deep-network. The few and
scattered activation provide oversight of the effect is very
weak, can lead to poor performance. Use activation
characteristics will lead to the brightness of the
reconstructed image with GT , and add the content is lost,
lost the final function consists of three parts ,
(4)
where
78
Authorized licensed use limited to: VIT University. Downloaded on January 20,2024 at 07:08:54 UTC from IEEE Xplore. Restrictions apply.
For further research, we hope can develop the system to [16] L. Wolf, "Face Recognition in Unconstrained Videos with
achieve real-time. Based on the YOLOv4-tiny network, Matched Background Similarity."
the face detection technology has been studied in detail,
and combined with the current technology, the detection
accuracy has been improved under the condition that the
real-time detection of the YOLO network is guaranteed to
a certain extent, but there is still a big improvement.
ACKNOWLEDGEMENT
In our experiment, the National Undergraduate
Irmovation and Entrepreneurship Program (20201 0566022)
and the Undergraduate Innovation Team Program of
Guangdong Ocean University (CXTD2019004) provide us
with a lot of practical help, cameras, venues and technical
guidance.
REFERENCES
[1] Lin Shouguang. Research on fast face recognition algorithm based
on laboratory attendance system [J]. Information Technology,
2019,43 (04):16-18+22.
[2] Chen Qi. Design and implementation of student attendance system
based on face recognition [D]. University of Electronic Science
and Technology of China, 2019.
[3] Kaibing Zhang, Dongdong Zheng, Junfeng Jing. Review of low
resolution face recognition [J]. Computer Engineering and
Applications,2019,55(22):14-24.
[4] Schroff F, Kalenichenko D, Philbin J. Facenet: A unified
embedding for face recognition and clustering[ C]/ / Proceedings
of the IEEE conference on computer vision and pattern recognition.
2015: 815-823.
[5] Bochkovskiy A, Wang C Y, Liao H Y M. Yolov4: optimal speed
and accuracy of object detection[J]. In IEEE Conference on
Computer Vision and Pattern Recognition (CVPR),2020.
[6] Redmon J, Farhadi A. Yolov3: an incremental improvement[J]. In
IEEE Conference on Computer Vision and Pattern Recognition
(CVPR).2018.
[7] Joseph Redmon, Ali Farhadi. Yol09000: better, faster, stronger[J].
In IEEE Conference on Computer Vision and Pattern Recognition
(CVPR).2016.
[8] Redmon J, Divvala S, Girshick R. You only look once: unified,
real-time object detection[J]. In IEEE Conference on Computer
Vision and Pattern Recognition (CVPR). 2016.
[9] JIANG N,YU W,TANG S,et al.A cascade detector for rapid face
detection[C]!lIEEE.IEEE 7th International Colloquium on Signal
Processing and Its Applications.Penang:Ignatius,2011:155-158.
[10] Zhang K, Zhang Z, Li Z, et al. Joint face detection and alignment
using multitask cascaded convolutional networksl.I] . IEEE Signal
Processing Letters, 2016, 23( 10) : 1499-1503.
[11] G. B. Huang, M. Mattar, T. Berg, and E. Learned-miller, "Labeled
Faces in the Wild: A Database for Studying Face Recognition in
Unconstrained Environments," Tech. rep., 2008.
[12] S. Yang, et al. : WIDER FACE: A Face Detection Benchmark,
2015.
[13] I. J. Goodfellow et aI., "Generative Adversarial Networks," pp. 1-
9,2014, [Online]. Available: http://arxiv.orglabs/1406.2661.
[14] Ledig, C., Theis, L., Husz'ar, F., Caballero, J., Cunningham, A.,
Acosta, A., Aitken, A., Tejani, A., Totz, 1., Wang, Z., et al.Photo-
realistic single image super-resolution using a generative
adversarial network],l]. In CVPR, 2017.
[15] X. Wang et al., "ESRGAN: Enhanced super-resolution generative
adversarial networks," 2019,
79
Authorized licensed use limited to: VIT University. Downloaded on January 20,2024 at 07:08:54 UTC from IEEE Xplore. Restrictions apply.