Professional Documents
Culture Documents
AND
MACHINE LEARNING
Project Synopsis
Sl. US Na
No N me
Motivation:
Facial recognition technology needs clear images for correct operation. Even minor blur or
distortion hampers recognition accuracy. Current systems struggle when dealing with suboptimal
CCTV footage, such as blurred, obscured, or poorly angled faces. This undermines their utility for
law enforcement agencies and security personnel relying on facial recognition for surveillance
purposes.
Situations like the Bengaluru coffee shop blast highlight the downsides of current facial
recognition technology. Investigators used AI systems to analyse CCTV footage of masked
suspects. But standard face recognition struggles with blurry, blocked, or angled faces on low-
quality CCTV cameras. Our project develops a deep learning system to reconstruct faces from the-
se challenging images. It aims to boost accuracy - providing clearer facial images, even if parts are
obscured or captured at poor angles. The goal is to provide improved security and recognition
capabilities through enhanced clarity. The purpose of this system is to improve facial recognition
accuracy and provide clearer vision, with the ultimate goal being to improve security and
recognition capabilities.
Proposed Methodology:
Data Preparation:
● We gather a diverse dataset of high-resolution facial images and apply variations (cropping,
colour adjustments) to improve model generalizability.
● Techniques like blurring and masking are used to simulate occlusions commonly found in
CCTV footage on the training images.
● A cascaded diffusion model effectively removes noise often present in low-quality CCTV
videos.
● This stage significantly improves the overall image quality, revealing crucial facial details
obscured by noise.
● Pre-trained facial detection models pinpoint the face location within the denoised image.
● Subsequently, techniques like those used in Dlib extract key facial features (eyes, nose,
mouth) for further processing.
● A conditional GAN, specifically an AttnGAN, focuses on the facial region based on the
pre-processed image and extracted landmarks.
● This stage effectively reconstructs a realistic and detailed face that aligns with the
underlying details in the denoised image.
Improved facial reconstructions that make deeper probing possible. Potentially able to view from
multiple angles. In the end, objectives include enhancing identification and security procedures for
various applications. Enhancements in crime prevention, criminal apprehending, and suspicious
activity identification can result from more precise facial recognition. These anticipated results
show how the project has the potential to advance facial recognition technology and its
applications in the surveillance and security fields.
References
[1] Goodfellow, I. J., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., ... &
Bengio, Y. (2014). Generative adversarial nets. In Advances in neural information processing
systems (pp. 2672-2680).
[2] Kingma, D. P., & Welling, M. (2013). Auto-encoding variational bayes. arXiv preprint
arXiv:1312.6114.
[3] Radford, A., Metz, L., & Chintala, S. (2015). Unsupervised representation learning with deep
convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434.
[4] Karras, T., Laine, S., & Aila, T. (2019). A style-based generator architecture for generative
adversarial networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and
Pattern Recognition (pp. 4401-4410).
[5] Ho, J., Jain, A., & Abbeel, P. (2020). Denoising diffusion probabilistic models. Advances in
Neural Information Processing Systems, 33, 6840-6851.
[6] Dollár, P., Welinder, P., & Perona, P. (2010). Cascaded pose regression. In Proceedings of the
IEEE Conference on Computer Vision and Pattern Recognition (pp. 1078-1085).
[8 ]AbdAlmageed, W., Wu, Y., Rawls, S., Harel, S., Hassner, T., Masi, I., ... & Medioni, G.
(2016). Face recognition using deep multi-pose representations. In 2016 IEEE Winter Conference
on Applications of Computer Vision (WACV) (pp. 1-9). IEEE.
[9] Taigman, Y., Yang, M., Ranzato, M., & Wolf, L. (2014). DeepFace: Closing the gap to human-
level performance in face verification. In Proceedings of the IEEE Conference on Computer Vision
and Pattern Recognition (pp. 1701-1708).
[10] Zhang, K., Zhang, Z., Li, Z., & Qiao, Y. (2016). Joint face detection and alignment using
multitask cascaded convolutional networks. IEEE Signal Processing Letters, 23(10), 1499-1503.
Plagiarism Report:
Guide’s Comments: