Professional Documents
Culture Documents
net/publication/353630696
CITATION READS
1 262
4 authors, including:
Zhicheng Cao
Xidian University
44 PUBLICATIONS 188 CITATIONS
SEE PROFILE
All content following this page was uploaded by Zhicheng Cao on 01 September 2022.
ABSTRACT
Face detection is one of the most important research topics in the field of computer vision, and it is also the premise
and an essential part of face recognition. With the advent of deep learning-based techniques, the performance of
face detection has been largely improved and more and more daily applications have been witnessed. However,
face detection is greatly affected by environmental illumination. Most of existing face detection algorithms neglect
harsh illumination conditions such as nighttime condition where lighting is insufficient or it is totally dark. These
conditions are often encountered in real-world scenarios, e.g., nighttime surveillance in law enforcement or civil
settings. How to overcome the problem of face detection in the darkness becomes a critical and urgent demand.
We thus in this paper study face detection in the darkness using infrared (IR) imaging. We build an IR face
detection dataset and utilize a deep learning-based model to study the face detection performance. Specifically,
the deep learning model is a Single Stage Detector which has the advantage of fast speed and lower computation
cost compared with other face detectors that consists of multiple stages. In the experiment, we also compare
the performance of the deep learning model with that of a well-known traditional face detection algorithm, Harr
(or Adaboost). In terms of precision, our model significantly outperforms Harr by more than 30% — a dramatic
boost from 68.75% to 98.01%, which suggests our deep learning-based method with IR imaging can indeed meet
the requirement of real-world nighttime face detection applications.
Keywords: Face Detection, low illumination, nighttime, infrared, deep learning.
1. INTRODUCTION
Face recognition has become a relatively mature and widely-used biometrics technology in our daily life after
rapid development in recent years. As a prerequisite for face recognition, face detection has a great influence
on the effect and accuracy of subsequent face recognition. Therefore, the topic of face detection has drawn a
great amount of attention. A lot research works have been conducted,1 among which some algorithms have been
proved to be especially successful and efficiently, such as the AdaBoost algorithm by Viola and Jones.2 As the
era of deep learning comes, many researchers have designed sophisticated convolutional networks to address the
problem of face detection and great achievements have been witnessed, such as Faster RCNN,3 MTCNN,4 and
Cascade CNN.5
However, at present, popular practice of face detection is mostly performed under the traditional choice of
the light spectrum — the visible light band. Although the detection technology has been very mature, but
it is limited by the influence of environmental lighting and visibility factors. In harsh environments, such as
uneven lighting and low visibility, the detection effect is usually not satisfactory since the detection accuracy will
be greatly effected and reduced by those lighting factors. Moreover, the face images need to be pre-processed
prior to the actual detection process to eliminate the influence of light, which leads to the loss of some useful
information.
A more serious case emerges when the face detection task takes place in a totally dark environment where
visible light illuminance is infeasible or should be avoided, such as nighttime surveillance. Many real-world
Further author information: (Send correspondence to Liaojun Pang)
* Liaojun Pang: E-mail: ljpang@mail.xidian.edu.cn, Telephone: +86 2981891070;
Zhicheng Cao: E-mail: zccao@xidian.edu.cn.
surveillance tasks occur during the night. These scenarios are especially common in civil, law enforcement
and military applications. All these scenarios require a quite different light band — the infrared (IR). Latest
advancements have been witnessed in the manufacturing of small and cheap imaging devices sensitive in the
infrared range. Cameras equipped with these sensors have the ability to see at night and through fog and rain
even at long ranges, which makes surveillance in harsh environments more practical and reliable but at the same
time poses new research problems.6–9 Compared with the visible light, IR bands have very different imaging
nature. The reflectivity properties of the facial tissues under IR imaging and visible light imaging are quite
different from each other. Also, the electromagnetic waves of active IR than that of visible light are less affected
by scattering and absorption of smoke or dust, due to different propagation properties.10 Moreover, unlike
imaging in the visible light, IR imaging can be used to extract not only exterior but also useful subcutaneous
anatomical information.11
Therefore, we in this paper study the feasibility of using the infrared as the light band to address the problem
of face detection in the darkness. So far, there have been a few works on face detection that used IR as the
working light band. For example, the work of Yan and Wang utilized the famous Adaboost algorithm to study the
IR face detection problem.12 Zheng et al proposed a projection profile analysis algorithm for face detection and
eyeglasses detection using the thermal IR.13 Shen et al utilized the Active Shape Model (ASM) method to detect
the face features and the low-dimensional Gabor features were extracted for recognition by combining Principal
Component Analysis (PCA) and Linear Discriminant Analysis (LDA).14 The work of Zhou et al designed and
prototyped a novel type of hybrid sensor by combining a pair of near-infrared cameras and a thermal camera.15
To summarize, all these works have adopted traditional non-deep learning methods. Therefore, IR face detection
and especially deep learning-based is of great significance and it remains a challenge.
3.1 Datasets
In our experiments we utilize the dataset of Q-FIRE collected by Schuckers et al. at Clarkson University.19 The
Q-FIRE dataset consists of face images collected under visible light, and IR. The subset of thermal infrared (or
long wave infrared) of Q-FIRE is considered where a total of 84 individuals are involved. The total number
of thermal IR images is 1340, among which 1101 images are divided as the training set, 85 are divided as the
validation set and the remaining 154 are treated as the test set, as listed in Table 1. A sample face image from
the Q-FIRE dataset is shown in Figure 2.
Table 1: Datasets
Dataset Training Validation Test
Q-FIRE 1101 85 154
4. CONCLUSIONS
In this paper, we study the problem of face detection at nighttime using the IR light band, since the IR band of
the light is needed for some special real world face detection applications. We take a deep learning-based approach
to study the feasibility of IR face detection. A DNN model is utilized for the study due to its advantages of
accuracy and speed. We carry out experiments on a thermal IR dataset to study the performance of this DNN
model. We also compare its performance against that of two other traditional methods that are representative.
We find out that thermal IR is a promising subband of IR for reliable face detection with deep learning as the
detection tool. We also observe that deep learning-based methods perform better than traditional hand-crafted
operators.
ACKNOWLEDGMENTS
This research is funded by the National Natural Science Foundation of China under Grant 61906149, the Fun-
damental Research Funds for the Central Universities under Grants JB181206 and XJS201201, and the Natural
Science Basic Research Plan in Shaanxi Province of China under Grants 2021JM-136.
REFERENCES
[1] Hjelmas, E. and Low, B. K., “Face detection: A survey,” Computer Vision and Image Understanding 83(3),
236–274 (2001).
[2] Viola, P. and Jones, M. J., “Robust real-time face detection,” International Journal of Computer Vi-
sion 57(2), 137–154 (2004).
[3] Ren, S., He, K., Girshick, R., and Sun, J., “Faster r-cnn: Towards real-time object detection with region
proposal networks,” IEEE Transactions on Pattern Analysis and Machine Intelligence 39(6), 1137–1149
(2017).
[4] Zhang, K., Zhang, Z., Li, Z., and Qiao, Y., “Joint face detection and alignment using multitask cascaded
convolutional networks,” IEEE Signal Processing Letters 23(10), 1499–1503 (2016).
[5] Deng, J. and Xie, X., “Nested shallow cnn-cascade for face detection in the wild,” in [IEEE International
Conference on Automatic Face and Gesture Recognition ], (2017).
[6] Maeng, H., Liao, S., Kang, D., Lee, S. W., and Jain, A. K., “Nighttime face recognition at long distance:
cross-distance and cross-spectral matching,” in [Asian Conference on Computer Vision], 708–721 (2012).
[7] Nicolo, F. and Schmid, N. A., “Long range cross-spectral face recognition: Matching SWIR against visible
light images,” IEEE Trans. on Inf. Forensics and Security 7(6), 1717–1726 (2012).
[8] Cao, Z., Schmid, N. A., and Li, X., “Image disparity in cross-spectral face recognition: mitigating camera
and atmospheric effects,” Proc. SPIE 9844, 98440Z–98440Z–10 (2016).
[9] Cao, Z. and Schmid, N. A., “Heterogeneous sharpness for cross-spectral face recognition,” Proc. SPIE 10202,
10202 – 10202 – 11 (2017).
[10] Bohren, C. F. and Huffman, D. R., [Absorption and scattering of light by small particles ], John Wiley &
Sons (2008).
[11] Ghiass, R. S., Arandjelovic, O., Bendada, H., and Maldague, X., “Infrared face recognition: a literature
review,” in [Neural Networks (IJCNN), The 2013 International Joint Conference on], 1–10, IEEE (2013).
[12] Yan, C. and Wang, Y., “A novel multi-user face detection under infrared illumination by real adaboost,” in
[International Conference on Computational Intelligence and Software ], 1–6 (2009).
[13] Y, Z., R, B. P., and Y, L. E., “Face detection and eyeglasses detection for thermal face recognition,” in
[Proceedings of SPIE - The International Society for Optical Engineering], 8300:9 (2012).
[14] Shen, P., Liang, Z., Song, J., Hu, X., and Zeng, M., “A near-infrared face detection and recognition system
using ASM and PCA+LDA,” Journal of Networks 9(10) (2014).
[15] Zhou, M., Lin, H., Susan, Y. S., and Yu, J., “Hybrid sensing face detection and registration for low-light
and unconstrained conditions,” Applied Optics 57(1), 69 (2018).
[16] Lecun, Y. and Bengio, Y., [Convolutional networks for images, speech, and time series], MIT Press (1998).
[17] Krizhevsky, A., Sutskever, I., and Hinton, G. E., “Imagenet classification with deep convolutional neural
networks,” in [International Conference on Neural Information Processing Systems], 1097–1105 (2012).
[18] Zhang, S., Zhu, X., Lei, Z., Shi, H., Wang, X., and Li, S. Z., “S3FD: Single shot scale-invariant face
detector,” in [IEEE International Conference on Computer Vision], 192–201 (2017).
[19] Johnson, P. A., Lopez-Meyer, P., Sazonova, N., Hua, F., and Schuckers, S., “Quality in face and iris research
ensemble (Q-FIRE),” in [2010 Fourth IEEE International Conference on Biometrics: Theory, Applications
and Systems (BTAS)], 1–6 (2010).