Professional Documents
Culture Documents
Yoga Pose Classification ICIP Copy
Yoga Pose Classification ICIP Copy
2.1. Preprocessing Local Classifier Per Node. Each node has a binary classifier
to predict whether the label for that node itself. In total, 108
Each image is passed through the MediaPipe framework
classifiers (6+20+82) are used.
which yields a set of 33 keypoints corresponding to human
Local Classifier Per Level. Each level has a multiclass clas-
body parts as shown in figure-2. Each keypoint further has
sifier associated to it. All the levels are mutually independent.
four features - x, y, z coordinates in 3D space and visibility
In total, 3 classifiers are used.
which is a value between 0 and 1 indicating how visible the
body part is. Therefore, our final dataset consists of 33*4 =
132 features representative of the images and the output label 2.3. Notations
indicating the pose name. A 1:3 stratified split is performed
to generate the test and train sets. We define meaningful notations for each keypoint and cus-
tom feature created. Table-2 represents notations for the
keypoints such that each symbol represents the (x, y, z) tuple
2.2. Hierarchical Classification
for the corresponding body part, like Psl for the left shoulder.
We perform hierarchical classification using three algorithms We also present some basic formulas to make subsequent
described below. expressions compact. Let ̸ (.) be a function that computes
−→ − → −→
Local Classifier Per Parent Node. Each parent node has a the three-dimensional angle between three vectors Pa , Pb , Pc
classifier which performs multiclass classification to classify where each of these is a three-dimensional vector where the
children. In total, 27 classifiers (1+6+20) are used. dimensions represent the coordinates of a keypoint in 3D
Hmin = min(̸ Psl Pel Pwl , ̸ Psr Per Pwr )) (3)
Features related to legs are used because a number of yoga
poses such as warrior pose and a sub-category of yoga called
Power Yoga are focused around thighs, knees and legs. This
category enables us to quantify the differences in legs align-
ment. To handle asymmetry, maximum and minimum angles
are taken.
4. RESULTS & CONCLUSION Fig. 5. MediaPipe keypoints obtained for a human pose
Through all the experiments, it is observed that MediaPipe All the numeric results are illustrated in table-3-5. Along with
framework yields the best results overall. Across all three lev- this, graphical plots for the best performing framework (Me-
els, SVM model gave the best results and the results for cus- diaPipe) are also presented in figures-3-5. Our study high-
tom features - Angles + Spatial coordinates are better than raw lights the potential of machine learning combined with early
keypoints data, and only angle and only spatial coordinates. fusion of deep learning frameworks in order to achieve state
of the art results in Yoga Pose classification. For future work,
the current framework can be extended to real-time pose clas-
sification and deployed in a practical setting, such as an online
yoga class.
Raw Keypoints Data (mediapipe) 0.7979 0.6991 0.3585 0.7380 0.8644 0.9111
Raw Keypoints Data (openpose) 0.8727 0.6110 0.5450 0.9207 0.7238 0.8895
Raw Keypoints Data (posenet) 0.2931 0.1762 0.1933 0.4786 0.3947 0.4249
Level-2
Only Angles Data 0.4744 0.4403 0.3846 0.6979 0.7688 0.7596
Only Spatial Coordinates Data 0.5386 0.4430 0.4677 0.6776 0.8147 0.8738
Angles + Spatial Coordinates 0.7347 0.6635 0.5755 0.7857 0.8849 0.9184
Raw Keypoints Data (mediapipe) 0.7979 0.6991 0.3585 0.7380 0.8644 0.9111
Raw Keypoints Data (openpose) 0.6842 0.6014 0.6338 0.9243 0.7551 0.7959
Raw Keypoints Data (posenet) 0.4569 0.4290 0.3085 0.5848 0.5413 0.5517
Level-1
Only Angles Data 0.5392 0.4851 0.4885 0.7403 0.7937 0.7898
Only Spatial Coordinates Data 0.6560 0.5874 0.5690 0.7552 0.8602 0.9101
Angles + Spatial Coordinates 0.7685 0.7314 0.6450 0.8287 0.9117 0.9383
5. REFERENCES [10] Abhishek Sharma, Yash Shah, Yash Agrawal, and Pra-
teek Jain, “Real-time recognition of yoga poses using
computer vision for smart health care,” 2022.
[1] Anil Patange and Punam Sawarkar, “Role of yoga for
the prevention & management of COVID-19-a re- [11] Yash Agrawal, Yash Shah, and Abhishek Sharma, “Im-
view,” International Journal of Research in Pharmaceu- plementation of machine learning technique for iden-
tical Sciences, vol. 11, no. SPL1, pp. 1720–1724, dec tification of yoga poses,” in 2020 IEEE 9th Interna-
2020. tional Conference on Communication Systems and Net-
work Technologies (CSNT), 2020, pp. 40–43.
[2] Chiranjivi Adhikari, Komal Shah, Somen Saha, and [12] Weian Mao, Yongtao Ge, Chunhua Shen, Zhi Tian, Xin-
Deepak Saxena, “Yoga, immunity and COVID-19: A long Wang, and Zhibin Wang, “Tfpose: Direct human
scoping review,” Journal of Family Medicine and Pri- pose estimation with transformers,” 2021.
mary Care, vol. 11, no. 5, pp. 1683, 2022.
[13] Manisha Verma, Sudhakar Kumawat, Yuta Nakashima,
and Shanmuganathan Raman, “Yoga-82: A new dataset
[3] HR Nagendra, “Yoga for COVID-19,” International for fine-grained classification of human poses,” Apr
Journal of Yoga, vol. 13, no. 2, pp. 87, 2020. 2020.
[14] Shikha Gupta Utkarsh Bahukhandi, “Yoga pose de-
[4] Kshama Gupta Kapil Gupta Abhay Gupta, tection and classification using machine learning tech-
Kuldeep Gupta, “Human activity recognition us- niques,” Dec 2021.
ing pose estimation and machine learning,” Feb [15] Daksh Goyal, Koteswar Rao Jerripothula, and Ankush
2021. Mittal, “Detection of gait abnormalities caused by neu-
rological disorders,” 2020 IEEE 22nd International
Workshop on Multimedia Signal Processing (MMSP),
[5] Ajay Chaudhari, Omkar Dalvi, Onkar Ramade, and 2020.
Dayanand Ambawade, “Yog-guru: Real-time yoga pose
correction system using deep learning methods,” in [16] Camillo Lugaresi, Jiuqiang Tang, Hadon Nash, Chris
2021 International Conference on Communication in- McClanahan, Esha Uboweja, Michael Hays, Fan Zhang,
formation and Computing Technology (ICCICT). June Chuo-Ling Chang, Ming Guang Yong, Juhyun Lee, and
2021, IEEE. et al., “Mediapipe: A framework for building perception
pipelines,” Jun 2019.
[6] Zhe Cao, Gines Hidalgo, Tomas Simon, Shih-En Wei, [17] “Yoga with adriene youtube channel,” .
and Yaser Sheikh, “Openpose: Realtime multi-person
[18] “Yoga journal — online yoga archives,” Mar 2022.
2d pose estimation using part affinity fields,” 2018.
[19] Fábio M. Miranda, Niklas Köhnecke, and Bernhard Y.
Renard, “Hiclass: a python library for local hierarchical
[7] Debabrata Swain, Santosh Satapathy, Biswaranjan classification compatible with scikit-learn,” 2021.
Acharya, Madhu Shukla, Vassilis C. Gerogiannis, An-
dreas Kanavos, and Dimitris Giakovis, “Deep learning
models for yoga pose monitoring,” Algorithms, vol. 15,
no. 11, 2022.