Visual Analysis of Human Pose Estimation under Frame Degradation Using MediaPipe and ViTPose

Nada E. Elshami; Ahmad Salah; Amr Abdellatif; Heba Mohsen

Authors

Nada E. Elshami, Department of Computer Science, Faculty of Computers and Information Technology, Future University in Egypt, New Cairo, Egypt, https://orcid.org/0000-0002-2530-5041
Ahmad Salah, College of Computing and Information Sciences, University of Technology and Applied Sciences, Ibri, Sultanate of Oman, https://orcid.org/0000-0003-3433-7640
Amr Abdellatif, Faculty of Computers and Informatics, Zagazig University, Zagazig, 44519, Egypt, https://orcid.org/0000-0003-1153-8538
Heba Mohsen, Department of Computer Science, Faculty of Computers and Information Technology, Future University in Egypt, New Cairo, Egypt, https://orcid.org/0000-0003-2003-0357

Keywords:

Human Pose Estimation, MediaPipe, ViTPose, Rotation, Low Resolution, Model Performance

Abstract

Human Pose Estimation (HPE) is an important task in many computer vision applications. However, its accuracy tends to drop on visually degraded inputs, such as low-resolution or rotated video frames. This paper compares two widely used pose estimation models, MediaPipe (MP) and ViTPose, on carefully chosen frames extracted from three of our own everyday videos. To emulate non-optimal conditions, we applied three visual filters to the videos: lossy video compression (to approximately 70% of the original size), 90-degree clockwise rotation, and 180-degree rotation. We then compared the original frames with their filtered counterparts using visual overlays of the predicted landmarks. Our results shed light on how each model reacts to these changes, and the overlays provide a visual representation that can help explain performance anomalies under different conditions. These observations help identify weaknesses of HPE systems in unpredictable environments and point to opportunities for improving pose estimation models for wider, real-life use.
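
The two rotation filters described above can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names are hypothetical, frames are assumed to be NumPy arrays of shape (height, width[, channels]), and landmarks are assumed to be pixel coordinates (x, y) rather than normalized values. The compression filter is not covered here. The inverse mappings show one way to bring landmarks predicted on a rotated frame back into the original frame's coordinates so that overlays can be compared.

```python
import numpy as np


def rotate_90_clockwise(frame):
    """Rotate a frame 90 degrees clockwise (np.rot90 with k=-1)."""
    return np.rot90(frame, k=-1)


def rotate_180(frame):
    """Rotate a frame by 180 degrees."""
    return np.rot90(frame, k=2)


def landmark_from_cw90(x_r, y_r, orig_height):
    """Map a pixel landmark found on the 90-degree-clockwise frame
    back to (x, y) in the original frame.

    Assumption: x is the column index and y is the row index.
    """
    return y_r, orig_height - 1 - x_r


def landmark_from_180(x_r, y_r, orig_height, orig_width):
    """Map a pixel landmark found on the 180-degree frame
    back to (x, y) in the original frame."""
    return orig_width - 1 - x_r, orig_height - 1 - y_r


if __name__ == "__main__":
    # Tiny synthetic "frame" with unique pixel values (height 3, width 4).
    frame = np.arange(12).reshape(3, 4)
    h, w = frame.shape

    rotated = rotate_90_clockwise(frame)
    # A landmark detected at (x_r=2, y_r=1) on the rotated frame ...
    x, y = landmark_from_cw90(2, 1, h)
    # ... refers to the same pixel in the original frame.
    print(rotated[1, 2] == frame[y, x])
```

If a model returns normalized coordinates in [0, 1], they would first need to be scaled by the rotated frame's width and height before applying these mappings.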

