Human Pose Estimation Algorithm Using Optimized Symmetric Spatial Transformation Network

Shengqing Lin; Nor Azizah Ali; Azlan bin Mohd Zain; Muhalim Mohamed Amin Amin

doi:10.21123/bsj.2024.9775

Published: Feb 25, 2024

Keywords:

Computer vision; Deep learning; Human pose estimation; Key point recognition; Symmetric spatial transformation

Shengqing Lin

Faculty of Computing, Universiti Teknologi Malaysia (UTM), Johor Bahru, Johor 81310, Malaysia. & School of Computer and Information Science, Fuzhou Institute of Technology, Fuzhou, Fujian, China.

https://orcid.org/0009-0002-9088-1384

Nor Azizah Ali

Faculty of Computing, Universiti Teknologi Malaysia (UTM), Johor Bahru, Johor 81310, Malaysia.

https://orcid.org/0000-0003-2565-3836

Azlan bin Mohd Zain

Faculty of Computing, Universiti Teknologi Malaysia (UTM), Johor Bahru, Johor 81310, Malaysia.

https://orcid.org/0000-0003-2004-3289

Muhalim Mohamed Amin Amin

Faculty of Computing, Universiti Teknologi Malaysia (UTM), Johor Bahru, Johor 81310, Malaysia.

Abstract

Human pose estimation is a crucial topic in computer vision and has become a research hotspot in much of the work on human behavior. It can be understood as the problem of recognizing human key points and connecting them. This paper presents an optimized symmetric spatial transformation network that connects to a single-person pose estimation network in order to propose high-quality human target frames from inaccurate human bounding boxes, introduces parametric non-maximum suppression to eliminate redundant pose estimates, and applies an elimination rule to remove similar poses so that the final human pose estimation results are unique. The experimental results show that the proposed technique accurately identifies human key points, improves the accuracy of human pose estimation, and adapts to complex scenes with dense crowds and occlusion. Finally, the difficulties and possible future directions are described, and the development of the field is presented.
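The idea of suppressing redundant pose estimates can be illustrated with a minimal sketch. This is not the authors' parametric formulation: the Gaussian keypoint similarity, the `sigma` and `sim_threshold` parameters, and all function names below are assumptions chosen for illustration. Each pose is a list of `(x, y, confidence)` keypoints, and a greedy pass keeps the highest-scoring pose and drops any remaining pose that is too similar to it.

```python
import math


def pose_similarity(pose_a, pose_b, sigma=10.0):
    """Mean Gaussian similarity of corresponding keypoints (1.0 means identical).

    Assumed similarity measure for illustration only.
    """
    total = 0.0
    for (xa, ya, _), (xb, yb, _) in zip(pose_a, pose_b):
        sq_dist = (xa - xb) ** 2 + (ya - yb) ** 2
        total += math.exp(-sq_dist / (2.0 * sigma ** 2))
    return total / len(pose_a)


def pose_score(pose):
    """Score a pose by the mean confidence of its keypoints."""
    return sum(conf for _, _, conf in pose) / len(pose)


def pose_nms(poses, sim_threshold=0.5, sigma=10.0):
    """Greedy pose NMS: keep the best pose, drop near-duplicates, repeat."""
    remaining = sorted(poses, key=pose_score, reverse=True)
    kept = []
    while remaining:
        best = remaining.pop(0)
        kept.append(best)
        # Elimination rule (hypothetical): drop poses too similar to the kept one.
        remaining = [
            p for p in remaining
            if pose_similarity(best, p, sigma) < sim_threshold
        ]
    return kept


# Three toy poses with three keypoints each; pose_b nearly duplicates pose_a.
pose_a = [(100.0, 50.0, 0.90), (110.0, 80.0, 0.80), (90.0, 80.0, 0.85)]
pose_b = [(101.0, 51.0, 0.60), (111.0, 79.0, 0.50), (91.0, 81.0, 0.55)]
pose_c = [(300.0, 60.0, 0.70), (310.0, 90.0, 0.75), (290.0, 90.0, 0.70)]

kept = pose_nms([pose_b, pose_c, pose_a])
print(len(kept))
```

In this toy input the lower-confidence near-duplicate `pose_b` is suppressed, while the distant `pose_c` survives. A real system would tune the similarity function and threshold on validation data.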

Received 30/09/2023

Revised 10/02/2024

Accepted 12/02/2024

Published 25/02/2024

How to Cite

Human Pose Estimation Algorithm Using Optimized Symmetric Spatial Transformation Network. Baghdad Sci.J [Internet]. 2024 Feb 25 [cited 2024 Dec 19];21(2(SI)):0755. Available from: https://bsj.uobaghdad.edu.iq/index.php/BSJ/article/view/9775

Issue

Vol. 21 No. 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023

Section

article

This work is licensed under a Creative Commons Attribution 4.0 International License.

P-ISSN: 2078-8665 | E-ISSN: 2411-7986

Journal Info
Journal: Baghdad Science Journal
Publisher: College of Science for Women/ University of Baghdad
Baghdad Sci. J. is peer-reviewed and open access
Print ISSN: 2078-8665
Electronic ISSN: 2411-7986
Publishing Frequency: Quarterly (from 2004 - 2021) Bi-monthly (from 2022) Monthly (from 2024)
Launched Date: 2004
Abbreviation: Baghdad Sci.J.
Each published paper in Baghdad Sci. J. has a digital object identifier (DOI) number

© 2022 The Author(s). Published by College of Science for Women, University of Baghdad. This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
