Human Pose Estimation Algorithm Using Optimized Symmetric Spatial Transformation Network

Shengqing Lin; Nor Azizah Ali; Azlan bin Mohd Zain; Muhalim Mohamed Amin Amin

doi:10.21123/bsj.2024.9775

PDF (الإنجليزية)

منشور: Feb 25, 2024

DOI: https://doi.org/10.21123/bsj.2024.9775

الكلمات المفتاحية:

رؤية الكمبيوتر؛ تعلم عميق؛ تقدير ما بعد الإنسان؛ التعرف على النقاط الرئيسية؛ التحول المكاني المتماثل

Shengqing Lin

كلية الحاسبات، الجامعة التكنولوجية الماليزية (UTM)، جوهور باهرو، جوهور 81310، ماليزيا.&كلية علوم الحاسب والمعلومات، معهد فوتشو للتكنولوجيا، فوتشو، فوجيان، الصين.

https://orcid.org/0009-0002-9088-1384

Nor Azizah Ali

كلية الحاسبات، الجامعة التكنولوجية الماليزية (UTM)، جوهور باهرو، جوهور 81310، ماليزيا.

https://orcid.org/0000-0003-2565-3836

Azlan bin Mohd Zain

كلية الحاسبات، الجامعة التكنولوجية الماليزية (UTM)، جوهور باهرو، جوهور 81310، ماليزيا.

https://orcid.org/0000-0003-2004-3289

Muhalim Mohamed Amin Amin

كلية الحاسبات، الجامعة التكنولوجية الماليزية (UTM)، جوهور باهرو، جوهور 81310، ماليزيا.

الملخص

يعد تقدير وضعية الإنسان موضوعًا بالغ الأهمية في مجال رؤية الكمبيوتر، وقد أصبح نقطة ساخنة للبحث في العديد من الأعمال المتعلقة بالسلوكيات البشرية. يمكن فهم تقدير وضع الإنسان على أنه مشكلة التعرف على النقاط الرئيسية للإنسان والاتصال بها. تقدم هذه الورقة شبكة تحويل مكاني متماثلة محسنة مصممة للتواصل مع شبكة تقدير وضعية الشخص الواحد لاقتراح إطارات مستهدفة بشرية عالية الجودة من الصناديق المحيطة البشرية غير الدقيقة، وتقدم قمعًا بارامتريًا غير أقصى للقضاء على تقدير الوضعية الزائدة عن الحاجة، وتطبق قاعدة الإزالة لإزالة الوضع المماثل للحصول على نتائج فريدة لتقدير الوضع البشري. توضح النتائج الاستكشافية كيف يمكن للتقنية المقترحة أن تتعرف بدقة على القضايا الإنسانية المركزية، وتعمل حقًا على دقة تقييم وضعية الإنسان، ويمكنها التكيف مع المشاهد المعقدة مع الأفراد السميكين والعوائق. وأخيرا، يتم وصف الصعوبات والاتجاهات المستقبلية المحتملة، ويتم عرض تطور المجال.

Received 30/09/2023

Revised 10/02/2024

Accepted 12/02/2024

Published 25/02/2024

كيفية الاقتباس

خوارزمية تقدير وضعية الإنسان باستخدام التناظر الأمثل شبكة التحول المكاني. Baghdad Sci.J [انترنت]. 25 فبراير، 2024 [وثق 20 مايو، 2024];21(2(SI):0755. موجود في: https://bsj.uobaghdad.edu.iq/index.php/BSJ/article/view/9775

إصدار

مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023

القسم

article

هذا العمل مرخص بموجب Creative Commons Attribution 4.0 International License.

كيفية الاقتباس

المراجع

Cervantes J, Garcia-Lamont F, Rodríguez-Mazahua L, et al. A comprehensive survey on support vector machine classification: Applications, challenges and trends. Neurocomputing. 2020; 408(2): 189-215. https://doi.org/10.1016/j.neucom.2019.10.118.

Alzubaidi L, Zhang J, Humaidi A J, et al. Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions. J. Big Data. 2021; 8(1): 1-74. https://doi.org/10.1186/ s40537-021-00444-8.

Pareek P, Thakkar A. A survey on video-based human action recognition: recent updates, datasets, challenges, and applications. Artif. Intell. Rev. .2021;54(3): 2259-2322. https://doi.org 10.1007/s10462-020-09904-8.

Shi Y, Zhang Z, Huang K, et al. Human-computer interaction based on face feature localization. J. Vis. Commun. .2020; 70(1): 102740. https://doi.org/10.1016/j.jvcir.2019. 102740.

Zheng C, Wu W, Yang T, et al. Deep learning-based human pose estimation: A survey. arXiv. arXiv:2012;13392.https://doi.org/10.48550/arXiv.2012.13392.

Chen J, Li S, Liu D, et al. Indoor camera pose estimation via style‐transfer 3D models. COMPUT-AIDED CIV INF . 2022;37(3): 335-353. https://doi.org/10.1111/mice.12714.

Li M, Gao Y, Sang N. Exploiting learnable joint groups for hand pose estimation Proceedings of the AAAI Conference on Artificial Intelligence. 2021; 35(3): 1921-1929 https://doi.org/10.1609/aaai.v35i3.16287.

Tang H, Wang Q, Chen H. Research on 3D human pose estimation using RGBD camera 2019 IEEE 9th International Conference on Electronics Information and Emergency Communication (ICEIEC). IEEE, 2019: 538-541. https://doi.org/ 10.1109/iceiec.2019.8784591

9. Bowen Cheng, Bin Xiao, Jingdong Wang, Honghui Shi, Thomas S. Huang, and Lei Zhang. Higherhrnet: Scale aware representation learning for bottom-up human pose estimation. arXiv .2020. https://doi.org/10.48550/arXiv.1908.10357.

Jin S, Liu W, Xie E, et al. Differentiable hierarchical graph grouping for multi-person pose estimation. European Conference on Computer Vision. arXiv. 2020; 718-734. https://doi.org/10.48550/arXiv.2007.11864.

Bao Q, Liu W, Cheng Y, et al. Pose-guided tracking-by-detection: Robust multi-person pose tracking[J]. IEEE Transactions on Multimedia. 2020; 23(10): 161-175. https://doi.org/10.1109/TMM.2020. 2980194.

Dang Q, Yin J, Wang B, et al. Deep learning based 2d human pose estimation: A survey. Tsinghua Sci Technol. 2019; 24(6): 663-676. https://doi.org/ 10.26599/TST.2018.9010100.

Luvizon D C, Picard D, Tabia H. 2d/3d pose estimation and action recognition using multitask deep learning .Proceedings of the IEEE conference on computer vision and pattern recognition. 2018: 5137-5146. https://doi.org/ 10.1109/CVPR.2018.00539.

Chen Y, Tian Y, He M. Monocular human pose estimation: A survey of deep learning-based methods[J]. Comput Vis Image Underst. 2020; 192(5): 102897. https://doi.org/ 10.1016/j.cviu.2019.102897

Yang G, Sun D, Jampani V, et al. ViSER: Video-Specific Surface Embeddings for Articulated 3D Shape Reconstruction[J]. Adv. neural inf. process. Syst. 2021; 34..

Qiu S, Zhao H, Jiang N, et al. Multi-sensor information fusion based on machine learning for real applications in human activity recognition: State-of-the-art and research challenges[J]. Information Fusion. 2022; 80(6): 241-26. https://doi.org/ 10.1016/j.inffus.2021.11.006.

Toshev A, Szegedy C. DeepPose: human pose estimation via deep neural networks[C]. 2014 IEEE Conference on Computer Vision and Pattern Recognition. IEEEPress. 2014. 1653-1660. https://doi.org/10.1109/ CVPR.2014.214.

Li S, Zhang L, Diao X. Deep-learning-based human intention prediction using RGB images and optical flow[J]. J Intell Robot Syst. 2020; 97(1): 95-107. https://doi.org/ 10.1007/s10846-019-01049-3.

Yilun Chen, Zhicheng Wang, Yuxiang Peng, Zhiqiang Zhang, Gang Yu, and Jian Sun. Cascaded pyramid network for multi-person pose estimation.2018 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR). 2018; 7103– 7112. https://doi.org/ 10.1109/CVPR. 2018.00742.

Wei SE, Ramakrishna V, Kanade T, Sheikh Y. Convolutional pose machines. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016: 2433-2454..

Sun K, Xiao B, Liu D, et al. Deep high-resolution representation learning for human pose estimation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019 ; 5693-5703. https://doi.org/10.1109/CVPR.2019.00584.

Yuanhao Cai, Zhicheng Wang, Zhengxiong Luo, Binyi Yin, Angang Du, Haoqian Wang, et al. Learning delicate Local Representations for Multi-person Pose Estimation. In European Conference on Computer Vision(ECCV). 2020; 457-472. https://doi.org/10.1109/CVPR.2019.00584.

M. Rajchl et al. DeepCut: Object Segmentation From Bounding Box Annotations Using Convolutional Neural Networks. IEEE Transactions on Medical Imaging. 2017; 36(2).674-683. https://doi.org/10. 1109/ TMI.2016.2621185.

Cao Z,Simon T,WeiS H, et al. Real time multiperson 2D pose estimation using part affinity fields. IEEE Conference on Computer Vision and Pattern Recognition (CVPR) IEEE Press. 2017; 1302-1310. https://doi.org/10.1109/TPAMI.2020.2983686.

Newell A, Yang KY, Deng J. Stacked hourglass networks for human pose estimation. Computer Vision - ECCV 2016. Lecture Notes in Computer Science. 2016 ; 483-499. Available from: https://doi.org/10.1007/978-3-319-46484-8_29.

Miller LE, Fabio C, Azaroual M, et al. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell. 2017; 39(6):1137-1149. https://doi.org/10.1109/TPAMI.2016.2577031.

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N, et al. Attention is all you need. NeurIPS. 2017; 5998-6008; Yufei Xu, Jing Zhang, Qiming Zhang, Dacheng Tao. ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation. 2022; 38571-38584. https://doi.org/10.48550/arXiv.2212.04246.

Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, et al. SSD: Single shot multibox detector. Computer Vision ECCV(Springer). 2016; 21-37. https://doi.org/10.1007/978-3-319-46448-2_0.

Andriluka M, Pishchulin L, Gehler P, Schiele B. 2D human pose estimation: New benchmark and state of the art analysis. 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2014; 3985-3978. https://doi.org/10.1109/CVPR.2014.471.

المؤلفات المشابهة

Karrar A. Kadhim, Farhan Mohamed, Fallah H Najjar, Ghalib Ahmed Salman, التشخيص المبكر لمرض الزهايمر عن طريق استخراج مميزات الصور بالرسم البياني واكتشاف حافة كاني باستخدام الشبكة العصبية التلافيفية , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
Howida Abubaker , Farkhana Muchtar, Alif Ridzuan Khairuddin, Ahmad Najmi Amerhaider Nuar, Zuriahati Mohd Yunos, Carolyn Salimun, استكشاف العوامل المهمة في التنبؤ بأمراض القلب بناءً على نهج اختيار الميزات الإضافية , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
Asmaa M. Salih Almohaidi, Fikrat M. Hassan, Hussin Rothan, الافتتاحية: التطورات الحالية في استراتيجيات مكافحة العدوى , مجلة بغداد للعلوم: مجلد 20 عدد 5 (2023): Issue 5
Di-Wen Kang, Shao-Qiang Ye, Sharifah Zarith Rahmah Syed Ahmad, Li-Ping Mo, Feng Qin, Pan Zhou, أداة تمييز جزء من الكلام للبحث عن التناغم التكيفي لمربع لغة همونغ كوربوس , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
Sen-Yu Yang, Yin-Hong Xiang, Di-Wen Kang, Kai-Qing Zhou, خوارزمية بحث الوقواق المحسنة لزيادة نطاق التغطية لشبكات الاستشعار اللاسلكية , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
Devi Yurisca Bernanda, Dayang N.A. Jawawi, Shahliza Abd Halim, Fransiskus Adikara, معالجة اللغة الطبيعية لاستنتاج المتطلبات في الجامعة باستخدام خوارزمية KMEANS وMEANSHIFT , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
Murad Ibrahim Husin Alzawali, Yusliza Yusoff, Razana Alwee, Zuriahati Mohd Yunos, Mohamad Shukor Talib, Haswadi Hassan, Fahad Taha AL-Dhief, Musatafa Abbas Abbood Albadr, Majid Razaq Mohamed Alsemawi, Sharifah Zarith Rahmah Syed Ahmad, التعرف على صور عاطفة الوجه بناءً على الخوارزمية الجينية الثنائية - الغابة العشوائية , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
Muhammad S. Alam, Farhan B. Mohamed, AKM B. Hossain, تحديد الموقع الذاتي للروبوتات الموجهة من خلال تصنيف الصور , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
zhihua Xiang , Nor Haizan Mohamed Radzi, Haslina Hashim, دراسة تصنيف المشاعر على أساس الاندماج متعدد الوسائط , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
Haider Mohammed Abdulhadi, Yousra Abdul Alsahib S. Aldeen, Maryam A. Yousif, Mays jalal jaseem, Syed Hamid Hussain Madni, تقنية الحوسبة السحابية على الشبكات اللاسلكية المخصصة المستخدمة في المدن الذكية , مجلة بغداد للعلوم: مجلد 20 عدد 6(Suppl.) (2023): Supplement Issue 6

يمكنك أيضاً إبدأ بحثاً متقدماً عن المشابهات لهذا المؤلَّف.

CS-IF

1.3

CiteScore

0.6

Impact Factor

إنشاء طلب نشر

issn

P-ISSN: 2078-8665 | E-ISSN: 2411-7986

journalindexing

Journal Indexing
SCOPUS
Directory of Open Access Journals DOAJ
Library of Congress
Iraqi Academic Scientific Journal
Open Access Scholarly Publishers Association (OASPA)
SNIP (Source Normalized Impact Per Paper)

journalinfo

Journal Info
Journal: Baghdad Science Journal
Publisher: College of Science for Women/ University of Baghdad
Baghdad Sci. J. is peer-reviewed and open access
Print ISSN: 2078-8665
Electronic ISSN: 2411-7986
Publishing Frequency: Quarterly (from 2004 - 2021) Bi-monthly (from 2022) Monthly (from 2024)
Launched Date: 2004
Abbreviation: Baghdad Sci.J.
Each published paper in Baghdad Sci. J. has a digital object identifier (DOI) number

اللغة

scopus

1.3

2022CiteScore

50th percentile

ca

cope

sjr

locongress

clockss

Ithenticate

Sherpa Romeo

crossref

WHO

sci journal

uob digital repository

Scilit

cc

© 2022 The Author(s). Published by College of Science for Women, University of Baghdad. This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

الشريط الجانبي للمقالة

محتوى المقالة الرئيسي

الملخص

تفاصيل المقالة

كيفية الاقتباس

المراجع

المؤلفات المشابهة