Indoor/Outdoor Deep Learning Based Image Classification for Object Recognition Applications

Omar Abdullatif Jassim; Mohammed Jawad Abed; Zenah Hadi Saied

doi:10.21123/bsj.2023.8177

PDF (الإنجليزية)

منشور: Dec 5, 2023

DOI: https://doi.org/10.21123/bsj.2023.8177

الكلمات المفتاحية:

التعلم العميق، نموذج التعلم المدرب مسبقاً واسمه GoogleNet، تصنيف الصور، المشاهد داخل وخارج المنزل، نقل التعلم.

Omar Abdullatif Jassim

قسم هندسة تقنيات الاجهزة الطبية, كلية الحكمة الجامعة, بغداد, العراق

https://orcid.org/0009-0004-1607-2778

Mohammed Jawad Abed

قسم هندسة تقنيات الاجهزة الطبية, كلية الحكمة الجامعة, بغداد, العراق

Zenah Hadi Saied

قسم تقنيات المختبرات الطبية, المعهد الطبي التقني-المنصور, الجامعة التقنية الوسطى, بغداد, العراق

الملخص

مع التطور السريع في تصميم الأجهزة الذكية، أصبحت حياة الناس أسهل خصوصاً أولئك الذين يعانون من فقدان البصر أو العمى. الإنجازات الجديدة في مجال تعلم الآلة والتعلم العميق سمحت لفاقدي البصر بالتعرف على البيئة المحيطة بهم وتمييزها. في الدراسة الحالية، نقوم باستخدام الفعالية والأداء العالي الذي تتمتع به أنظمة التعلم العميق لبناء نظام تصنيف الصور في كلا البيئتين الداخلية والخارجية. تبدأ الطريقة المقترحة بإنشاء مجموعتي بيانات داخلية وخارجية من عدة مصادر بيانات مختلفة. في الخطوة التالية، يتم تقسيم مجموعة البيانات المجمعة إلى مجموعات تدريب وتحقق واختبار. يتم استخدام نموذجي التعلم العميق المدربين مسبقاً المسميين GoogleNet وMobileNet-V2 من التدريب باستخدام مجموعتي البيانات الداخلية والخارجية وينتج عن ذلك نموذجان مدربان. يتم استخدام مجموعات بيانات الاختبار من أجل اختبار النماذج المدربة باستخدام معاملات قياس الأداء (الدقة، معدل القبول الصحيح، معدل الرفض الخاطئ، معدل التخمين الصحيح، ومعدل الاكتشاف الخاطئ). بالنسبة لنموذج GoogleNet تشير النتائج إلى الأداء العالي للأنظمة المدربة حيث تم التوصل لدقات اختبار 99.34% و99.76% لكل من مجموعتي البيانات الداخلية والخارجية على التوالي. أما فيما يخص نموذج MobileNet فقد تم التوصل لدقات 99.27% و 99.68% لكل من مجموعي بيانات الداخلية والخارجية على التوالي. تمت مقارنة الطريقة المقترحة مع الطرق المماثلة في الدراسات السابقة في مجال تصنيف الصور في أنظمة رعاية فاقدي البصر، حيث أظهرت تفوق الطريقة المقترحة من قبلنا.

Received 29/11/2023

Revised 02/04/2023

Accepted 04/04/2023

كيفية الاقتباس

تصنيف الصور القائم باستخدام التعلم العميق لتطبيقات كشف النماذج داخل وخارج المنزل. Baghdad Sci.J [انترنت]. 5 ديسمبر، 2023 [وثق 23 نوفمبر، 2024];20(6(Suppl.):2540. موجود في: https://bsj.uobaghdad.edu.iq/index.php/BSJ/article/view/8177

إصدار

مجلد 20 عدد 6(Suppl.) (2023): Supplement Issue 6

القسم

article

هذا العمل مرخص بموجب Creative Commons Attribution 4.0 International License.

كيفية الاقتباس

تنزيل الاقتباسات

المراجع

Lv H, Shi S, Gursoy D. A look back and a leap forward: a review and synthesis of big data and artificial intelligence literature in hospitality and tourism. J Hosp Mark Manag. 2022; 31(2): 145-75. https://doi.org/10.1080/19368623.2021.1937434

Rozenwald MB, Galitsyna AA, Sapunov GV, Khrameeva EE, Gelfand MS. A machine learning framework for the prediction of chromatin folding in Drosophila using epigenetic features. Peer J Comput Sci. 2020; 6(30). https://doi.org/10.7717/peerj-cs.307

Sharma N, Sharma R, Jindal N. Machine learning and deep learning applications-a vision. Glob Trans Proc. 2021; 2(1): 24-8. https://doi.org/10.1016/j.gltp.2021.01.004

Alzubaidi L, Zhang J, Humaidi AJ, Al-Dujaili A, Duan Y, Al-Shamma O, et al. Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions. J Big Data. 2021; 8(1): 1-74. DOI:10.1186/s40537-021-00444-8

Adeel A, Gogate M, Hussain A. Contextual deep learning-based audio-visual switching for speech enhancement in real-world environments. Inf Fusion. 2020; 1(59): 163-70. https://doi.org/10.1016/j.inffus.2019.08.008

Tian H, Chen SC, Shyu ML. Evolutionary programming based deep learning feature selection and network construction for visual data classification. Inf Syst Front. 2020 Oct; 22(5): 1053-66. https://doi.org/10.1007/s10796-020-10023-6

Lee SB, Gui X, Manquen M, Hamilton ER. Use of training, validation, and test sets for developing automated classifiers in quantitative ethnography. Int Conf Quant Ethn. 2019; 117-127. https://doi.org/10.1007/978-3-030-33232-7_10

Vishwakarma M, Singh HP, Kumar N, Arora M. The Need of Smart Guidance Systems for Blind People in the World. Proc Int Conf Big Data Mach Learn App. 2021; 191-195. https://doi.org/10.1007/978-981-15-8377-3_17

Durgadevi S, Thirupurasundari K, Komathi C, Balaji SM. Smart machine learning system for blind assistance. Int Conf Power Energy Control Trans Syst. IEEE. 2020; 1-4. https://doi.org/10.1109/ICPECTS49113.2020.9337031

Tapu R, Mocanu B, Zaharia T. DEEP-SEE: Joint object detection, tracking and recognition with application to visually impaired navigational assistance. Sensors. 2017; 17(11): 2473. https://doi.org/10.3390/s17112473

Yadav AV, Verma SS, Singh DD. Virtual Assistant for blind people. Int J Adv Sci Res Eng Trends. 2021; 6(5):156-159. http://ijasret.com/VolumeArticles/FullTextPDF/831_36.VIRTUAL_ASSISTANT_FOR_BLIND_PEOPLE.pdf

Ephzibah EP. Assisting Blind People Using Machine Learning Algorithms. Tur. Co. Mat. 2021; 12(8): 3162-70. https://doi.org/10.17762/turcomat.v12i8.4161

Qureshi TA, Rajbhar M, Pisat Y, Bhosale V. AI Based App for Blind People. Int Res J Eng Technol. 2021; 8(03): 2883-7. https://doi.org/10.1177/02646196221131746

Mamun SA, Daud ME, Mahmud M, Kaiser MS, Rossi AL. ALO: AI for least observed people. Int Conf Appl Intell Inform. Springer. 2021; 306-317. https://www.springerprofessional.de/en/alo-ai-for-least-observed-people/19396306

Chaurasia MA, Rasool S, Afroze M, Jalal SA, Zareen R, Fatima U, et al. Automated Navigation System with Indoor Assistance for Blind. In Contactless Healthcare Facilitation and Commodity Delivery Management During COVID 19 Pandemic. Springer, Singapore. 2022; 119-128. https://doi.org/10.1007/978-981-16-5411-4_10

Mone S, Salunke N, Jadhav O, Barge A, Magar N. Machine Learning Based Computer Vision Application for Visually Disabled People. Int J Sci Res Comput Sci Eng Inf Technol. 2021; 7(3): 488-494. https://doi.org/10.32628/CSEIT2173130

Wang K, Chen CM, Hossain MS, Muhammad G, Kumar S, Kumari S. Transfer reinforcement learning-based road object detection in next generation IoT domain. Comput Netw. 2021; 193:1-12. https://doi.org/10.1016/j.comnet.2021.108078

Bouteraa Y. Design and Development of a Wearable Assistive Device Integrating a Fuzzy Decision Support System for Blind and Visually Impaired People. Micromachines. 2021; 12(9): 1082. https://doi.org/10.3390/mi12091082

Periša M, Peraković D, Cvitić I, Krstić M. Innovative ecosystem for informing visual impaired person in smart shopping environment: IoT Shop. Wirel Netw. 2022; 28(1): 469-79. https://doi.org/10.1007/s11276-021-02591-5

Dhou S, Alnabulsi A, Al-Ali AR, Arshi M, Darwish F, Almaazmi S, et al. An IoT machine learning-based mobile sensors unit for visually impaired people. Sensors. 2022; 22(14): 5202. https://doi.org/10.3390/s22145202

Zhang E, fruit recognition dataset. Kaggle. 2022. [Online]. https://www.kaggle.com/datasets/sshikamaru/fruit-recognition .

Buyukkinaci M., fruit images for object detection dataset. Kaggle. 2018. https://www.kaggle.com/datasets/mbkinaci/fruit-images-for-object-detection .

Roy P, Ghosh S, Bhattacharya S, Pal U. Natural images dataset. Kaggle, 2018. https://www.kaggle.com/datasets/prasunroy/natural-images .

Annamraju. Weapon detection dataset. Kaggle. 2019. https://www.kaggle.com/datasets/abhishek4273/gun-detection-dataset .

Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, et al. Going deeper with convolutions. Proc. IEEE conf Comput Vis Pattern Recognit. 2015: 1-9. https://doi.org/10.48550/arXiv.1409.4842

Hub T. Convolutional neural network architectures. Principles and Labs for Deep Learning. 2021: 6: 201. https://doi.org/10.1016/C2020-0-03408-0

AL-Huseiny MS, Sajit AS. Transfer learning with GoogLeNet for detection of lung cancer. Indones J Electr Eng. 2021; 22(2): 1078-86. http://doi.org/10.11591/ijeecs.v22.i2.pp1078-1086

Sharma S, Kumar H. Detection and classification of plant diseases by Alexnet and GoogleNet deep learning architecture. Int J Sci Res Eng Trends. 2022; 8(1): 218-23. https://ijsret.com/wp-content/uploads/2022/01/IJSRET_V8_issue1_120.pdf

Zakaria N, Mohamed F, Abdelghani R, Sundaraj K. VGG16, ResNet-50, and GoogLeNet Deep Learning Architecture for Breathing Sound Classification: A Comparative Study. Int Conf Artif Intell Cyber Secur Syst. IEEE. 2021:1-6. https://doi.org/10.1109/AI-CSP52968.2021.9671124

Mayya A, Khozama S. A. Novel Medical Support Deep Learning Fusion Model for the Diagnosis of COVID-19. Int Conf Adv Trends Multidiscip Res Innov. IEEE. 2020: 1-6. https://doi.org/10.1109/ICATMRI51801.2020.9398317

Abdullah TH, Alizadeh F, Abdullah BH. COVID-19 Diagnosis System using SimpNet Deep Model. Baghdad Sci J. 2022; 19(5): 1078-1089. DOI: http://dx.doi.org/10.21123/bsj.2022.19.4.ID0000.

Zaki SM, Jaber MM, Kashmoola MA. Diagnosing COVID-19 Infection in Chest X-Ray Images Using Neural Network. Baghdad Sci J. 2022; 19(6): 1356-1361. DOI: https://dx.doi.org/10.21123/bsj.2022.5965.

Sandler M, Howard A, Zhu M, Zhmoginov A, Chen LC. Mobilenetv2: Inverted residuals and linear bottlenecks. Proc IEEE conf Comput Vis Pattern Recognit. 2018: 4510-4520. https://doi.org/10.48550/arXiv.1801.04381

Feigenbaum J J. A machine learning approach to census record linking. Harvard University. 2016. https://ranabr.people.stanford.edu/sites/g/files/sbiybj26066/files/media/file/machine_learning_approach.pdf

المؤلفات المشابهة

Faisal Bin Al Abid, Aryati, Md. Golam, Jia Uddin, Shefayatuj Johara Chowdhury, نهج مبتكر مبسط للتصنيف الدقيق لحالات تخلف الموظفين عن العمل باستخدام MCDM، ونهج مبدأ De-Pareto، والتعلم الآلي , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
يون أوي, Zhen-Jie تشو, Di-Wen كانغ, مِقلاة تشو, Xue-Wei Liu, التعرف على أحرف Qin Seal Script بمعلومات غامضة وغير كاملة , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
Abubakar Salihu Abba, Noorfa Haszlinna Mustaffa, Siti Zaiton Mohd Hashim, Razana Alwee, تصنيف الانسكابات النفطية على أساس الصور الفضائية باستخدام تقنيات التعلم العميق , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
Qi Diao, Apri Junaidi, WengHowe Chan, Azland Mohd Zain, Hao long Yang, SBOA: خوارزمية تحسين إرشادية جديدة , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
Muhammad Alrashidi, Roliana Ibrahim, Ali Selamat, نظام التوصية الهجين القائم على CNN , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
Karrar A. Kadhim, Farhan Mohamed, Fallah H Najjar, Ghalib Ahmed Salman, التشخيص المبكر لمرض الزهايمر عن طريق استخراج مميزات الصور بالرسم البياني واكتشاف حافة كاني باستخدام الشبكة العصبية التلافيفية , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
Howida Abubaker , Farkhana Muchtar, Alif Ridzuan Khairuddin, Ahmad Najmi Amerhaider Nuar, Zuriahati Mohd Yunos, Carolyn Salimun, استكشاف العوامل المهمة في التنبؤ بأمراض القلب بناءً على نهج اختيار الميزات الإضافية , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
Asmaa M. Salih Almohaidi, Fikrat M. Hassan, Hussin Rothan, الافتتاحية: التطورات الحالية في استراتيجيات مكافحة العدوى , مجلة بغداد للعلوم: مجلد 20 عدد 5 (2023): Issue 5
Sen-Yu Yang, Yin-Hong Xiang, Di-Wen Kang, Kai-Qing Zhou, خوارزمية بحث الوقواق المحسنة لزيادة نطاق التغطية لشبكات الاستشعار اللاسلكية , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023
Murad Ibrahim Husin Alzawali, Yusliza Yusoff, Razana Alwee, Zuriahati Mohd Yunos, Mohamad Shukor Talib, Haswadi Hassan, Fahad Taha AL-Dhief, Musatafa Abbas Abbood Albadr, Majid Razaq Mohamed Alsemawi, Sharifah Zarith Rahmah Syed Ahmad, التعرف على صور عاطفة الوجه بناءً على الخوارزمية الجينية الثنائية - الغابة العشوائية , مجلة بغداد للعلوم: مجلد 21 عدد 2(SI) (2024): 2(Special Issue) ICAC2023/PARS2023

يمكنك أيضاً إبدأ بحثاً متقدماً عن المشابهات لهذا المؤلَّف.

CS-IF

2.0

CiteScore

1.2

Impact Factor

إنشاء طلب نشر

issn

P-ISSN: 2078-8665 | E-ISSN: 2411-7986

journalindexing

Journal Indexing
SCOPUS
Directory of Open Access Journals DOAJ
Library of Congress
Iraqi Academic Scientific Journal
Open Access Scholarly Publishers Association (OASPA)
SNIP (Source Normalized Impact Per Paper)

journalinfo

Journal Info
Journal: Baghdad Science Journal
Publisher: College of Science for Women/ University of Baghdad
Baghdad Sci. J. is peer-reviewed and open access
Print ISSN: 2078-8665
Electronic ISSN: 2411-7986
Publishing Frequency: Quarterly (from 2004 - 2021) Bi-monthly (from 2022) Monthly (from 2024)
Launched Date: 2004
Abbreviation: Baghdad Sci.J.
Each published paper in Baghdad Sci. J. has a digital object identifier (DOI) number

اللغة

scopus

1.3

2022CiteScore

50th percentile

ca

cope

sjr

locongress

clockss

Ithenticate

Sherpa Romeo

crossref

WHO

sci journal

uob digital repository

Scilit

cc

© 2022 The Author(s). Published by College of Science for Women, University of Baghdad. This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

الشريط الجانبي للمقالة

محتوى المقالة الرئيسي

الملخص

تفاصيل المقالة

كيفية الاقتباس

المراجع

المؤلفات المشابهة