Abstract
Identifying medicinal plant leaves is crucial because their cultivation and production are essential to the medicine industry. Many classes of medicinal leaves look nearly identical yet serve different purposes and are used as remedies for different diseases. Automated methods that are fast and accurate are therefore needed. State-of-the-art models have been trained to discern subtle distinctions between leaf species, such as variations in texture, shape, and color, which are often imperceptible to the human eye. In this research, a transfer learning (TL) based VGG16 model and Vision Transformer (ViT) models such as ConvMixer and Compact Convolutional Transformer (CCT) are implemented to classify medicinal leaf images, using a dataset of 38,066 leaf images across 10 classes. A customized Convolutional Neural Network (CNN) and a hybrid CNN-ViT model are also proposed; both have far fewer parameters than the other models compared, making them lightweight and less computationally expensive. In the experimental evaluation, all results are reported for 30 epochs of training. VGG16, CCT, and ConvMixer produce AUC scores of 0.50, 0.79, and 0.50, respectively, on the dataset, while the proposed CNN and hybrid models achieve AUC scores of 0.83 and 0.74, respectively. In addition, a hybrid denoising approach combining wavelet thresholding and Gaussian blurring is used to reduce noise in the images while retaining the original image quality.
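For readers less familiar with the metric, the AUC scores reported above can be read as the probability that a model ranks a randomly chosen image of a class above a randomly chosen image of any other class. The following is a minimal sketch of that definition in plain Python. The abstract does not state how AUC was aggregated over the 10 classes, so macro-averaged one-vs-rest is an assumption here; the function names `binary_auc` and `macro_ovr_auc` and the toy data are illustrative and do not come from the paper.

```python
from typing import Sequence


def binary_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outranks a random negative.

    Ties count as half a win (the Mann-Whitney U formulation).
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


def macro_ovr_auc(labels: Sequence[int], probs: Sequence[Sequence[float]]) -> float:
    """Macro-averaged one-vs-rest AUC (an assumed aggregation, not the paper's stated one)."""
    num_classes = len(probs[0])
    per_class = []
    for c in range(num_classes):
        one_vs_rest = [1 if y == c else 0 for y in labels]
        class_scores = [row[c] for row in probs]
        per_class.append(binary_auc(one_vs_rest, class_scores))
    return sum(per_class) / num_classes


# Toy data (hypothetical, not from the paper): 3 classes, 6 samples.
labels = [0, 0, 1, 1, 2, 2]
informative = [
    [0.8, 0.1, 0.1],
    [0.6, 0.3, 0.1],
    [0.2, 0.7, 0.1],
    [0.3, 0.5, 0.2],
    [0.1, 0.2, 0.7],
    [0.2, 0.2, 0.6],
]
# A model that emits the same scores for every image carries no ranking information.
uninformative = [[1 / 3, 1 / 3, 1 / 3] for _ in labels]

print(macro_ovr_auc(labels, informative))    # 1.0
print(macro_ovr_auc(labels, uninformative))  # 0.5
```

Under this definition, 1.0 means every class is ranked perfectly and 0.5 corresponds to chance-level ranking, which gives a scale for comparing the scores listed in the abstract.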
Keywords
Classification, CNN-ViT, Medicinal plant leaf, Transfer learning, Vision transformer
Subject Area
Computer Science
Article Type
Article
First Page
1065
Last Page
1076
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite this Article
Hossain, Shahriar; Hasan, Rizbanul; and Uddin, Jia (2025) "Medicinal Plant Leaf Classification using Deep Learning and Vision Transformers," Baghdad Science Journal: Vol. 22: Iss. 3, Article 30.
DOI: https://doi.org/10.21123/bsj.2024.10844