Image 4_MoSViT: a lightweight vision transformer framework for efficient disease detection via precision attention mechanism.tif

figure

posted on 2025-03-26, 06:30 authored by Yuanqi Chen, Aiping Wang, Ziyang Liu, Jie Yue, Enxu Zhang, Fei Li, Ning Zhang

Maize, a globally essential staple crop, suffers significant yield losses due to diseases. Traditional diagnostic methods are often inefficient and subjective, posing challenges for timely and accurate pest management. This study introduces MoSViT, an innovative classification model leveraging advanced machine learning and computer vision technologies. Built on the MobileViT V2 framework, MoSViT integrates the CLA focus mechanism, DRB module, MoSViT Block, and the LeakyRelu6 activation function to enhance feature extraction accuracy while reducing computational complexity. Trained on a dataset of 3,850 images encompassing Blight, Common Rust, Gray Leaf Spot, and Healthy conditions, MoSViT achieves exceptional performance, with classification accuracy, Precision, Recall, and F1 Score of 98.75%, 98.73%, 98.72%, and 98.72%, respectively. These results surpass leading models such as Swin Transformer V2, DenseNet121, and EfficientNet V2 in both accuracy and parameter efficiency. Additionally, the model's interpretability is enhanced through heatmap analysis, providing insights into its decision-making process. Testing on small sample datasets further demonstrates MoSViT's generalization capability and potential for small-sample detection scenarios.

History

Usage metrics

Keywords

precision attention maize disease detection deep learning MobileViT V2 parallel attention mechanism few-shot object detection

Licence

CC BY 4.0

Image 4_MoSViT: a lightweight vision transformer framework for efficient disease detection via precision attention mechanism.tif

History

Usage metrics

Categories

Keywords

Licence

Exports