Ablation experiments on custom dataset.
Pyrotechnic detection has always been one of the critical issues in blasting safety. Due to the complex environment of blasting sites, irregular detonator wire postures, and the differences in object scales, making the detection of pyrotechnics more challenging. To address these challenges, this paper proposes an improved algorithm based on a multi-scale parallel attention mechanism and wavelet-separable convolution, called WA-YOLO. First, we integrate wavelet convolution into depthwise separable convolution and propose a novel convolutional block (WSDConv, Wavelet Separable Depthwise Convolution). This new convolutional block is added to the model’s backbone, improving feature extraction while also lowering computational parameters. Furthermore, we introduce an improved Cross Stage Partial (CSP) structure by combining multi-scale convolutions with a parallel attention mechanism, embedding it into the C2f module of the neck network to improve the model’s ability to detect objects of varying scales in complex backgrounds. To tackle the detection accuracy drop caused by the irregular shapes and varying aspect ratios of detonator wires, the model uses the Wise-IoU loss function. This enhances the model’s generalization and robustness by improving the precision of overlap calculations for bounding boxes. The experimental results show that the improved model achieved an average precision increase of 12.6% on the self-built dataset, particularly with an average precision increase of 8.3% in the detection of detonators. Additionally, the model performance also improved on the VOC2012 dataset, with a recall increase of 1.3% and an average precision increase of 1.6%. These results indicate that the proposed model exhibits strong generalization capabilities, can work effectively across different datasets, and provides an effective solution to the challenges of target detection in blasting environments.