Abstract:To address the issues of low detection accuracy for hand-washing actions, significant environmental noise interference, and confusion between similar actions in the existing YOLOv7 model, an improved CCL-YOLO object detection algorithm based on YOLOv7 is proposed. This algorithm introduces the self-attention mechanism EALN-CotAttention to obtain more contextual information; it utilizes CARAFE to optimize the upsampling of the nearest neighbor interpolation method in YOLOv7, promoting efficient content perception and feature recombination, thereby improving the accuracy of action detection; it replaces the SPPCSPC in YOLOv7 with SPPFCSPC, achieving the same receptive field with fewer parameters in the convolutional kernel; it adopts the LADH-Dectect detection head instead of the original coupled detection head in the YOLO network, enhancing model performance. Compared to ordinary decoupled detection heads, this reduces network parameters and GFLOPs, thereby increasing inference speed. Experiments on a custom dataset show that the proposed algorithm improves average precision to 81.2%, which represents a 7.2% increase in detection accuracy, a 2.9% increase in precision, and an 11% improvement in recall rate compared to the YOLOv7 algorithm, better meeting the practical needs of hand-washing action detection.