TY - GEN
T1 - Normalized Gaussian Wasserstein Distance and Vision Transformer Based Erythrocytes and Spirochaete Detection with Bi-Level Routing Attention and Soft-NMS
AU - Lu, Guotao
AU - Li, Chen
AU - Xu, Ning
AU - Yao, Yudong
AU - Sun, Hongzan
AU - Huang, Xinyu
AU - Grzegorzek, Marcin
AU - Zeng, Qiaolin
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2026.
PY - 2026
Y1 - 2026
N2 - To solve the problem of detecting erythrocytes and spirochaete in blood, an improved YOLOV7 algorithm is proposed that integrates the attention mechanism, optimization loss function and NMS module, called SE-YOLOv7. This method used Normalized Gaussian Wasserstein Distance to optimize the original loss function, enhancing the model's detection performance for erythrocytes and spirochaete. Use soft-NMS instead of the NMS module to alleviate the network's detection problem of erythrocytes overlap. The addition of Bi-Level Routing Attention enhances the model's flexibility in computational allocation and content awareness. Experiments on DMID-ESD using Faster RCNN, SSD, Retinanet, YOLOV7 and SE-YOLOv7. Experimental results show that SE-YOLOv7 has a better detection effect, and its mAP is increased by 2.3% compared to YOLOv7. This is of great significance for achieving accurate detection of erythrocytes and spirochaete.
AB - To solve the problem of detecting erythrocytes and spirochaete in blood, an improved YOLOV7 algorithm is proposed that integrates the attention mechanism, optimization loss function and NMS module, called SE-YOLOv7. This method used Normalized Gaussian Wasserstein Distance to optimize the original loss function, enhancing the model's detection performance for erythrocytes and spirochaete. Use soft-NMS instead of the NMS module to alleviate the network's detection problem of erythrocytes overlap. The addition of Bi-Level Routing Attention enhances the model's flexibility in computational allocation and content awareness. Experiments on DMID-ESD using Faster RCNN, SSD, Retinanet, YOLOV7 and SE-YOLOv7. Experimental results show that SE-YOLOv7 has a better detection effect, and its mAP is increased by 2.3% compared to YOLOv7. This is of great significance for achieving accurate detection of erythrocytes and spirochaete.
KW - Bi-level routing attention
KW - Erythrocytes
KW - Normalized Gaussian Wasserstein distance
KW - Soft-NMS
KW - Spirochaete
KW - YOLOV7
UR - https://www.scopus.com/pages/publications/105011980877
UR - https://www.scopus.com/pages/publications/105011980877#tab=citedBy
U2 - 10.1007/978-981-96-2771-4_45
DO - 10.1007/978-981-96-2771-4_45
M3 - Conference contribution
AN - SCOPUS:105011980877
SN - 9789819627707
T3 - Lecture Notes in Electrical Engineering
SP - 511
EP - 522
BT - Proceedings of the 3rd International Conference on Internet of Things, Communication and Intelligent Technology - Intelligent Technology
A2 - Dong, Jian
A2 - Zhang, Long
A2 - Zheng, Tongxing
T2 - 3rd International Conference on Internet of Things, Communication and Intelligent Technology, IoTCIT 2024
Y2 - 29 June 2024 through 1 July 2024
ER -