SW-YOLO: Lightweight Attitude Estimation Algorithm Based on Weighted Convolution and Star Network
Download PDF
$currentUrl="http://$_SERVER[HTTP_HOST]$_SERVER[REQUEST_URI]"

Keywords

YOLO11-Pose
WConv
StarNet
Lightweight algorithms
Feature fusion

DOI

10.26689/jera.v9i5.11990

Submitted : 2025-09-17
Accepted : 2025-10-02
Published : 2025-10-17

Abstract

This paper proposes SW-YOLO (StarNet Weighted-Conv YOLO), a lightweight human pose estimation network for edge devices. Current mainstream pose estimation algorithms are computationally inefficient and have poor feature capture capabilities for complex poses and occlusion scenarios. This work introduces a lightweight backbone architecture that integrates WConv (Weighted Convolution) and StarNet modules to address these issues. Leveraging StarNet’s superior capabilities in multi-level feature fusion and long-range dependency modeling, this architecture enhances the model’s spatial perception of human joint structures and contextual information integration. These improvements significantly enhance robustness in complex scenarios involving occlusion and deformation. Additionally, the introduction of WConv convolution operations, based on weight recalibration and receptive field optimization, dynamically adjusts feature importance during convolution. This reduces redundant computations while maintaining or enhancing feature representation capabilities at an extremely low computational cost. Consequently, SW-YOLO substantially reduces model complexity and inference latency while preserving high accuracy, significantly outperforming existing lightweight networks.

References

Ma N, Zhang X, Zheng HT, et al., 2018, ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design, in Ferrari V, Hebert M, Sminchisescu C, Weiss Y, (eds) Computer Vision – ECCV 2018. Lecture Notes in Computer Science, vol 11218, Springer, Cham.

Zhang X, Zhou X, Lin M, et al., 2017, ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices. arXiv. https://arxiv.org/abs/1707.01083

Yu C, Xiao B, Gao C, et al., 2021, Lite-HRNet: A Lightweight High-Resolution Network, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

Kreiss S, Bertoni L, Alahi A, 2022, OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association. IEEE Transactions on Intelligent Transportation Systems, 23(8): 13498–13511.

Cao Z, Hidalgo G, Simon T, et al., 2021, OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(1): 172–186.

Howard AG, Zhu M, Chen B, et al., 2017, MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv. https://arxiv.org/abs/1704.04861

Ma X, Dai X, Bai Y, et al., 2024, Rewrite the Stars, 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

Cammarasana S, 2025, Optimal Weighted Convolution for Classification and Denosing. arXiv. https://arxiv.org/abs/2505.24558

Khanam R, Hussain M, 2024, YOLOv11: An Overview of the Key Architectural Enhancements. arXiv. https://arxiv.org/abs/2410.17725

Ioffe S, Szegedy C, 2015, Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift, Proceedings of the 32nd International Conference on Machine Learning, PMLR 37: 448–456.

Ba JL, 2016, Layer Normalization. arXiv. https://arxiv.org/abs/1607.06450

Chollet F, 2016, Xception: Deep Learning with Depthwise Separable Convolutions. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1800–1807.

Sandler M, Howard A, Zhu M, et al., 2018, MobileNetV2: Inverted Residuals and Linear Bottlenecks, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

Wang J, Sun K, Cheng T, et al., 2021, Deep High-Resolution Representation Learning for Visual Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(10): 3349–3364.

Yuan F, Lin Z, Tian Z, et al., 2025, Bio-Inspired Hybrid Path Planning for Efficient and Smooth Robotic Navigation. Int J Intell Robot Appl.

Liang B, Yuan F, Deng J, et al., 2025, Cs-pbft: A Comprehensive Scoring-Based Practical Byzantine Fault Tolerance Consensus Algorithm. J Supercomput, 81: 859.

Yuan F, Huang X, Jiang H, et al., 2025, An xLSTM–XGBoost Ensemble Model for Forecasting Non-Stationary and Highly Volatile Gasoline Price. Computers, 14: 256.

Zhang K, Yuan F, Jiang Y, et al., 2025, A Particle Swarm Optimization-Guided Ivy Algorithm for Global Optimization Problems. Biomimetics (Basel), 10(5): 342.