TY - JOUR
T1 - Multi-modal fusion in ergonomic health
T2 - bridging visual and pressure for sitting posture detection
AU - Quan, Qinxiao
AU - Gao, Yang
AU - Bai, Yang
AU - Jin, Zhanpeng
N1 - Publisher Copyright:
© China Computer Federation (CCF) 2024.
PY - 2024/12
Y1 - 2024/12
N2 - As the contradiction between the pursuit of health and the increasing duration of sedentary office work intensifies, there has been a growing focus on maintaining correct sitting posture while working in recent years. Scientific studies have shown that sitting posture correction plays a positive role in alleviating physical pain. With the rapid development of artificial intelligence technology, a significant amount of research has shifted towards implementing sitting posture detection and recognition systems using machine learning approaches. In this paper, we introduce an innovative sitting posture recognition system that integrates visual and pressure modalities. The system employs a differentiated pre-training strategy for training the bimodal models and features a feature fusion module designed based on feed-forward networks. Our system utilizes commonly available built-in cameras in laptops for collecting visual data and thin-film pressure sensor mats for pressure data in office scenarios. It achieved an F1-Macro score of 95.43% on a dataset with complex composite actions, marking an improvement of 7.13% and 10.79% over systems that rely solely on pressure or visual modalities, respectively, and a 7.07% improvement over systems using a uniform pre-training strategy.
AB - As the contradiction between the pursuit of health and the increasing duration of sedentary office work intensifies, there has been a growing focus on maintaining correct sitting posture while working in recent years. Scientific studies have shown that sitting posture correction plays a positive role in alleviating physical pain. With the rapid development of artificial intelligence technology, a significant amount of research has shifted towards implementing sitting posture detection and recognition systems using machine learning approaches. In this paper, we introduce an innovative sitting posture recognition system that integrates visual and pressure modalities. The system employs a differentiated pre-training strategy for training the bimodal models and features a feature fusion module designed based on feed-forward networks. Our system utilizes commonly available built-in cameras in laptops for collecting visual data and thin-film pressure sensor mats for pressure data in office scenarios. It achieved an F1-Macro score of 95.43% on a dataset with complex composite actions, marking an improvement of 7.13% and 10.79% over systems that rely solely on pressure or visual modalities, respectively, and a 7.07% improvement over systems using a uniform pre-training strategy.
KW - Computer vision
KW - Feature fusion
KW - Multi-label classification
KW - Pressure sensing
KW - Sitting posture recognition
UR - https://www.scopus.com/pages/publications/85202204438
U2 - 10.1007/s42486-024-00164-x
DO - 10.1007/s42486-024-00164-x
M3 - 文章
AN - SCOPUS:85202204438
SN - 2524-521X
VL - 6
SP - 380
EP - 393
JO - CCF Transactions on Pervasive Computing and Interaction
JF - CCF Transactions on Pervasive Computing and Interaction
IS - 4
M1 - 112451
ER -