TY - GEN
T1 - CASCO
T2 - 40th Computer Graphics International Conference, CGI 2023
AU - Zhang, Xinxin
AU - Liu, Hang
AU - Chen, Xinru
AU - Qin, Rui
AU - Zhu, Yan
AU - Li, Wenfang
AU - Hu, Menghan
AU - Zhang, Jian
N1 - Publisher Copyright: © 2024, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2024
Y1 - 2024
N2 - Cough, a common symptom of respiratory disease, produces a distinctive sound. Cough detection is of great significance for preventing, assessing, and controlling epidemics. This paper proposes CASCO (Cough Analysis System using Short-Time Fourier Transform (STFT) and Convolutional Neural Networks (CNN) in the WeChat mini Program), a cough detection system that counts coughs by means of an audio division algorithm. The system combines STFT with a CNN and achieves an accuracy of 97.0%, a precision of 95.6%, a recall of 98.7%, and an F1-score of 0.97 in cough detection. The model is embedded in a WeChat mini program, which makes cough detection feasible on smartphones and enables large-scale, contactless cough screening. Future research could combine audio and video signals to further improve the accuracy of large-scale cough screening.
KW - Audio Signal Processing
KW - Cough detection
KW - Deep neural network
UR - https://www.scopus.com/pages/publications/85180814622
U2 - 10.1007/978-3-031-50078-7_23
DO - 10.1007/978-3-031-50078-7_23
M3 - Conference contribution
AN - SCOPUS:85180814622
SN - 9783031500770
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 287
EP - 300
BT - Advances in Computer Graphics - 40th Computer Graphics International Conference, CGI 2023, Proceedings
A2 - Sheng, Bin
A2 - Bi, Lei
A2 - Kim, Jinman
A2 - Magnenat-Thalmann, Nadia
A2 - Thalmann, Daniel
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 28 August 2023 through 1 September 2023
ER -