TY - GEN
T1 - Exploring wav2vec 2.0 model for heart murmur detection
AU - Panah, Davoud Shariat
AU - Hines, Andrew
AU - McKeever, Susan
N1 - Publisher Copyright:
© 2023 European Signal Processing Conference, EUSIPCO. All rights reserved.
PY - 2023
Y1 - 2023
N2 - The lack of access to cardiology resources in many regions of the world has motivated the development of automatic diagnostic systems based on cardiac signals. In recent years, a wide range of supervised learning models have been proposed that can make an initial diagnosis of heart disease from heart sounds. To achieve high accuracy, however, such supervised learning models generally require a large amount of labeled data, which can be costly to obtain. In this regard, self-supervised learning has been recently employed to reduce the over-reliance on annotated data. Wav2vec 2.0 is an audio self-supervised learning model that has shown promising results in a variety of speech-related tasks. In this paper, we adapted the wav2vec 2.0 for murmur detection from heart sound signals. For this purpose, we pre-trained and fine-tuned this model on the Circor DigiScope heart sound dataset. The results confirm the feasibility of using the wav2vec 2.0 model for heart sound classification. The model shows a competitive performance by achieving a weighted accuracy of 0.80 and a UAR of 0.70 for murmur detection on the holdout test set. To investigate the impact of the fine-tuning data size on the downstream performance, we also fine-tuned the wav2vec 2.0 model on small sizes of annotated data. The results confirm that this model is robust to small fine-tuning data sizes, and as a result, can reduce our reliance on large, annotated heart sound data.
AB - The lack of access to cardiology resources in many regions of the world has motivated the development of automatic diagnostic systems based on cardiac signals. In recent years, a wide range of supervised learning models have been proposed that can make an initial diagnosis of heart disease from heart sounds. To achieve high accuracy, however, such supervised learning models generally require a large amount of labeled data, which can be costly to obtain. In this regard, self-supervised learning has been recently employed to reduce the over-reliance on annotated data. Wav2vec 2.0 is an audio self-supervised learning model that has shown promising results in a variety of speech-related tasks. In this paper, we adapted the wav2vec 2.0 for murmur detection from heart sound signals. For this purpose, we pre-trained and fine-tuned this model on the Circor DigiScope heart sound dataset. The results confirm the feasibility of using the wav2vec 2.0 model for heart sound classification. The model shows a competitive performance by achieving a weighted accuracy of 0.80 and a UAR of 0.70 for murmur detection on the holdout test set. To investigate the impact of the fine-tuning data size on the downstream performance, we also fine-tuned the wav2vec 2.0 model on small sizes of annotated data. The results confirm that this model is robust to small fine-tuning data sizes, and as a result, can reduce our reliance on large, annotated heart sound data.
KW - Heart Sound
KW - Murmur Detection
KW - Self-supervised Learning
KW - wav2vec 2.0
UR - http://www.scopus.com/inward/record.url?scp=85178329532&partnerID=8YFLogxK
U2 - 10.23919/EUSIPCO58844.2023.10289947
DO - 10.23919/EUSIPCO58844.2023.10289947
M3 - Conference contribution
AN - SCOPUS:85178329532
T3 - European Signal Processing Conference
SP - 1010
EP - 1014
BT - 31st European Signal Processing Conference, EUSIPCO 2023 - Proceedings
PB - European Signal Processing Conference, EUSIPCO
T2 - 31st European Signal Processing Conference, EUSIPCO 2023
Y2 - 4 September 2023 through 8 September 2023
ER -