首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Enhancing Speech Recognition for Parkinson's Disease Patient Using Transfer Learning Technique
Authors:YU Qing  MA Yi  LI Yongfu
Abstract:Parkinson's disease patients suffer from disorders of speech.The most frequently reported speech problems are weak,hoarse,nasal or monotonous voice,imprecise articulation,slow or fast speech,difficulty starting speech,impaired stress or rhythm,stuttering,and tremor.To improve the speech quality and assist the patient with speech rehabilitation therapy,we have proposed the speech recognition model for Parkinson's disease patients using transfer learning technique (PSTL),where we have pre-trained the long short-term memory (LSTM)neural network model with our developed publicly available dataset that has been obtained from healthy people through the social media platform.Then,we applied the transfer learning technique to improve the performance of the PSTL framework.The frequency spectrogram masking data augmentation method has been used to alleviate the over-fitting problem so that the word error rate (WER) is further reduced.Even with a limited dataset,our proposed model has effectively reduced the WER from 58% to 44.5% on the original speech dataset and 53.1% to 43% on the denoised speech dataset,which demonstrated the feasibility of our framework.
Keywords:speech recognition  parkinson's disease  transfer learning technique  data augmentation  scarce data
本文献已被 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号