ElectrodeNet - A Deep-Learning-Based Sound Coding Strategy for Cochlear Implants

Enoch Hsin Ho Huang, Rong Chao, Yu Tsao, Chao Min Wu

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

ElectrodeNet, a deep-learning-based sound coding strategy for the cochlear implant (CI), is proposed to emulate the advanced combination encoder (ACE) strategy by replacing the conventional envelope detection with various artificial neural networks. The extended ElectrodeNet-CS strategy further incorporates channel selection (CS). Deep neural network (DNN), convolutional neural network (CNN), and long short-term memory (LSTM) models were trained using the fast Fourier transform (FFT) bins and channel envelopes obtained by processing clean speech with the ACE strategy. Objective speech understanding was estimated for ElectrodeNet in CI simulations using short-time objective intelligibility (STOI) and the normalized covariance metric (NCM). Sentence recognition tests for vocoded Mandarin speech were conducted with normal-hearing listeners. DNN-, CNN-, and LSTM-based ElectrodeNets exhibited strong correlations with ACE in objective and subjective scores, as measured by mean squared error (MSE), linear correlation coefficient (LCC), and Spearman's rank correlation coefficient (SRCC). The ElectrodeNet-CS strategy produced N-of-M compatible electrode patterns using a modified DNN that embeds maxima selection, and achieved similar or even slightly higher average STOI and sentence recognition scores compared with ACE. The methods and findings demonstrate the feasibility and potential of using deep learning in CI coding strategies.
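
As a rough illustration of two ideas named in the abstract, the sketch below shows a plain N-of-M maxima selection over channel envelopes and the MSE and LCC measures used to compare two envelope patterns. It is not the authors' implementation: the function names (`n_of_m_select`, `mse`, `lcc`), the array layout (frames by channels), and the toy values of M and N are assumptions made for this example only, and the paper's network-embedded selection is not reproduced here.

```python
import numpy as np


def n_of_m_select(envelopes, n):
    """Keep the n largest channel envelopes in each frame and zero the rest.

    envelopes: array of shape (frames, M), one column per channel (assumed layout).
    """
    envelopes = np.asarray(envelopes, dtype=float)
    m = envelopes.shape[1]
    if not 0 < n <= m:
        raise ValueError("n must satisfy 0 < n <= M")
    # Column indices of the n largest values in every frame.
    top = np.argpartition(envelopes, m - n, axis=1)[:, m - n:]
    mask = np.zeros(envelopes.shape, dtype=bool)
    np.put_along_axis(mask, top, True, axis=1)
    return np.where(mask, envelopes, 0.0)


def mse(a, b):
    """Mean squared error between two envelope patterns of equal shape."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    return float(np.mean((a - b) ** 2))


def lcc(a, b):
    """Linear (Pearson) correlation coefficient over all frame/channel entries."""
    return float(np.corrcoef(np.ravel(a), np.ravel(b))[0, 1])


# Toy example: 2 frames, M = 4 channels, N = 2 maxima (hypothetical values).
reference = np.array([[0.1, 0.9, 0.4, 0.3],
                      [0.5, 0.2, 0.8, 0.7]])
selected = n_of_m_select(reference, n=2)
```

In this toy setup, `selected` keeps only the two largest envelopes per frame, and `mse(reference, selected)` or `lcc(reference, selected)` gives a simple similarity score between the two patterns, in the spirit of the comparison between ElectrodeNet and ACE outputs described above.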

Original language: English
Pages (from-to): 346-357
Number of pages: 12
Journal: IEEE Transactions on Cognitive and Developmental Systems
Volume: 16
Issue number: 1
DOIs
Publication status: Published - 1 Feb 2024
