Adaptive voice smoothing with optimal playback delay based on the ITU-T E-model

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Perceived voice quality is mainly affected by IP network impairments such as delay, jitter and packet loss. Adaptive smoothing buffer at the receiving end can compensate for the effects of jitter based on a tradeoff between delay and loss to archive a best voice quality. This work formulates an online loss model which incorporates buffer sizes and applies the ITU-T E-model approach to optimize the delay-loss problem. Distinct from the other optimal smoothers, the proposed optimal smoother suitable for most of codecs carries the lowest complexity. Since the adaptive smoothing scheme introduces variable playback delays, the buffer re-synchronization between the capture and the playback becomes essential. This work also presents a buffer re-synchronization algorithm based on silence skipping to prevent unacceptable increase in the buffer preloading delay and even buffer overflow. Simulation experiments validate that the proposed adaptive smoother archives significant improvement in the voice quality.

Original languageEnglish
Title of host publicationEmbedded and Ubiquitous Computing - International Conference EUC 2005, Proceedings
PublisherSpringer Verlag
Pages805-815
Number of pages11
ISBN (Print)3540308075, 9783540308072
DOIs
StatePublished - 2005
EventInternational Conference on Embedded and Ubiquitous Computing, EUC 2005 - Nagasaki, Japan
Duration: 6 Dec 20059 Dec 2005

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3824 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceInternational Conference on Embedded and Ubiquitous Computing, EUC 2005
Country/TerritoryJapan
CityNagasaki
Period6/12/059/12/05

Fingerprint

Dive into the research topics of 'Adaptive voice smoothing with optimal playback delay based on the ITU-T E-model'. Together they form a unique fingerprint.

Cite this