VLSI design for SC-based speaker recognition

Chien Yao Wang, Min Shih, Tzu Chiang Tai, Po Chuan Lin, Shih Ting Huang, Jia Hao Zhao, Jia Ching Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

This work presents an efficient VLSI architecture design for sparse coding (SC)-based speaker recognition system. The proposed system first extracts the linear predictive cepstral coefficients (LPCCs). Then, we applied orthogonal matching pursuit (OMP) for sparse coding and using the sparse coefficients as feature to do classification task. To speed up the computation time, our proposed chip comprises a LPCC module and an OMP module. The LPCC module computes the linear predictive coefficients (LPCs) and then converts LPCs to LPCCs. The OMP module includes residual unit, atom selection unit, QR decomposition unit, triangular matrix inverse unit and matrix multiplication unit. This designed chip has ability to handle a large dictionary size for sparse coding in OMP modules. The prototype chip is implemented using TSMC 90 nm CMOS technology on a die with a size of approximately 1.9×1.9 mm2.

Original languageEnglish
Title of host publicationProceedings of the 2015 10th IEEE Conference on Industrial Electronics and Applications, ICIEA 2015
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages335-338
Number of pages4
ISBN (Electronic)9781467373173
DOIs
StatePublished - 20 Nov 2015
Event10th IEEE Conference on Industrial Electronics and Applications, ICIEA 2015 - Auckland, New Zealand
Duration: 15 Jun 201517 Jun 2015

Publication series

NameProceedings of the 2015 10th IEEE Conference on Industrial Electronics and Applications, ICIEA 2015

Conference

Conference10th IEEE Conference on Industrial Electronics and Applications, ICIEA 2015
Country/TerritoryNew Zealand
CityAuckland
Period15/06/1517/06/15

Keywords

  • sparse coding
  • Speaker recognition
  • VLSI

Fingerprint

Dive into the research topics of 'VLSI design for SC-based speaker recognition'. Together they form a unique fingerprint.

Cite this