Voice Activity Detection Using Weighted K-Means Thresholding Algorithm
Alimi Sheriff¹, Yussuff I. O. Abayomi²

¹Alimi Sheriff, Department of Computer Science, Babcock University, Ilishan Remo (Ogun State), Nigeria.

²Yussuff I. O. Abayomi, Associate Professor, Department of Electronic and Computer Engineering, Lagos State University, Epe (Lagos), Nigeria.

Open Access | Editorial and Publishing Policies | Cite | Zenodo | OJS | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Voice activity detection (VAD) separates speech segments from silent segments of an audio signal, and it is valuable for many speech-processing applications because it assists in improving performance and system efficiency; such applications include speech recognition and speaker verification. In this study, K-means, a clustering algorithm, was extended to a thresholding algorithm termed K-means weighted thresholding and was utilised for discriminating voiced/speech segments from silent segments in audio or speech signals. The voice signal was fragmented into frames of 2048 samples, and the spectral power of the frames served as input for computing the threshold value by the extended k-means algorithm; hence, any frame whose spectral power is greater than or equal to the threshold value is considered to part of the voice segments; otherwise, it is tagged as a silent frame. The implemented voice activity detection system achieved outstanding performances with an actual acceptance rate (sensitivity), false acceptance rate, actual rejection rate (specificity), false rejection rate (miss rate), and a classification accuracy of 100%, 0.025%, 100%, 0%, and 99.97%, respectively.

Keywords: K-Means, Thresholding Algorithm, Voice Activity Detection.
Scope of the Article: Artificial Intelligence and Methods

Download PDF

JOURNAL

REQUIREMENTS

PRODUCT

CONTACT US

D105114040325

Share this entry

JOURNAL

REQUIREMENTS

PRODUCT

CONTACT US