Post-Processing for the Mask of Computational Auditory Scene Analysis in Monaural Speech Segregation

Lai, Wen-Hsing; Yang, Cheng-Jia; Wang, Siou-Lin


Document title:	Post-Processing for the Mask of Computational Auditory Scene Analysis in Monaural Speech Segregation
Journal:	Computación y sistemas
Database:	PERIÓDICA
System number:	000423283
ISSN:	1405-5546
Authors:	Lai, Wen-Hsing¹; Yang, Cheng-Jia¹; Wang, Siou-Lin¹
Institutions:	¹Kaohsiung First University of Science and Technology, Institute of Computer and Communication Engineering, Taiwan. China
Year:	2017
Period:	Oct-Dec
Volume:	21
Issue:	4
Country:	Mexico
Language:	English
Document type:	Article
Approach:	Applied, descriptive
English abstract:	Speech segregation is one of the most difficult tasks in speech processing. This paper uses computational auditory scene analysis, a support vector machine classifier, and post-processing of the binary mask to separate speech from background noise. Mel-frequency cepstral coefficients and pitch are the two features used for support vector machine classification. Connected component labeling, hole filling, and morphological operations are applied to the resulting binary mask as post-processing. Experimental results show that the method separates speech from background noise effectively.
Disciplines:	Computer science, Literature and linguistics
Keywords:	Applied linguistics, Speech processing, Speech segregation, Connected component labeling
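The abstract names connected component labeling and hole filling as post-processing steps on the binary mask. The following is only a minimal sketch of those two steps, not the authors' implementation. It assumes the mask is a 2-D grid in which 1 marks a speech-dominant time-frequency unit and 0 marks a noise-dominant one, that regions are 4-connected, and that speech regions smaller than a hypothetical `min_size` threshold are treated as spurious. The order of the two steps is also an assumption, and the morphological step is omitted for brevity.

```python
from collections import deque


def label_components(mask, value):
    """Return the 4-connected components of cells equal to `value`.

    Each component is a list of (row, col) positions.
    """
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if mask[r][c] != value or seen[r][c]:
                continue
            # Breadth-first flood fill starting from an unvisited cell.
            seen[r][c] = True
            queue = deque([(r, c)])
            component = []
            while queue:
                y, x = queue.popleft()
                component.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and not seen[ny][nx] and mask[ny][nx] == value):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            components.append(component)
    return components


def postprocess_mask(mask, min_size=2):
    """Remove small speech regions, then fill enclosed holes.

    `min_size` is a hypothetical threshold chosen for illustration.
    The input mask is not modified; a cleaned copy is returned.
    """
    rows, cols = len(mask), len(mask[0])
    out = [row[:] for row in mask]

    # Step 1: connected component labeling on the 1-cells,
    # discarding components with fewer than `min_size` cells.
    for component in label_components(out, 1):
        if len(component) < min_size:
            for y, x in component:
                out[y][x] = 0

    # Step 2: hole filling. A hole is a component of 0-cells
    # that does not touch the border of the mask.
    for component in label_components(out, 0):
        touches_border = any(
            y in (0, rows - 1) or x in (0, cols - 1) for y, x in component
        )
        if not touches_border:
            for y, x in component:
                out[y][x] = 1
    return out


if __name__ == "__main__":
    noisy = [
        [0, 0, 0, 0, 0, 0],
        [0, 1, 1, 1, 0, 0],
        [0, 1, 0, 1, 0, 1],
        [0, 1, 1, 1, 0, 0],
        [0, 0, 0, 0, 0, 0],
    ]
    for row in postprocess_mask(noisy):
        print(row)
```

In the demo mask, the isolated 1 in the last column is a single-cell component and is removed, while the 0 enclosed by the ring of 1s is a hole and is filled. A real system would likely apply these steps to a mask with one row per frequency channel and one column per time frame.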
