A simple but efficient voice activity detection algorithm through Hilbert transform and dynamic threshold for speech pathologies
A simple but efficient voice activity detector based on the Hilbert transform and a dynamic threshold is presented to be used on the pre-processing of audio signals -- The algorithm to define the dynamic threshold is a modification of a convex combination found in literature -- This scheme allows th...
- Autores:
-
Ortiz P., D.
Villa, Luisa F.
Salazar, Carlos
Quintero, O.L.
Ortiz P., D.
Villa, Luisa F.
Salazar, Carlos
Quintero, O.L.
- Tipo de recurso:
- Fecha de publicación:
- 2016
- Institución:
- Universidad EAFIT
- Repositorio:
- Repositorio EAFIT
- Idioma:
- eng
- OAI Identifier:
- oai:repository.eafit.edu.co:10784/8373
- Acceso en línea:
- http://hdl.handle.net/10784/8373
- Palabra clave:
- Transformada de Hilbert
Cancelación de ruidos
Señal monofónica
PROCESAMIENTO DE SEÑALES
PROCESAMIENTO DE SEÑALES - TÉCNICAS DIGITALES
MEDICIÓN DEL RUIDO
FILTROS ADAPTIVOS
ANÁLISIS DE FOURIER
TEORÍA ESPECTRAL (MATEMÁTICAS)
ANÁLISIS ESPECTRAL
PROCESOS DE GAUSS
UMBRAL AUDITIVO
Signal processing
Signal processing - Digital techniques
Noise - Measurement
Adaptive filters
Fourier analysis
Spectral theory (mathematics)
Spectrum analysis
Gaussian processes
Auditory threshold
Signal processing
Signal processing - Digital techniques
Noise - Measurement
Adaptive filters
Fourier analysis
Spectral theory (mathematics)
Spectrum analysis
Gaussian processes
Auditory threshold
Transformada de Hilbert
Cancelación de ruidos
Señal monofónica
- Rights
- License
- Acceso abierto
Summary: | A simple but efficient voice activity detector based on the Hilbert transform and a dynamic threshold is presented to be used on the pre-processing of audio signals -- The algorithm to define the dynamic threshold is a modification of a convex combination found in literature -- This scheme allows the detection of prosodic and silence segments on a speech in presence of non-ideal conditions like a spectral overlapped noise -- The present work shows preliminary results over a database built with some political speech -- The tests were performed adding artificial noise to natural noises over the audio signals, and some algorithms are compared -- Results will be extrapolated to the field of adaptive filtering on monophonic signals and the analysis of speech pathologies on futures works |
---|