A prelingual tool for the education of altered voices
This paper addresses the problem of Computer-Aided Voice Therapy for altered voices. The proposal of the work is to develop a set of free activities called PreLingua for providing interactive voice therapy to a population of individuals with voice disorders. The interactive tools are designed to tra...
- Autores:
- Tipo de recurso:
- Fecha de publicación:
- 2012
- Institución:
- Universidad del Rosario
- Repositorio:
- Repositorio EdocUR - U. Rosario
- Idioma:
- eng
- OAI Identifier:
- oai:repository.urosario.edu.co:10336/27720
- Acceso en línea:
- https://doi.org/10.1016/j.specom.2011.05.006
https://repository.urosario.edu.co/handle/10336/27720
- Palabra clave:
- Altered voice
Voice therapy
Speech processing
Formant normalization
Vocal tract length estimation
- Rights
- License
- Restringido (Acceso a grupos específicos)
id |
EDOCUR2_9f8a8d53fc441d04097f4c54a383775c |
---|---|
oai_identifier_str |
oai:repository.urosario.edu.co:10336/27720 |
network_acronym_str |
EDOCUR2 |
network_name_str |
Repositorio EdocUR - U. Rosario |
repository_id_str |
|
spelling |
798055896008a33594b-a7e0-416c-b950-82fb57cfb1160bb16363-d9bc-486b-a1a7-39ef38ebe3402020-08-19T14:43:31Z2020-08-19T14:43:31Z2012-06This paper addresses the problem of Computer-Aided Voice Therapy for altered voices. The proposal of the work is to develop a set of free activities called PreLingua for providing interactive voice therapy to a population of individuals with voice disorders. The interactive tools are designed to train voice skills like: voice production, intensity, blow, vocal onset, phonation time, tone, and vocalic articulation for Spanish language. The development of these interactive tools along with the underlying speech technologies that support them requires the existence of speech processing, whose algorithms must be robust with respect to the sources of speech variability that are characteristic of this population of speakers. One of the main problem addressed is how to estimate reliably formant frequencies in high-pitched speech (typical in children and women) and how to normalize these estimations independently of the characteristics of the speakers. Linear prediction coding, homomorphic analysis and modeling of the vocal tract are the core of the speech processing techniques used to allow such normalization through vocal tract length. This paper also presents the result of an experimental study where PreLingua was applied in a population with voice disorders and pathologies in special education centers in Spain and Colombia. Promising results were obtained in this preliminary study after 12 weeks of therapy, as it showed improvements in the voice capabilities of a remarkable number of users and the ability of the tool to educate impaired users with voice alterations. This improvement was assessed by the evaluation of the educators before and after the study and also by the performance of the subjects in the activities of PreLingua. The results were very encouraging to keep working in this direction, with the overall aim of providing further functionalities and robustness to the system.application/pdfhttps://doi.org/10.1016/j.specom.2011.05.006ISSN: 0167-6393EISSN: 1872-7182https://repository.urosario.edu.co/handle/10336/27720engElsevier600No. 5583Speech CommunicationVol. 54Speech Communication, ISSN: 0167-6393;EISSN: 1872-7182, Vol.54, No.5 (2012); pp. 583-600https://www.sciencedirect.com/science/article/abs/pii/S0167639311000719Restringido (Acceso a grupos específicos)http://purl.org/coar/access_right/c_16ecSpeech Communicationinstname:Universidad del Rosarioreponame:Repositorio Institucional EdocURAltered voiceVoice therapySpeech processingFormant normalizationVocal tract length estimationA prelingual tool for the education of altered voicesUna herramienta prelingual para la educación de voces alteradasarticleArtículohttp://purl.org/coar/version/c_970fb48d4fbd8a85http://purl.org/coar/resource_type/c_6501Rodríguez-Dueñas, William R.Saz, OscarLleida, Eduardo10336/27720oai:repository.urosario.edu.co:10336/277202021-10-15 06:10:39.24https://repository.urosario.edu.coRepositorio institucional EdocURedocur@urosario.edu.co |
dc.title.spa.fl_str_mv |
A prelingual tool for the education of altered voices |
dc.title.TranslatedTitle.spa.fl_str_mv |
Una herramienta prelingual para la educación de voces alteradas |
title |
A prelingual tool for the education of altered voices |
spellingShingle |
A prelingual tool for the education of altered voices Altered voice Voice therapy Speech processing Formant normalization Vocal tract length estimation |
title_short |
A prelingual tool for the education of altered voices |
title_full |
A prelingual tool for the education of altered voices |
title_fullStr |
A prelingual tool for the education of altered voices |
title_full_unstemmed |
A prelingual tool for the education of altered voices |
title_sort |
A prelingual tool for the education of altered voices |
dc.subject.keyword.spa.fl_str_mv |
Altered voice Voice therapy Speech processing Formant normalization Vocal tract length estimation |
topic |
Altered voice Voice therapy Speech processing Formant normalization Vocal tract length estimation |
description |
This paper addresses the problem of Computer-Aided Voice Therapy for altered voices. The proposal of the work is to develop a set of free activities called PreLingua for providing interactive voice therapy to a population of individuals with voice disorders. The interactive tools are designed to train voice skills like: voice production, intensity, blow, vocal onset, phonation time, tone, and vocalic articulation for Spanish language. The development of these interactive tools along with the underlying speech technologies that support them requires the existence of speech processing, whose algorithms must be robust with respect to the sources of speech variability that are characteristic of this population of speakers. One of the main problem addressed is how to estimate reliably formant frequencies in high-pitched speech (typical in children and women) and how to normalize these estimations independently of the characteristics of the speakers. Linear prediction coding, homomorphic analysis and modeling of the vocal tract are the core of the speech processing techniques used to allow such normalization through vocal tract length. This paper also presents the result of an experimental study where PreLingua was applied in a population with voice disorders and pathologies in special education centers in Spain and Colombia. Promising results were obtained in this preliminary study after 12 weeks of therapy, as it showed improvements in the voice capabilities of a remarkable number of users and the ability of the tool to educate impaired users with voice alterations. This improvement was assessed by the evaluation of the educators before and after the study and also by the performance of the subjects in the activities of PreLingua. The results were very encouraging to keep working in this direction, with the overall aim of providing further functionalities and robustness to the system. |
publishDate |
2012 |
dc.date.created.spa.fl_str_mv |
2012-06 |
dc.date.accessioned.none.fl_str_mv |
2020-08-19T14:43:31Z |
dc.date.available.none.fl_str_mv |
2020-08-19T14:43:31Z |
dc.type.eng.fl_str_mv |
article |
dc.type.coarversion.fl_str_mv |
http://purl.org/coar/version/c_970fb48d4fbd8a85 |
dc.type.coar.fl_str_mv |
http://purl.org/coar/resource_type/c_6501 |
dc.type.spa.spa.fl_str_mv |
Artículo |
dc.identifier.doi.none.fl_str_mv |
https://doi.org/10.1016/j.specom.2011.05.006 |
dc.identifier.issn.none.fl_str_mv |
ISSN: 0167-6393 EISSN: 1872-7182 |
dc.identifier.uri.none.fl_str_mv |
https://repository.urosario.edu.co/handle/10336/27720 |
url |
https://doi.org/10.1016/j.specom.2011.05.006 https://repository.urosario.edu.co/handle/10336/27720 |
identifier_str_mv |
ISSN: 0167-6393 EISSN: 1872-7182 |
dc.language.iso.spa.fl_str_mv |
eng |
language |
eng |
dc.relation.citationEndPage.none.fl_str_mv |
600 |
dc.relation.citationIssue.none.fl_str_mv |
No. 5 |
dc.relation.citationStartPage.none.fl_str_mv |
583 |
dc.relation.citationTitle.none.fl_str_mv |
Speech Communication |
dc.relation.citationVolume.none.fl_str_mv |
Vol. 54 |
dc.relation.ispartof.spa.fl_str_mv |
Speech Communication, ISSN: 0167-6393;EISSN: 1872-7182, Vol.54, No.5 (2012); pp. 583-600 |
dc.relation.uri.spa.fl_str_mv |
https://www.sciencedirect.com/science/article/abs/pii/S0167639311000719 |
dc.rights.coar.fl_str_mv |
http://purl.org/coar/access_right/c_16ec |
dc.rights.acceso.spa.fl_str_mv |
Restringido (Acceso a grupos específicos) |
rights_invalid_str_mv |
Restringido (Acceso a grupos específicos) http://purl.org/coar/access_right/c_16ec |
dc.format.mimetype.none.fl_str_mv |
application/pdf |
dc.publisher.spa.fl_str_mv |
Elsevier |
dc.source.spa.fl_str_mv |
Speech Communication |
institution |
Universidad del Rosario |
dc.source.instname.none.fl_str_mv |
instname:Universidad del Rosario |
dc.source.reponame.none.fl_str_mv |
reponame:Repositorio Institucional EdocUR |
repository.name.fl_str_mv |
Repositorio institucional EdocUR |
repository.mail.fl_str_mv |
edocur@urosario.edu.co |
_version_ |
1814167583927042048 |