Identificación de Personas por Medio de Reconocimiento de Voz

84 páginas

Autores:: Vacca Urrea, Edgar Santiago

Tipo de recurso:: Trabajo de grado de pregrado

Fecha de publicación:: 2013

Institución:: Universidad EIA .

Repositorio:: Repositorio EIA .

Idioma:: spa

id	REIA2_eac5839fc1465855a8548174f4b6f657
oai_identifier_str	oai:repository.eia.edu.co:11190/6468
network_acronym_str	REIA2
network_name_str	Repositorio EIA .
repository_id_str
dc.title.none.fl_str_mv	Identificación de Personas por Medio de Reconocimiento de Voz
title	Identificación de Personas por Medio de Reconocimiento de Voz
spellingShingle	Identificación de Personas por Medio de Reconocimiento de Voz Algoritmo Esperanza-Maximización (EM) Red Neuronal Artificial ART Modelos de Mezclas Gaussianas (GMM) Coeficientes Cepstrales de Frecuencia Mel (MFCC) Reconocimiento de locutor Expectation–maximization (EM) algorithm Artificial Neural Network ART Gaussian Mixture Models (GMM) Mel frequency Cepstral coefficients (MFCC) Speaker Recognition
title_short	Identificación de Personas por Medio de Reconocimiento de Voz
title_full	Identificación de Personas por Medio de Reconocimiento de Voz
title_fullStr	Identificación de Personas por Medio de Reconocimiento de Voz
title_full_unstemmed	Identificación de Personas por Medio de Reconocimiento de Voz
title_sort	Identificación de Personas por Medio de Reconocimiento de Voz
dc.creator.fl_str_mv	Vacca Urrea, Edgar Santiago
dc.contributor.author.none.fl_str_mv	Vacca Urrea, Edgar Santiago
dc.subject.proposal.spa.fl_str_mv	Algoritmo Esperanza-Maximización (EM) Red Neuronal Artificial ART Modelos de Mezclas Gaussianas (GMM) Coeficientes Cepstrales de Frecuencia Mel (MFCC) Reconocimiento de locutor
topic	Algoritmo Esperanza-Maximización (EM) Red Neuronal Artificial ART Modelos de Mezclas Gaussianas (GMM) Coeficientes Cepstrales de Frecuencia Mel (MFCC) Reconocimiento de locutor Expectation–maximization (EM) algorithm Artificial Neural Network ART Gaussian Mixture Models (GMM) Mel frequency Cepstral coefficients (MFCC) Speaker Recognition
dc.subject.proposal.eng.fl_str_mv	Expectation–maximization (EM) algorithm Artificial Neural Network ART Gaussian Mixture Models (GMM) Mel frequency Cepstral coefficients (MFCC) Speaker Recognition
description	84 páginas
publishDate	2013
dc.date.issued.none.fl_str_mv	2013
dc.date.accessioned.none.fl_str_mv	2024-03-01T15:16:09Z
dc.date.available.none.fl_str_mv	2024-03-01T15:16:09Z
dc.type.none.fl_str_mv	Trabajo de grado - Pregrado
dc.type.coar.none.fl_str_mv	http://purl.org/coar/resource_type/c_7a1f
dc.type.driver.none.fl_str_mv	info:eu-repo/semantics/bachelorThesis
dc.type.version.none.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.content.none.fl_str_mv	Text
dc.type.redcol.none.fl_str_mv	http://purl.org/redcol/resource_type/TP
dc.type.coarversion.none.fl_str_mv	http://purl.org/coar/version/c_970fb48d4fbd8a85
format	http://purl.org/coar/resource_type/c_7a1f
status_str	publishedVersion
dc.identifier.uri.none.fl_str_mv	https://repository.eia.edu.co/handle/11190/6468
url	https://repository.eia.edu.co/handle/11190/6468
dc.language.iso.none.fl_str_mv	spa
language	spa
dc.rights.none.fl_str_mv	Derechos Reservados - Univesidad EIA - 2013
dc.rights.uri.none.fl_str_mv	https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights.license.none.fl_str_mv	Atribución-NoComercial-SinDerivadas 4.0 Internacional (CC BY-NC-ND 4.0)
dc.rights.accessrights.none.fl_str_mv	info:eu-repo/semantics/openAccess
dc.rights.coar.none.fl_str_mv	http://purl.org/coar/access_right/c_abf2
rights_invalid_str_mv	Derechos Reservados - Univesidad EIA - 2013 https://creativecommons.org/licenses/by-nc-nd/4.0/ Atribución-NoComercial-SinDerivadas 4.0 Internacional (CC BY-NC-ND 4.0) http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv	openAccess
dc.format.mimetype.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidad EIA
dc.publisher.program.none.fl_str_mv	Ingeniería Mecatrónica
dc.publisher.faculty.none.fl_str_mv	Escuela de Ingeniería y Ciencias Básicas
dc.publisher.place.none.fl_str_mv	Envigado, Antioquia
publisher.none.fl_str_mv	Universidad EIA
institution	Universidad EIA .
bitstream.url.fl_str_mv	https://repository.eia.edu.co/bitstreams/623d397b-e3e1-44ee-9f9d-eaa5bc10d48b/download https://repository.eia.edu.co/bitstreams/73640811-bd91-497f-b9c8-d33a2bf26986/download https://repository.eia.edu.co/bitstreams/f47f5957-8de0-4e93-af63-5e1ea0856d48/download https://repository.eia.edu.co/bitstreams/bc55a218-fa8c-49fc-90ce-63f2d6a1048b/download
bitstream.checksum.fl_str_mv	d02b4450031677f21265fe9a55fd5511 2264fce645ac2952653ce3f3b8fa781e 9b9cbb56066f7e72cd56d331d9b75c5a 96a679b3240b8955c987f5f53ce6389b
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5 MD5 MD5
repository.name.fl_str_mv	Repositorio Institucional Universidad EIA
repository.mail.fl_str_mv	bdigital@metabiblioteca.com
_version_	1839635930625867776
spelling	Vacca Urrea, Edgar Santiago2024-03-01T15:16:09Z2024-03-01T15:16:09Z2013https://repository.eia.edu.co/handle/11190/646884 páginasEste trabajo implementa un sistema de reconocimiento de voz para identificación de personas, esto con el objetivo de que sólo ellas puedan acceder a su información personal. Para su desarrollo, se realizó inicialmente una investigación en la que se tomaron diferentes técnicas de reconocimiento de voz para ser evaluadas y escoger la que mejor se acomodaba al sistema planteado. Posteriormente se convocaron varios individuos a los cuales se les tomaron datos personales para la realización de una base de datos con la que se comprobó el funcionamiento del sistema. Basados en la investigación realizada, se llegó a la conclusión que el sistema debe tener las propiedades de un sistema de reconocimiento automático de locutor independiente del texto, por esta razón, se escogieron los Coeficientes Cepstrales de Frecuencia Mel (MFCC) y los Modelos de Mezclas Gaussianas para el procesamiento de la señal de voz y así obtener los modelos paramétricos del locutor necesarios para la identificación. El paso siguiente fue implementar una red neuronal ART que actualiza los modelos paramétricos permitiendo que este se vaya adaptando a las características de la voz del locutor que van cambiando con el tiempo. Finalmente, se diseñó un programa en Matlab en el cual, el usuario puede escoger entre el registro para ingresar a la base de datos, el entrenamiento para ingresar al sistema de reconocimiento o la identificación donde el usuario espera que el sistema lo reconozca por las características de su voz.Abstract: This work developed a system of voice recognition for the identification of people, with the objective that only they can accede to their personal information. For its realization, initially was made an investigation, taking different techniques of voice recognition with the purpose to evaluate them and then take the best for the system. Subsequent, a group of people were called to take their personal information for a data base in order to test the work of the recognition system. Base on the investigation, the conclusion was that the system has to have the properties of a Text–Independent Speaker Identification System, for this reason, were chosen the Mel Frequency Cepstral Coefficients (MFCC) and the Gaussian Mixture Models for processing the speech signal and obtain the necessary parametric models for speaker identification. The next step was to implement an Artificial Neuronal Network ART which allow the actualization of the parametric models, permitting the adaptation of the voice characteristics of the speaker that change with the time. Finally, a Matlab program was designed, in which the user can choose between recording to enter the database, the training to enter the system for recognition or identifying where the user expects the system to recognize the characteristics of their voice.PregradoIngeniero Mecatrónicoapplication/pdfspaUniversidad EIAIngeniería MecatrónicaEscuela de Ingeniería y Ciencias BásicasEnvigado, AntioquiaDerechos Reservados - Univesidad EIA - 2013https://creativecommons.org/licenses/by-nc-nd/4.0/Atribución-NoComercial-SinDerivadas 4.0 Internacional (CC BY-NC-ND 4.0)info:eu-repo/semantics/openAccesshttp://purl.org/coar/access_right/c_abf2Identificación de Personas por Medio de Reconocimiento de VozTrabajo de grado - Pregradohttp://purl.org/coar/resource_type/c_7a1finfo:eu-repo/semantics/bachelorThesisinfo:eu-repo/semantics/publishedVersionTexthttp://purl.org/redcol/resource_type/TPhttp://purl.org/coar/version/c_970fb48d4fbd8a85Algoritmo Esperanza-Maximización (EM)Red Neuronal Artificial ARTModelos de Mezclas Gaussianas (GMM)Coeficientes Cepstrales de Frecuencia Mel (MFCC)Reconocimiento de locutorExpectation–maximization (EM) algorithmArtificial Neural Network ARTGaussian Mixture Models (GMM)Mel frequency Cepstral coefficients (MFCC)Speaker RecognitionPublicationORIGINALVaccaEdgar_2013_IdentificaciónPersonasMedio.pdfVaccaEdgar_2013_IdentificaciónPersonasMedio.pdfapplication/pdf9186057https://repository.eia.edu.co/bitstreams/623d397b-e3e1-44ee-9f9d-eaa5bc10d48b/downloadd02b4450031677f21265fe9a55fd5511MD51LICENSElicense.txtlicense.txttext/plain; charset=utf-82553https://repository.eia.edu.co/bitstreams/73640811-bd91-497f-b9c8-d33a2bf26986/download2264fce645ac2952653ce3f3b8fa781eMD52TEXTVaccaEdgar_2013_IdentificaciónPersonasMedio.pdf.txtVaccaEdgar_2013_IdentificaciónPersonasMedio.pdf.txtExtracted texttext/plain102265https://repository.eia.edu.co/bitstreams/f47f5957-8de0-4e93-af63-5e1ea0856d48/download9b9cbb56066f7e72cd56d331d9b75c5aMD53THUMBNAILVaccaEdgar_2013_IdentificaciónPersonasMedio.pdf.jpgVaccaEdgar_2013_IdentificaciónPersonasMedio.pdf.jpgGenerated Thumbnailimage/jpeg7138https://repository.eia.edu.co/bitstreams/bc55a218-fa8c-49fc-90ce-63f2d6a1048b/download96a679b3240b8955c987f5f53ce6389bMD5411190/6468oai:repository.eia.edu.co:11190/64682024-03-02 03:00:48.619https://creativecommons.org/licenses/by-nc-nd/4.0/Derechos Reservados - Univesidad EIA - 2013open.accesshttps://repository.eia.edu.coRepositorio Institucional Universidad EIAbdigital@metabiblioteca.comCjxjZW50ZXI+PGI+QVZJU08gREUgUFJJVkFDSURBRDwvYj48L2NlbnRlcj4KPGJyPgo8cD5MYSBFc2N1ZWxhIGRlIEluZ2VuaWVyw61hIGRlIEFudGlvcXVpYSBhIHRyYXbDqXMgZGUgZXN0ZSBhdmlzbywgaW5mb3JtYSBhIGxvcyB0aXR1bGFyZXMgZGUgZGF0b3MgcGVyc29uYWxlcyBxdWUgc2UgZW5jdWVudHJlbiBlbiBzdXMgYmFzZXMgZGUgZGF0b3MgcXVlIGxhcyBwb2zDrXRpY2FzIGRlIHRyYXRhbWllbnRvIGRlIGRhdG9zIHBlcnNvbmFsZXMgbGEgRUlBIHNvbjo8L3A+CjxwPkFsIHRpdHVsYXIgZGUgbG9zIGRhdG9zIHBlcnNvbmFsZXMgZW4gdHJhdGFtaWVudG8sIHNlIGxlIHJlc3BldGFyw6FuIHN1cyBkZXJlY2hvcyBhIGNvbm9jZXIgw61udGVncmFtZW50ZSB5IGRlIGZvcm1hIGdyYXR1aXRhIHN1cyBkYXRvcyBwZXJzb25hbGVzLCBhc8OtIGNvbW8gYSBhY3R1YWxpemFybG9zIHkgcmVjdGlmaWNhcmxvcyBmcmVudGUgYSBsYSBFSUEgbyBsb3MgZW5jYXJnYWRvcyBkZWwgdHJhdGFtaWVudG8uPC9wPgo8cD5BbCB0aXR1bGFyIGRlIGxvcyBkYXRvcyBwZXJzb25hbGVzIGVuIHRyYXRhbWllbnRvLCBwb2Ryw6EgY29ub2NlciBlbCB1c28gcXVlIHNlIGxlIGhhIGRhZG8gYSBzdXMgZGF0b3MgcGVyc29uYWxlcywgcHJldmlhIHNvbGljaXR1ZC48L3A+CjxwPkVsIHRpdHVsYXIgZGUgbG9zIGRhdG9zIHBlcnNvbmFsZXMgZW4gdHJhdGFtaWVudG8sIHBvZHLDoSBzb2xpY2l0YXIgcHJ1ZWJhIGRlIGxhIGF1dG9yaXphY2nDs24gb3RvcmdhZGEgYSBsYSBFSUEuIHNhbHZvIGN1YW5kbyBleHByZXNhbWVudGUgc2UgZXhjZXB0w7plIGNvbW8gcmVxdWlzaXRvIHBhcmEgZWwgdHJhdGFtaWVudG8sIGRlIGNvbmZvcm1pZGFkIGNvbiBsYSBsZXkuPC9wPgo8cD5FbCB0aXR1bGFyIGRlIGxvcyBkYXRvcyBwdWVkZSByZXZvY2FyIGxhIGF1dG9yaXphY2nDs24geSBzb2xpY2l0YXIgbGEgc3VwcmVzacOzbiBkZWwgZGF0byBjdWFuZG8gZW4gZWwgdHJhdGFtaWVudG8gbm8gc2UgcmVzcGV0ZW4gbG9zIHByaW5jaXBpb3MsIGRlcmVjaG9zIHkgZ2FyYW50w61hcyBjb25zdGl0dWNpb25hbGVzIHkgbGVnYWxlcy4gTGEgcmV2b2NhdG9yaWEgeSBzdXByZXNpw7NuIHByb2NlZGVyw6EgY3VhbmRvIGxhIFN1cGVyaW50ZW5kZW5jaWEgZGUgSW5kdXN0cmlhIHkgQ29tZXJjaW8gKFNJQykgaGF5YSBkZXRlcm1pbmFkbyBxdWUgZW4gZWwgdHJhdGFtaWVudG8sIGxhIEVTQ1VFTEEgREUgSU5HRU5JRVLDjUEgREUgQU5USU9RVUlBIGhhIGluY3VycmlkbyBlbiBjb25kdWN0YXMgY29udHJhcmlhcyBhIGVzdGEgTGV5IHkgYSBsYSBDb25zdGl0dWNpw7NuIFBvbMOtdGljYS48L3A+CjxwPlBhcmEgZWZlY3RvcyBkZSBlamVyY2VyIHN1cyBkZXJlY2hvcyBkZSBjb25vY2VyLCBhY3R1YWxpemFyLCByZWN0aWZpY2FyIHkgc3VwcmltaXIgaW5mb3JtYWNpw7NuLCByZXZvY2FyIGxhIGF1dG9yaXphY2nDs24sIGVudHJlIG90cm9zOyBlbCB0aXR1bGFyIGRlIGxvcyBkYXRvcyBwb2Ryw6EgYWN1ZGlyIGEgbGEgRVNDVUVMQSBERSBJTkdFTklFUsONQSBERSBBTlRJT1FVSUEsIGNvbW8gcmVzcG9uc2FibGUgZGVsIHRyYXRhbWllbnRvIGRlIGRhdG9zIGFsIMOhcmVhIGRlIGNvbXVuaWNhY2lvbmVzLCBtZWRpYW50ZSBjb3JyZW8gZWxlY3Ryw7NuaWNvIGEgd2VibWFzdGVyQGVpYS5lZHUuY28gLjwvcD4KPHA+RW4gY2FzbyBkZSBpbmZyYWNjaW9uZXMgYSBsYSBsZXkgMTU4MSBkZSAyMDEyLCBlbCB0aXR1bGFyIGRlIGxvcyBkYXRvcyBwb2Ryw6EgcHJlc2VudGFyIHF1ZWphIGFudGUgbGEgU3VwZXJpbnRlbmRlbmNpYSBkZSBJbmR1c3RyaWEgeSBDb21lcmNpbyAoU0lDKS48L3A+CjxwPkVsIHRpdHVsYXIgc2Vyw6EgaW5mb3JtYWRvIGFjZXJjYSBkZSBsYSBubyBvYmxpZ2F0b3JpZWRhZCBkZSBsYXMgcmVzcHVlc3RhcyBhIGxhcyBwcmVndW50YXMgcXVlIGxlIHNlYW4gaGVjaGFzLCBjdWFuZG8gw6lzdGFzIHZlcnNlbiBzb2JyZSBkYXRvcyBzZW5zaWJsZXMsIHRhbGVzIGNvbW8gb3JpZ2VuIHJhY2lhbCBvIMOpdG5pY28sIG9yaWVudGFjacOzbiBwb2zDrXRpY2EsIGNvbnZpY2Npb25lcyByZWxpZ2lvc2FzICwgcGVydGVuZW5jaWEgYSBzaW5kaWNhdG9zLCBvcmdhbml6YWNpb25lcyBzb2NpYWxlcyBkZSBkZXJlY2hvcyBodW1hbm9zLCBkYXRvcyByZWxhdGl2b3MgYSBsYSBzYWx1ZCwgYSBsYSB2aWRhIHNleHVhbCB5IGRhdG9zIGJpb23DqXRyaWNvcyBvIHNvYnJlIGxvcyBkYXRvcyBkZSBsb3MgbmnDsW9zLCBuacOxYXMgeSBhZG9sZXNjZW50ZXMuPC9wPgo8cD5FbCB0aXR1bGFyIHBvZHLDoSBjb25vY2VyIG51ZXN0cmEgcG9sw610aWNhIGRlIHRyYXRhbWllbnRvLCBsb3MgZGF0b3Mgc3VzdGFuY2lhbGVzIHF1ZSBzZSBsbGVndWVuIGEgcHJvZHVjaXIgZW4gZWwgcHJlc2VudGUgYXZpc28gbyBlbiBsYXMgcG9sw610aWNhcyBkZSB0cmF0YW1pZW50bywgc2Vyw6FuIHB1YmxpY2FkYXMgZW4gbnVlc3RybyBzaXRpbyB3ZWIsIG1lZGlvIGVsZWN0csOzbmljbyBoYWJpdHVhbCBkZSBjb250YWN0byBjb24gbG9zIHRpdHVsYXJlcy4K

Identificación de Personas por Medio de Reconocimiento de Voz

Publicaciones similares