Diseño e Implementación de una Plataforma de Análisis de Datos en el Sector Educativo

In the educational sector, tests of student performance are constantly being administered; these tests produce data that are not analyzed in depth. The goal of this project is to study and/or compare such data through a data analysis platform, using protocols to get...


Authors: Osorio Salcedo, Karen Paola

Resource type:

Publication date: 2016

Institution: Universidad del Norte

Repository: Repositorio Uninorte

Language: spa

id	REPOUNORT2_4ff3c93ed692daf81c94c760f6101137
oai_identifier_str	oai:manglar.uninorte.edu.co:10584/5860
network_acronym_str	REPOUNORT2
network_name_str	Repositorio Uninorte
repository_id_str
dc.title.es_ES.fl_str_mv	Diseño e Implementación de una Plataforma de Análisis de Datos en el Sector Educativo
dc.title.en_US.fl_str_mv	Design and Implementation of a Data Analysis Platform in the Educational Sector
title	Diseño e Implementación de una Plataforma de Análisis de Datos en el Sector Educativo
spellingShingle	Diseño e Implementación de una Plataforma de Análisis de Datos en el Sector Educativo BigData A BigData data
title_short	Diseño e Implementación de una Plataforma de Análisis de Datos en el Sector Educativo
title_full	Diseño e Implementación de una Plataforma de Análisis de Datos en el Sector Educativo
title_fullStr	Diseño e Implementación de una Plataforma de Análisis de Datos en el Sector Educativo
title_full_unstemmed	Diseño e Implementación de una Plataforma de Análisis de Datos en el Sector Educativo
title_sort	Diseño e Implementación de una Plataforma de Análisis de Datos en el Sector Educativo
dc.creator.fl_str_mv	Osorio Salcedo, Karen Paola
dc.contributor.advisor.none.fl_str_mv	Jimeno, Miguel Wightman, Pedro Salazar, Augusto
dc.contributor.author.none.fl_str_mv	Osorio Salcedo, Karen Paola
dc.subject.es_ES.fl_str_mv	BigData A BigData
topic	BigData A BigData data
dc.subject.en_US.fl_str_mv	data
description	In the educational sector, tests of student performance are constantly being administered; these tests produce data that are not analyzed in depth. The goal of this project is to study and/or compare such data through a data analysis platform, using protocols to build a sufficient basis and obtain resources for decision-making. The tools used for development are the Apache Spark platform and the alternative software packages RapidMiner, Weka, and R. The solution consists of implementing a framework called Apache Spark to configure and develop a strategic environment for analyzing data. To achieve its goal, the project was divided into two phases. The first phase was based on the hardware and software design of a data analysis platform. The second phase was based on the design and implementation of a data architecture for the platform. When designing and implementing a hardware and software infrastructure to support a data analysis platform, the first tests were performed using virtual machines. The best environments in which the Apache Spark platform could be installed were VMware and no virtualization. The other options either could not support the large amount of information to be processed or were limited by the computer's capacity. The Apache Spark platform was compared with common data mining applications, and it outperformed them in time and resource usage. Analyzing any type of data yields a global or specific sample of estimates that contribute to decision-making. Experimenting with these new technologies and comparing them with common technologies shows how efficient and optimal the results from a data sample can be for finding similarities in the data.
publishDate	2016
dc.date.accessioned.none.fl_str_mv	2016-11-25T22:59:12Z
dc.date.available.none.fl_str_mv	2016-11-25T22:59:12Z
dc.date.issued.none.fl_str_mv	2016-11-25
dc.type.es_ES.fl_str_mv	article
dc.type.coar.fl_str_mv	http://purl.org/coar/resource_type/c_6501
dc.identifier.uri.none.fl_str_mv	http://hdl.handle.net/10584/5860
url	http://hdl.handle.net/10584/5860
dc.language.iso.es_ES.fl_str_mv	spa
language	spa
dc.rights.es_ES.fl_str_mv	Universidad del Norte
dc.rights.coar.fl_str_mv	http://purl.org/coar/access_right/c_abf2
rights_invalid_str_mv	Universidad del Norte http://purl.org/coar/access_right/c_abf2
dc.publisher.es_ES.fl_str_mv	Barranquilla, Universidad del Norte, 2016.
institution	Universidad del Norte
bitstream.url.fl_str_mv	http://172.16.14.36:8080/bitstream/10584/5860/2/license.txt
bitstream.checksum.fl_str_mv	8a4605be74aa9ea9d79846c1fba20a33
bitstream.checksumAlgorithm.fl_str_mv	MD5
repository.name.fl_str_mv	Repositorio Digital de la Universidad del Norte
repository.mail.fl_str_mv	mauribe@uninorte.edu.co
_version_	1849968551167787008
spelling	Jimeno, Miguel Wightman, Pedro Salazar, Augusto Osorio Salcedo, Karen Paola 2016-11-25T22:59:12Z 2016-11-25T22:59:12Z 2016-11-25 http://hdl.handle.net/10584/5860 In the educational sector, tests of student performance are constantly being administered; these tests produce data that are not analyzed in depth. The goal of this project is to study and/or compare such data through a data analysis platform, using protocols to build a sufficient basis and obtain resources for decision-making. The tools used for development are the Apache Spark platform and the alternative software packages RapidMiner, Weka, and R. The solution consists of implementing a framework called Apache Spark to configure and develop a strategic environment for analyzing data. To achieve its goal, the project was divided into two phases. The first phase was based on the hardware and software design of a data analysis platform. The second phase was based on the design and implementation of a data architecture for the platform. When designing and implementing a hardware and software infrastructure to support a data analysis platform, the first tests were performed using virtual machines. The best environments in which the Apache Spark platform could be installed were VMware and no virtualization. The other options either could not support the large amount of information to be processed or were limited by the computer's capacity. The Apache Spark platform was compared with common data mining applications, and it outperformed them in time and resource usage. Analyzing any type of data yields a global or specific sample of estimates that contribute to decision-making. Experimenting with these new technologies and comparing them with common technologies shows how efficient and optimal the results from a data sample can be for finding similarities in the data. En el sector educativo, constantemente se están aplicando pruebas a estudiantes para evaluar su desempeño académico; estas pruebas brindan una gran cantidad de información, la cual no es analizada a profundidad. Este proyecto busca estudiar y/o comparar datos por medio de una plataforma para el análisis de datos del sector educativo. Entre las herramientas utilizadas para el desarrollo se encuentran: Plataforma Apache Spark y Software alternos: Rapidminer, Weka, R. La solución propuesta consiste en la implementación de un framework llamado Apache Spark, para la configuración y el desarrollo de un ambiente estratégico para analizar datos. Para cumplir con los objetivos de este proyecto, se dividió en dos fases: la primera fase, se basó en el diseño hardware y software de una plataforma de análisis de datos; la segunda fase, se basó en el diseño e implementación de una arquitectura de datos para la plataforma. A la hora de diseñar e implementar una infraestructura de hardware y software que dé soporte a una plataforma de análisis de datos, se tomaron las primeras pruebas utilizando varios métodos, con y sin virtualización. Los mejores entornos en los que se podía instalar la plataforma de Apache Spark fueron VmWare y sin virtualización. Las demás opciones no soportaban la gran cantidad de información con la que se iba a trabajar, o simplemente por los recursos del computador no eran los más eficientes. Se comparó la plataforma de Apache Spark con aplicaciones comunes para la minería de datos, donde Apache Spark superó en uso de tiempo y recursos a las demás aplicaciones. La ejecución del análisis de cualquier tipo de datos nos permite obtener una muestra global o específica de estimados que aportan a la toma de una decisión. Experimentar con estas nuevas tecnologías y compararlas a las tecnologías comunes muestra cuán eficientes y óptimos pueden ser los resultados de una muestra de datos para encontrar relaciones en los mismos. spa Barranquilla, Universidad del Norte, 2016. Universidad del Norte http://purl.org/coar/access_right/c_abf2 BigData A BigData data Diseño e Implementación de una Plataforma de Análisis de Datos en el Sector Educativo Design and Implementation of a Data Analysis Platform in the Educational Sector article http://purl.org/coar/resource_type/c_6501 LICENSE license.txt license.txt text/plain; charset=utf-8 1748 http://172.16.14.36:8080/bitstream/10584/5860/2/license.txt 8a4605be74aa9ea9d79846c1fba20a33 MD5 2 10584/5860 oai:172.16.14.36:10584/5860 2017-05-22 12:04:50.784 Repositorio Digital de la Universidad del Norte mauribe@uninorte.edu.co

