Modelo para la predicción de la deserción de estudiantes de pregrado, basado en técnicas de minería de datos

The main objective of this research project is to create a model for the prediction of undergraduate student desertion at the Universidad de la Costa - CUC, based on the analysis of different socioeconomic and academic factors. The study required the execution of a series of phases: characterization...

Full description

Autores:
Camargo García, Aníbal José
Tipo de recurso:
Fecha de publicación:
2020
Institución:
Corporación Universidad de la Costa
Repositorio:
REDICUC - Repositorio CUC
Idioma:
spa
OAI Identifier:
oai:repositorio.cuc.edu.co:11323/7077
Acceso en línea:
https://hdl.handle.net/11323/7077
https://repositorio.cuc.edu.co/
Palabra clave:
Higher education
Dropout
Data mining
Decision tree
Classification
Prediction
Educación superior
Deserción
Minería de datos
Árboles de decisión
Clasificación
Predicción
Rights
openAccess
License
Attribution-NonCommercial-ShareAlike 4.0 International
id RCUC2_07778aacc95d763518148a481476953b
oai_identifier_str oai:repositorio.cuc.edu.co:11323/7077
network_acronym_str RCUC2
network_name_str REDICUC - Repositorio CUC
repository_id_str
dc.title.spa.fl_str_mv Modelo para la predicción de la deserción de estudiantes de pregrado, basado en técnicas de minería de datos
title Modelo para la predicción de la deserción de estudiantes de pregrado, basado en técnicas de minería de datos
spellingShingle Modelo para la predicción de la deserción de estudiantes de pregrado, basado en técnicas de minería de datos
Higher education
Dropout
Data mining
Decision tree
Classification
Prediction
Educación superior
Deserción
Minería de datos
Árboles de decisión
Clasificación
Predicción
title_short Modelo para la predicción de la deserción de estudiantes de pregrado, basado en técnicas de minería de datos
title_full Modelo para la predicción de la deserción de estudiantes de pregrado, basado en técnicas de minería de datos
title_fullStr Modelo para la predicción de la deserción de estudiantes de pregrado, basado en técnicas de minería de datos
title_full_unstemmed Modelo para la predicción de la deserción de estudiantes de pregrado, basado en técnicas de minería de datos
title_sort Modelo para la predicción de la deserción de estudiantes de pregrado, basado en técnicas de minería de datos
dc.creator.fl_str_mv Camargo García, Aníbal José
dc.contributor.advisor.spa.fl_str_mv De la hoz Franco, Emiro
Mendoza Palechor, Fabio
dc.contributor.author.spa.fl_str_mv Camargo García, Aníbal José
dc.subject.spa.fl_str_mv Higher education
Dropout
Data mining
Decision tree
Classification
Prediction
Educación superior
Deserción
Minería de datos
Árboles de decisión
Clasificación
Predicción
topic Higher education
Dropout
Data mining
Decision tree
Classification
Prediction
Educación superior
Deserción
Minería de datos
Árboles de decisión
Clasificación
Predicción
description The main objective of this research project is to create a model for the prediction of undergraduate student desertion at the Universidad de la Costa - CUC, based on the analysis of different socioeconomic and academic factors. The study required the execution of a series of phases: characterization, experimentation, development and evaluation. During the characterization phase, a dataset was constructed, based on the compilation of demographic, cultural, social, family, educational, socioeconomic status and psychological profile data of each student, for the periods between 2013-1 and 2018-2. Such information was collected from the registration forms that students fill out when they enter the institution, a total of 1,606 unique student records were collected. During the experimental phase, different machine learning techniques were evaluated for the categories: Bayesian networks, support vector machines, and decision trees. The algorithm with which the best hit rate was obtained was Random forest (from the decision tree category), with an accuracy of 84.8%. In the development phase, the model was integrated into an application that allows us to predict whether a student or a group of students will drop out or not. Finally, in the evaluation phase, the application was subjected to different types of tests to evaluate both the functionality of the graphic interface with the final user and the success rate in terms of desertion prediction, the results have coincided with the precision obtained in the experimental phase.
publishDate 2020
dc.date.accessioned.none.fl_str_mv 2020-09-08T23:29:10Z
dc.date.available.none.fl_str_mv 2020-09-08T23:29:10Z
dc.date.issued.none.fl_str_mv 2020
dc.type.spa.fl_str_mv Trabajo de grado - Maestría
dc.type.content.spa.fl_str_mv Text
dc.type.driver.spa.fl_str_mv info:eu-repo/semantics/masterThesis
dc.type.redcol.spa.fl_str_mv http://purl.org/redcol/resource_type/TM
dc.type.version.spa.fl_str_mv info:eu-repo/semantics/acceptedVersion
status_str acceptedVersion
dc.identifier.uri.spa.fl_str_mv https://hdl.handle.net/11323/7077
dc.identifier.instname.spa.fl_str_mv Corporación Universidad de la Costa
dc.identifier.reponame.spa.fl_str_mv REDICUC - Repositorio CUC
dc.identifier.repourl.spa.fl_str_mv https://repositorio.cuc.edu.co/
url https://hdl.handle.net/11323/7077
https://repositorio.cuc.edu.co/
identifier_str_mv Corporación Universidad de la Costa
REDICUC - Repositorio CUC
dc.language.iso.none.fl_str_mv spa
language spa
dc.relation.references.spa.fl_str_mv Aboubakar, M., Kellil, M., Bouabdallah, A., & Roux, P. (2019). Toward Intelligent Reconfiguration of RPL Networks using Supervised Learning. IFIP Wireless Days, 2019- April, 1–4. https://doi.org/10.1109/WD.2019.8734236
Aggarwal, C. (2015). Data Mining: The Textbook. Springer International. https://doi.org/10.1007/978-3-319-14142-8 ISBN
Ahuja, R., & Kankane, Y. (2017). Predicting the probability of student’s degree completion by using different data mining techniques. 2017 Fourth International Conference on Image Information Processing (ICIIP), 1–4. https://doi.org/10.1109/ICIIP.2017.8313763
Alkhasawneh, R., & Hobson, R. (2011). Modeling student retention in science and engineering disciplines using neural networks. 2011 IEEE Global Engineering Education Conference, EDUCON 2011, 660–663. https://doi.org/10.1109/EDUCON.2011.5773209
Asif, R., Merceron, A., Ali, S. A., & Haider, N. G. (2017). Analyzing undergraduate students’ performance using educational data mining. Computers & Education, 113, 177–194. https://doi.org/10.1016/j.compedu.2017.05.007
Askinadze, A., & Conrad, S. (2017). Application of the Dynamic Time Warping Distance for the Student Drop-out Prediction on Time Series Data. Proceedings of the 10th International Conference on Educational Data Mining, 342–343.
Azevedo, A., & Santos, M. F. (2008). KDD, semma and CRISP-DM: A parallel overview. MCCSIS’08 - IADIS Multi Conference on Computer Science and Information Systems; Proceedings of Informatics 2008 and Data Mining 2008, June, 182–185.
Aziz, A. A., Ismail, N. H., Ahmad, F., & Hassan, H. (2015). A framework for students’ academic performance analysis using naïve bayes classifier. Jurnal Teknologi, 75(3), 13–19. https://doi.org/10.11113/jt.v75.5037
Barbosa Manhães, L. M., Serra da Cruz, S. M., & Zimbrão, G. (2014). WAVE: an Architecture for Predicting Dropout in Undergraduate Courses using EDM. Proceeding SAC ’14 Proceedings of the 29th Annual ACM Symposium on Applied Computing, 243–247. https://doi.org/10.1145/2554850.2555135
Barker, K., Trafalis, T., & Reed Rhoads, T. (2004). LEARNING FROM STUDENT DATA. Proceedings of the 2004 Systems and Information Engineering Design Symposium Matthew. https://doi.org/10.1109/SIEDS.2004.239819
Barnes, T., Desmarais, M., Romero, C., & Ventura, S. (2009). EDM’09 - Educational Data Mining 2009: 2nd International Conference on Educational Data Mining. In EDM’09 - Educational Data Mining 2009: 2nd International Conference on Educational Data Mining.
Bayer, J., Bydzovská, H., Géryk, J., Obsivac, T., & Popelínský, L. (2012). Predicting drop-out from social behaviour of students. Proceedings of the 5th International Conference on Educational Data Mining, Dm, 103–109.
Beaulac, C., & Rosenthal, J. S. (2019). Predicting University Students ’ Academic Success and Choice of Major using Random Forests.
Beltran, B. (2016). MINERÍA DE DATOS (Vol. 30, Issue 1). https://doi.org/10.1016/0032- 0633(82)90071-X
Betancourt, G. A. (2005). LAS MÁQUINAS DE SOPORTE VECTORIAL (SVMs). Scientia Et Technica, XI(27), 67–72. https://doi.org/10.22517/23447214.6895
Birjali, M., Beni-hssane, A., & Erritali, M. (2018). Learning with Big Data Technology: The Future of Education. 565. https://doi.org/10.1007/978-3-319-60834-1
Boser, B. E., Guyon, I. M., & Vapnik, V. N. (1992). A training algorithm for optimal margin classifiers. Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, 144–152. https://doi.org/10.1145/130385.130401
Brown, M. S. (2014). Data Mining for Dummies. https://doi.org/10.1007/978-1-4614-7669-6
Burgueño, M. J., García-Bastos, J. L., & González-Buitrago, J. M. (1995). ROC curves in the evaluation of diagnostic tests. Medicina Clínica, 104(17), 661–670.
Cambruzzi, W., Rigo, S. J., & Barbosa, J. L. V. (2015). Dropout prediction and reduction in distance education courses with the learning analytics multitrail approach. Journal of Universal Computer Science, 21(1), 23–47.
Carmona Suárez, E. J. (2014). Máquinas de Vectores Soporte (SVM). Dpto. de Inteligencia Artificial, ETS de Ingeniería Inforática, Universidad Nacional de Educación a Distancia (UNED), 1–25. http://www.ia.uned.es/~ejcarmona/publicaciones/[2013-Carmona] SVM.pdf
Castaño, E., Gallón, S., Gómez, K., & Vásquez, J. (2004). Deserción estudiantil universitaria una aplicación de modelos de duración. Lecturas de Economia, 60(60), 39–65.
Chai, K. E. K., & Gibson, D. (2015). Predicting the risk of attrition for undergraduate students with time based modelling. Proceedings of the 12th International Conference on Cognition and Exploratory Learning in the Digital Age, CELDA 2015, Celda, 109–116.
Cheewaprakobkit, P. (2013). Study of factors analysis affecting academic achievement of undergraduate students in international program. Lecture Notes in Engineering and Computer Science, 2202, 332–336.
Christian, T. M., & Ayub, M. (2014). Exploration of classification using NBTree for predicting students’ performance. Proceedings of 2014 International Conference on Data and Software Engineering, ICODSE 2014, 1–6. https://doi.org/10.1109/ICODSE.2014.7062654 Cortes, C., & Vapnik, V. (1995). Support-Vector Networks. Mach. Learn. , 20 (3), 44(13), 273– 297. Predicting Students Drop Out A Case Study, (2009).
Delen, D. (2010). A comparative analysis of machine learning techniques for student retention management. Decision Support Systems, 49(4), 498–506. https://doi.org/10.1016/j.dss.2010.06.003
Delen, D. (2011). Predicting student attrition with data mining methods. Journal of College Student Retention: Research, Theory and Practice, 13(1), 17–35. https://doi.org/10.2190/CS.13.1.b
Devasia, T., Vinushree, T. P., & Hegde, V. (2016). Prediction of students performance using Educational Data Mining. Proceedings of 2016 International Conference on Data Mining and Advanced Computing, SAPIENCE 2016, 91–95. https://doi.org/10.1109/SAPIENCE.2016.7684167
Dharmawan, T., Ginardi, H., & Munif, A. (2018). Dropout Detection Using Non-Academic Data. Proceedings - 2018 4th International Conference on Science and Technology, ICST 2018, 1, 1–4. https://doi.org/10.1109/ICSTC.2018.8528619
Edwards, W., & Fasolo, B. (2001). Decision Technology. 581–606.
Fernandes, E., Holanda, M., Victorino, M., Borges, V., Carvalho, R., & Erven, G. Van. (2019). Educational data mining: Predictive analysis of academic performance of public school students in the capital of Brazil. Journal of Business Research, 94(August 2017), 335–343. https://doi.org/10.1016/j.jbusres.2018.02.012
Friedman, N., Geiger, D., & Goldszmidt, M. (1997). Bayesian Network Classifier. 131–163.
Gandhi, R. (2018). Support Vector Machine - Introduction to Machine Learning Algorithms. https://towardsdatascience.com/support-vector-machine-introduction-to-machine-learningalgorithms-934a444fca47
García, J. G., Puga, J. L., Cano Guillén, C. J., Gea, A. B., & de la Fuente Sánchez, L. (2006). Aplicación de las redes bayesianas al modelado de las actitudes emprendedoras. IV Congreso de Metodología de Encuestas, August 2015, 235–242.
Gulati, H. (2015). Predictive analytics using data mining technique. 2015 International Conference on Computing for Sustainable Global Development, INDIACom 2015, 713–716.
Güner, N., Yaldir, A., Gündüz, G., Çomak, E., Tokat, S., & Iplikçi, S. (2014). Predicting academically at-risk engineering students: A soft computing application. Acta Polytechnica Hungarica, 11(5), 199–216. https://doi.org/10.12700/aph.11.05.2014.05.12
Guzmán Ruiz, C., Muriel Durán, D., & Franco Gallego, J. (2009). Deserción estudiantil en la educación superior colombiana. Metodología de seguimiento, diagnóstico y elementos para su prevención. http://www.mineducacion.gov.co/sistemasdeinformacion/1735/articles- 254702_libro_desercion.pdf
Han, J., Kamber, M., & Pei, J. (2012). Data mining concepts and techniques. https://doi.org/10.1109/ICMIRA.2013.45
Hasbun, T., Araya, A., & Villalon, J. (2016). Extracurricular activities as dropout prediction factors in higher education using decision trees. Proceedings - IEEE 16th International Conference on Advanced Learning Technologies, ICALT 2016, 242–244. https://doi.org/10.1109/ICALT.2016.66
Heredia, D., Amaya, Y., & Barrientos, E. (2015). Student Dropout Predictive Model Using Data Mining Techniques. IEEE Latin America Transactions, 13(9), 3127–3134. https://doi.org/10.1109/TLA.2015.7350068
Hernandez Gonzalez, A. G., Melendez Armenta, R. A., Morales Rosales, L. A., Garcia Barrientos, A., Tecpanecatl Xihuitl, J. L., & Algredo, I. (2016). Comparative Study of Algorithms to Predict the Desertion in the Students at the ITSM-Mexico. IEEE Latin America Transactions, 14(11), 4573–4578. https://doi.org/10.1109/TLA.2016.7795831
Hoffait, A. S., & Schyns, M. (2017). Early detection of university students with potential difficulties. Decision Support Systems, 101, 1–11. https://doi.org/10.1016/j.dss.2017.05.003
Huber, S., Wiemer, H., Schneider, D., & Ihlenfeldt, S. (2019). DMME: Data mining methodology for engineering applications - A holistic extension to the CRISP-DM model. Procedia CIRP, 79, 403–408. https://doi.org/10.1016/j.procir.2019.02.106
Jin, Q., Imbrie, P. K., Lin, J. J. J., & Chen, X. (2011). A multi-outcome hybrid model for predicting student success in engineering. ASEE Annual Conference and Exposition, Conference Proceedings.
Kabakchieva, D., Stefanova, K., & Kisimov, V. (2011). Analyzing university data for determining student profiles and predicting performance. EDM 2011 - Proceedings of the 4th International Conference on Educational Data Mining.
Kalles, D., & Pierrakeas, C. (2006a). Analyzing student performance in distance learning with genetic algorithms and decision trees. Applied Artificial Intelligence, 20(8), 655–674. https://doi.org/10.1080/08839510600844946
Kalles, D., & Pierrakeas, C. (2006b). Using genetic algorithms and decision trees for a posteriori analysis and evaluation of tutoring practices based on student failure models. IFIP International Federation for Information Processing, 204(August), 9–18. https://doi.org/10.1007/0-387-34224-9_2
Kingsford, C., & Salzberg, S. L. (2008). What are decision trees. Nat Biotechnol, 23(1), 1–7. https://doi.org/10.1038/nbt0908-1011
Kotsiantis, S. B., & Pintelas, P. E. (2005). Predicting students’ marks in Hellenic Open University. Proceedings - 5th IEEE International Conference on Advanced Learning Technologies, ICALT 2005, 2005, 664–668. https://doi.org/10.1109/ICALT.2005.223
Kotsiantis, S., Pierrakeas, C., & Pintelas, P. (2004). Predicting students’ performance in distance learning using machine learning techniques. Applied Artificial Intelligence, 18(5), 411–426. https://doi.org/10.1080/08839510490442058
Krishna Kishore, K. V., Venkatramaphanikumar, S., & Alekhya, S. (2014). Prediction of student academic progression: A case study on Vignan University. 2014 International Conference on Computer Communication and Informatics: Ushering in Technologies of Tomorrow, Today, ICCCI 2014, 2, 1–6. https://doi.org/10.1109/ICCCI.2014.6921731
Kumar Baradwaj, B., & Pal, S. (2011). Mining Educational Data to Analyze Students’ Performance. JACSA) International Journal of Advanced Computer Science and Applications, 02.
Lee, S., & Chung, J. Y. (2019). The Machine Learning-Based Dropout Early Warning System for Improving the Performance of Dropout Prediction. Applied Sciences, 9(15), 3093. https://doi.org/10.3390/app9153093
Lesinski, G., Corns, S., & Dagli, C. (2016). Application of an Artificial Neural Network to Predict Graduation Success at the United States Military Academy. Procedia Computer Science, 95, 375–382. https://doi.org/10.1016/j.procs.2016.09.348
López de Ullibarri, G. I., & Píta Fernández, S. (1998). Curvas ROC. Cad Aten Primaria, 5(4), 229–235.
Lykourentzou, I., Giannoukos, I., Nikolopoulos, V., Mpardis, G., & Loumos, V. (2009). Dropout prediction in e-learning courses through the combination of machine learning techniques. Computers and Education, 53(3), 950–965. https://doi.org/10.1016/j.compedu.2009.05.010
Manhães, L. M. B., Da Cruz, S. M. S., & Zimbrão, G. (2014). The impact of high dropout rates in a large public brazilian university a quantitative approach using educational data mining. CSEDU 2014 - Proceedings of the 6th International Conference on Computer Supported Education, 3, 124–129. https://doi.org/10.5220/0004958601240129
Márquez-Vera, C., Cano, A., Romero, C., Noaman, A. Y. M., Mousa Fardoun, H., & Ventura, S. (2016). Early dropout prediction using data mining: A case study with high school students. Expert Systems, 33(1), 107–124. https://doi.org/10.1111/exsy.12135
Márquez-Vera, C., Romero Morales, C., & Ventura Soto, S. (2013). Predicting school failure and dropout by using data mining techniques. Revista Iberoamericana de Tecnologias Del Aprendizaje, 8(1), 7–14. https://doi.org/10.1109/RITA.2013.2244695
Mayilvaganan, M., & Kalpanadevi, D. (2015). Comparison of classification techniques for predicting the performance of students academic environment. 2014 International Conference on Communication and Network Technologies, ICCNT 2014, 2015-March, 113–118. https://doi.org/10.1109/CNT.2014.7062736
Ministerio de Educación de Colombia. (2006). La Revolución Educativa 2002 – 2006. Media, 1– 6.
Ministerio de Educación de Colombia. (2019). Qué es el SPADIES. https://www.mineducacion.gov.co/sistemasinfo/spadies/Informacion- Institucional/254648:Que-es-el-SPADIES
Miranda, M. A., & Guzmán, J. (2017). Análisis de la deserción de estudiantes universitarios usando técnicas de minería de datos. Formacion Universitaria, 10(3), 61–68. https://doi.org/10.4067/S0718-50062017000300007
Mishra, A. (2018). Metrics to Evaluate your Machine Learning Algorithm. https://towardsdatascience.com/metrics-to-evaluate-your-machine-learning-algorithmf10ba6e38234
Mishra, T., Kumar, D., & Gupta, S. (2014). Mining students’ data for prediction performance. International Conference on Advanced Computing and Communication Technologies, ACCT, 255–262. https://doi.org/10.1109/ACCT.2014.105
Mitchell, T. M. (1997). Machine Learning. In McGraw-Hill Science/Engineering/Math.
Moseley, L. G., & Mead, D. M. (2008). Predicting who will drop out of nursing courses: A machine learning exercise. Nurse Education Today, 28(4), 469–475. https://doi.org/10.1016/j.nedt.2007.07.012
Mustafa, M. N., Chowdhury, L., & Kamal, M. S. (2012). Students dropout prediction for intelligent system from tertiary level in developing country. 2012 International Conference on Informatics, Electronics and Vision, ICIEV 2012, 113–118. https://doi.org/10.1109/ICIEV.2012.6317441
Oskouei, R. J., & Askari, M. (2014). Predicting Academic Performance with Applying Data Mining Techniques (Generalizing the results of two Different Case Studies). Computer Engineering and Applications Journal, 3(2), 79–88. https://doi.org/10.18495/comengapp.v3i2.81
Osmanbegovi, E. (2012). Data Mining Approach for Predicting Student Performance. Economic Review : Journal of Economics and Business, X(1), 3–12.
Peralta, B., Poblete, T., & Caro, L. (2017). Automatic feature selection for desertion and graduation prediction: A chilean case. Proceedings - International Conference of the Chilean Computer Science Society, SCCC. https://doi.org/10.1109/SCCC.2016.7836055
Pereira, R. T., Romero, A. C., & Toledo, J. J. (2013). Extraction student dropout patterns with data mining techniques in undergraduate programs. IC3K 2013; KDIR 2013 - 5th International Conference on Knowledge Discovery and Information Retrieval and KMIS 2013 - 5th International Conference on Knowledge Management and Information Sharing, Proc., 136–142. https://doi.org/10.5220/0004543001360142
Pérez, A., Grandón, E. E., Caniupán, M., & Vargas, G. (2019). Comparative Analysis of Prediction Techniques to Determine Student Dropout: Logistic Regression vs Decision Trees. Proceedings - International Conference of the Chilean Computer Science Society, SCCC, 2018-Novem. https://doi.org/10.1109/SCCC.2018.8705262
Perez, B., Castellanos, C., & Correal, D. (2018). Applying Data Mining Techniques to Predict Student Dropout: A Case Study. 2018 IEEE 1st Colombian Conference on Applications in Computational Intelligence, ColCACI 2018 - Proceedings, 1–6. https://doi.org/10.1109/ColCACI.2018.8484847
Perez, M. (2014). Minería de datos a treves de ejemplos. 22. http://www.rclibros.es/pdf/capitulo_mineria.pdf
Picard, R. W., Papert, S., Bender, W., Blumberg, B., Breazeal, C., Cavallo, D., Machover, T., Resnick, M., Roy, D., & Strohecker, C. (2004). Affective learning - a manifesto. BT Technology Journal, 22(4), 253–269. https://doi.org/10.1023/B:BTTJ.0000047603.37042.33
Pradeep, A., Das, S., & Kizhekkethottam, J. J. (2015). Students dropout factor prediction using EDM techniques. Proceedings of the IEEE International Conference on Soft-Computing and Network Security, ICSNS 2015, 1–7. https://doi.org/10.1109/ICSNS.2015.7292372
Quadri, M., & Kalyankar, D. (2010). Drop out feature of student data for academic performance using decision tree techniques. Global Journal of Computer, 10(2), 2–5. http://computerresearch.org/stpr/index.php/gjcst/article/viewArticle/128
Quinlan, J. R. (1986). Induction of decision trees. Machine Learning, 1(1), 81–106. https://doi.org/10.1007/bf00116251
Recuero, P. (2018). Machine Learning a tu alcance: La matriz de confusión. https://empresas.blogthinkbig.com/ml-a-tu-alcance-matriz-confusion/
Salazar, A., Gosálbez, J., Bosch, I., Miralles, R., & Vergara, L. (2004). A case study of knowledge discovery on academic achievement, student desertion and student retention. ITRE 2004 - 2nd International Conference on Information Technology: Research and Education - Proceedings, January, 150–154. https://doi.org/10.1109/itre.2004.1393665
Sangodiah, A., Beleya, P., Muniandy, M., Heng, L. E., & Ramendran Spr, C. (2015). Minimizing student attrition in higher learning institutions in Malaysia using support vector machine. Journal of Theoretical and Applied Information Technology, 71(3), 377–385.
Santana, M. A., Costa, E. B., Neto, B. F. S., Silva, I. C. L., & Rego, J. B. A. (2015). A predictive model for identifying students with dropout profiles in online courses. CEUR Workshop Proceedings, 1446.
Şara, N. B., Halland, R., Igel, C., & Alstrup, S. (2015). High-school dropout prediction using machine learning: A Danish large-scale study. 23rd European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, ESANN 2015 - Proceedings, April, 319–324.
Saravanan, R., & Sujatha, P. (2018). Algorithms : A Perspective of Supervised Learning Approaches in Data Classification. 2018 Second International Conference on Intelligent Computing and Control Systems (ICICCS), Iciccs, 945–949.
Sarker, F., Tiropanis, T., & Davis, H. C. (2014). Linked data, data mining and external open data for better prediction of at-risk students. Proceedings - 2014 International Conference on Control, Decision and Information Technologies, CoDIT 2014, 652–657. https://doi.org/10.1109/CoDIT.2014.6996973
Segura-Morales, M., & Loza-Aguirre, E. (2018). Using Decision Trees for Predicting Academic Performance Based on Socio-Economic Factors. Proceedings - 2017 International Conference on Computational Science and Computational Intelligence, CSCI 2017, 1132– 1136. https://doi.org/10.1109/CSCI.2017.197
Shahiri, A. M., Husain, W., & Rashid, N. A. (2015). A Review on Predicting Student’s Performance Using Data Mining Techniques. Procedia Computer Science, 72(February 2016), 414–422. https://doi.org/10.1016/j.procs.2015.12.157
Sharabiani, A., Karim, F., Sharabiani, A., Atanasov, M., & Darabi, H. (2014). An enhanced bayesian network model for prediction of students’ academic performance in engineering programs. IEEE Global Engineering Education Conference, EDUCON, April, 832–837. https://doi.org/10.1109/EDUCON.2014.6826192
Siri, A. (2015). Predicting Students’ Dropout at University Using Artificial Neural Networks. Italian Journal of Sociology of Education, 7(2), 225–247.
Solis, M., Moreira, T., Gonzalez, R., Fernandez, T., & Hernandez, M. (2018). Perspectives to Predict Dropout in University Students with Machine Learning. 2018 IEEE International Work Conference on Bioinspired Intelligence, IWOBI 2018 - Proceedings. https://doi.org/10.1109/IWOBI.2018.8464191
Tair, M. M. A. (2015). Mining Educational Data to Improve Students ’ Performance : A Case Study Mining Educational Data t o Improve Students ’ Performance : A Case Study. October.
Thomas, E. H., & Galambos, N. (2004). What satisfies students? Mining student-opinion data with regression and decision tree analysis. Research in Higher Education, 45(3), 251–269. Timarán Pereira, S. R., Hernández Arteaga, I., Caicedo Zambrano, S. J., Hidalgo Troya, A., & Alvarado
Pérez, J. C. (2016). Descubrimiento de patrones de desempeño académico con árboles de decisión en las competencias genéricas de la formación profesional. Descubrimiento de Patrones de Desempeño Académico Con Árboles de Decisión En Las Competencias Genéricas de La Formación Profesional, 2016, 63–86. https://doi.org/10.16925/9789587600490
Tsai, C. F., Tsai, C. T., Hung, C. S., & Hwang, P. Sen. (2011). Data mining techniques for identifying students at risk of failing a computer proficiency test required for graduation. Australasian Journal of Educational Technology, 27(3), 481–498. https://doi.org/10.14742/ajet.956
Universidad Pedagógica y Tecnológica de Colombia. (2004). Unidad 1 Estadistica Descriptiva. https://virtual.uptc.edu.co/ova/estadistica/docs/libros/h_men_prob_est/lecciones_html/un1/1 _8_3.html
Veitch, W. R. (2004). Identifying Characteristics of High School Dropouts: Data Mining with A Decision Tree Model. Online Submission, 1–11.
Wirth, R. (2000). CRISP-DM : Towards a Standard Process Model for Data Mining. Proceedings of the Fourth International Conference on the Practical Application of Knowledge Discovery and Data Mining, 24959, 29–39. https://doi.org/10.1.1.198.5133
Yehuala, M. A. (2015). Application Of Data Mining Techniques For Student Success And Failure Prediction The Case Of DebreMarkos University. International Journal of Scientific & Technology Research, 4(4), 91–94.
Zaki, M., & Meira, W. J. (2013). Data Mining and Analysis: Fundamental Concepts and Algorithms. https://doi.org/10.1145/3054925
Zeng, W., Chin, S.-C., Zeimet, B., Kuang, R., & Chi, C.-L. (2017). Dropout Prediction in Home Care Training. Proceedings of the 10th International Conference on Educational Data Mining, 442–447.
Zhang, Y., & Oussena, S. (2010). USE DATA MINING TO IMPROVE STUDENT RETENTION IN HIGHER EDUCATION – A CASE STUDY. Middlesex University Research Repository.
dc.rights.spa.fl_str_mv Attribution-NonCommercial-ShareAlike 4.0 International
dc.rights.uri.spa.fl_str_mv http://creativecommons.org/licenses/by-nc-sa/4.0/
dc.rights.accessrights.spa.fl_str_mv info:eu-repo/semantics/openAccess
dc.rights.coar.spa.fl_str_mv http://purl.org/coar/access_right/c_abf2
rights_invalid_str_mv Attribution-NonCommercial-ShareAlike 4.0 International
http://creativecommons.org/licenses/by-nc-sa/4.0/
http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv openAccess
dc.publisher.spa.fl_str_mv Corporación Universidad de la Costa
dc.publisher.program.spa.fl_str_mv Maestría en Ingeniería
institution Corporación Universidad de la Costa
bitstream.url.fl_str_mv https://repositorio.cuc.edu.co/bitstreams/837274c4-814e-440b-9668-ad287c3b15c5/download
https://repositorio.cuc.edu.co/bitstreams/efa03807-b78b-4671-9df5-ba26d7a0e4e4/download
https://repositorio.cuc.edu.co/bitstreams/df3c7a58-9b94-40c0-b9e0-750669def4e3/download
https://repositorio.cuc.edu.co/bitstreams/6a1ba468-2cce-486c-9ec5-76081059f2c2/download
https://repositorio.cuc.edu.co/bitstreams/d8c1cf3a-3005-4c49-b1dd-76ee57dd4a81/download
https://repositorio.cuc.edu.co/bitstreams/b303bf15-1e1b-4c23-b514-5a09cc7715b8/download
bitstream.checksum.fl_str_mv 3fa8da0438279c3839c5d31a1d2864ed
934f4ca17e109e0a05eaeaba504d7ce4
e30e9215131d99561d40d6b0abbe9bad
631709ed4e9bcd5e097b8300de6b4a43
631709ed4e9bcd5e097b8300de6b4a43
883c081178af3b7ca2843441b5a3c256
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
MD5
MD5
repository.name.fl_str_mv Repositorio de la Universidad de la Costa CUC
repository.mail.fl_str_mv repdigital@cuc.edu.co
_version_ 1828166904099897344
spelling De la hoz Franco, EmiroMendoza Palechor, FabioCamargo García, Aníbal José2020-09-08T23:29:10Z2020-09-08T23:29:10Z2020https://hdl.handle.net/11323/7077Corporación Universidad de la CostaREDICUC - Repositorio CUChttps://repositorio.cuc.edu.co/The main objective of this research project is to create a model for the prediction of undergraduate student desertion at the Universidad de la Costa - CUC, based on the analysis of different socioeconomic and academic factors. The study required the execution of a series of phases: characterization, experimentation, development and evaluation. During the characterization phase, a dataset was constructed, based on the compilation of demographic, cultural, social, family, educational, socioeconomic status and psychological profile data of each student, for the periods between 2013-1 and 2018-2. Such information was collected from the registration forms that students fill out when they enter the institution, a total of 1,606 unique student records were collected. During the experimental phase, different machine learning techniques were evaluated for the categories: Bayesian networks, support vector machines, and decision trees. The algorithm with which the best hit rate was obtained was Random forest (from the decision tree category), with an accuracy of 84.8%. In the development phase, the model was integrated into an application that allows us to predict whether a student or a group of students will drop out or not. Finally, in the evaluation phase, the application was subjected to different types of tests to evaluate both the functionality of the graphic interface with the final user and the success rate in terms of desertion prediction, the results have coincided with the precision obtained in the experimental phase.El objetivo principal de este proyecto de investigación es crear un modelo para la predicción de la deserción de estudiantes de pregrado en la Universidad de la Costa - CUC, a partir del análisis de diferentes factores socioeconómicos y académicos. El estudio requirió de la ejecución de una serie de fases: caracterización, experimentación, desarrollo y evaluación. Durante la fase de caracterización se construyó un conjunto de datos (dataset), a partir de la compilación de los datos demográficos, culturales, sociales, familiares, educativos, estatus socioeconómico y perfil psicológico de cada estudiante, de los periodos comprendidos entre 2013-1 y 2018-2. Tal información fue recopilada a partir de los formatos de inscripción que diligencian los estudiantes cuando ingresan a la institución, un total de 1.606 registros únicos de estudiantes fueron recopilados. Durante la fase de experimentación se evaluaron distintas técnicas de aprendizaje automático (Machine Learning) de las categorías: redes bayesianas, máquinas de soporte vectorial y árboles de decisiones. El algoritmo con el cual se obtuvo la mejor tasa de aciertos fue Random forest (de la categoría árboles de decisión), con una exactitud del 84.8%. En la fase de desarrollo se integró el modelo a una aplicación que permite predecir si un estudiante o un grupo de ellos desertará o no. Por último, en la fase de evaluación se sometió la aplicación a diferentes tipos de pruebas para valorar tanto la funcionalidad de la interface gráfica con el usuario final como la tasa de aciertos en cuanto a la predicción de la deserción, los resultados han coincidido con la precisión obtenida en la fase experimental.Camargo García, Aníbal JoséspaCorporación Universidad de la CostaMaestría en IngenieríaAttribution-NonCommercial-ShareAlike 4.0 Internationalhttp://creativecommons.org/licenses/by-nc-sa/4.0/info:eu-repo/semantics/openAccesshttp://purl.org/coar/access_right/c_abf2Higher educationDropoutData miningDecision treeClassificationPredictionEducación superiorDeserciónMinería de datosÁrboles de decisiónClasificaciónPredicciónModelo para la predicción de la deserción de estudiantes de pregrado, basado en técnicas de minería de datosTrabajo de grado - MaestríaTextinfo:eu-repo/semantics/masterThesishttp://purl.org/redcol/resource_type/TMinfo:eu-repo/semantics/acceptedVersionAboubakar, M., Kellil, M., Bouabdallah, A., & Roux, P. (2019). Toward Intelligent Reconfiguration of RPL Networks using Supervised Learning. IFIP Wireless Days, 2019- April, 1–4. https://doi.org/10.1109/WD.2019.8734236Aggarwal, C. (2015). Data Mining: The Textbook. Springer International. https://doi.org/10.1007/978-3-319-14142-8 ISBNAhuja, R., & Kankane, Y. (2017). Predicting the probability of student’s degree completion by using different data mining techniques. 2017 Fourth International Conference on Image Information Processing (ICIIP), 1–4. https://doi.org/10.1109/ICIIP.2017.8313763Alkhasawneh, R., & Hobson, R. (2011). Modeling student retention in science and engineering disciplines using neural networks. 2011 IEEE Global Engineering Education Conference, EDUCON 2011, 660–663. https://doi.org/10.1109/EDUCON.2011.5773209Asif, R., Merceron, A., Ali, S. A., & Haider, N. G. (2017). Analyzing undergraduate students’ performance using educational data mining. Computers & Education, 113, 177–194. https://doi.org/10.1016/j.compedu.2017.05.007Askinadze, A., & Conrad, S. (2017). Application of the Dynamic Time Warping Distance for the Student Drop-out Prediction on Time Series Data. Proceedings of the 10th International Conference on Educational Data Mining, 342–343.Azevedo, A., & Santos, M. F. (2008). KDD, semma and CRISP-DM: A parallel overview. MCCSIS’08 - IADIS Multi Conference on Computer Science and Information Systems; Proceedings of Informatics 2008 and Data Mining 2008, June, 182–185.Aziz, A. A., Ismail, N. H., Ahmad, F., & Hassan, H. (2015). A framework for students’ academic performance analysis using naïve bayes classifier. Jurnal Teknologi, 75(3), 13–19. https://doi.org/10.11113/jt.v75.5037Barbosa Manhães, L. M., Serra da Cruz, S. M., & Zimbrão, G. (2014). WAVE: an Architecture for Predicting Dropout in Undergraduate Courses using EDM. Proceeding SAC ’14 Proceedings of the 29th Annual ACM Symposium on Applied Computing, 243–247. https://doi.org/10.1145/2554850.2555135Barker, K., Trafalis, T., & Reed Rhoads, T. (2004). LEARNING FROM STUDENT DATA. Proceedings of the 2004 Systems and Information Engineering Design Symposium Matthew. https://doi.org/10.1109/SIEDS.2004.239819Barnes, T., Desmarais, M., Romero, C., & Ventura, S. (2009). EDM’09 - Educational Data Mining 2009: 2nd International Conference on Educational Data Mining. In EDM’09 - Educational Data Mining 2009: 2nd International Conference on Educational Data Mining.Bayer, J., Bydzovská, H., Géryk, J., Obsivac, T., & Popelínský, L. (2012). Predicting drop-out from social behaviour of students. Proceedings of the 5th International Conference on Educational Data Mining, Dm, 103–109.Beaulac, C., & Rosenthal, J. S. (2019). Predicting University Students ’ Academic Success and Choice of Major using Random Forests.Beltran, B. (2016). MINERÍA DE DATOS (Vol. 30, Issue 1). https://doi.org/10.1016/0032- 0633(82)90071-XBetancourt, G. A. (2005). LAS MÁQUINAS DE SOPORTE VECTORIAL (SVMs). Scientia Et Technica, XI(27), 67–72. https://doi.org/10.22517/23447214.6895Birjali, M., Beni-hssane, A., & Erritali, M. (2018). Learning with Big Data Technology: The Future of Education. 565. https://doi.org/10.1007/978-3-319-60834-1Boser, B. E., Guyon, I. M., & Vapnik, V. N. (1992). A training algorithm for optimal margin classifiers. Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, 144–152. https://doi.org/10.1145/130385.130401Brown, M. S. (2014). Data Mining for Dummies. https://doi.org/10.1007/978-1-4614-7669-6Burgueño, M. J., García-Bastos, J. L., & González-Buitrago, J. M. (1995). ROC curves in the evaluation of diagnostic tests. Medicina Clínica, 104(17), 661–670.Cambruzzi, W., Rigo, S. J., & Barbosa, J. L. V. (2015). Dropout prediction and reduction in distance education courses with the learning analytics multitrail approach. Journal of Universal Computer Science, 21(1), 23–47.Carmona Suárez, E. J. (2014). Máquinas de Vectores Soporte (SVM). Dpto. de Inteligencia Artificial, ETS de Ingeniería Inforática, Universidad Nacional de Educación a Distancia (UNED), 1–25. http://www.ia.uned.es/~ejcarmona/publicaciones/[2013-Carmona] SVM.pdfCastaño, E., Gallón, S., Gómez, K., & Vásquez, J. (2004). Deserción estudiantil universitaria una aplicación de modelos de duración. Lecturas de Economia, 60(60), 39–65.Chai, K. E. K., & Gibson, D. (2015). Predicting the risk of attrition for undergraduate students with time based modelling. Proceedings of the 12th International Conference on Cognition and Exploratory Learning in the Digital Age, CELDA 2015, Celda, 109–116.Cheewaprakobkit, P. (2013). Study of factors analysis affecting academic achievement of undergraduate students in international program. Lecture Notes in Engineering and Computer Science, 2202, 332–336.Christian, T. M., & Ayub, M. (2014). Exploration of classification using NBTree for predicting students’ performance. Proceedings of 2014 International Conference on Data and Software Engineering, ICODSE 2014, 1–6. https://doi.org/10.1109/ICODSE.2014.7062654 Cortes, C., & Vapnik, V. (1995). Support-Vector Networks. Mach. Learn. , 20 (3), 44(13), 273– 297. Predicting Students Drop Out A Case Study, (2009).Delen, D. (2010). A comparative analysis of machine learning techniques for student retention management. Decision Support Systems, 49(4), 498–506. https://doi.org/10.1016/j.dss.2010.06.003Delen, D. (2011). Predicting student attrition with data mining methods. Journal of College Student Retention: Research, Theory and Practice, 13(1), 17–35. https://doi.org/10.2190/CS.13.1.bDevasia, T., Vinushree, T. P., & Hegde, V. (2016). Prediction of students performance using Educational Data Mining. Proceedings of 2016 International Conference on Data Mining and Advanced Computing, SAPIENCE 2016, 91–95. https://doi.org/10.1109/SAPIENCE.2016.7684167Dharmawan, T., Ginardi, H., & Munif, A. (2018). Dropout Detection Using Non-Academic Data. Proceedings - 2018 4th International Conference on Science and Technology, ICST 2018, 1, 1–4. https://doi.org/10.1109/ICSTC.2018.8528619Edwards, W., & Fasolo, B. (2001). Decision Technology. 581–606.Fernandes, E., Holanda, M., Victorino, M., Borges, V., Carvalho, R., & Erven, G. Van. (2019). Educational data mining: Predictive analysis of academic performance of public school students in the capital of Brazil. Journal of Business Research, 94(August 2017), 335–343. https://doi.org/10.1016/j.jbusres.2018.02.012Friedman, N., Geiger, D., & Goldszmidt, M. (1997). Bayesian Network Classifier. 131–163.Gandhi, R. (2018). Support Vector Machine - Introduction to Machine Learning Algorithms. https://towardsdatascience.com/support-vector-machine-introduction-to-machine-learningalgorithms-934a444fca47García, J. G., Puga, J. L., Cano Guillén, C. J., Gea, A. B., & de la Fuente Sánchez, L. (2006). Aplicación de las redes bayesianas al modelado de las actitudes emprendedoras. IV Congreso de Metodología de Encuestas, August 2015, 235–242.Gulati, H. (2015). Predictive analytics using data mining technique. 2015 International Conference on Computing for Sustainable Global Development, INDIACom 2015, 713–716.Güner, N., Yaldir, A., Gündüz, G., Çomak, E., Tokat, S., & Iplikçi, S. (2014). Predicting academically at-risk engineering students: A soft computing application. Acta Polytechnica Hungarica, 11(5), 199–216. https://doi.org/10.12700/aph.11.05.2014.05.12Guzmán Ruiz, C., Muriel Durán, D., & Franco Gallego, J. (2009). Deserción estudiantil en la educación superior colombiana. Metodología de seguimiento, diagnóstico y elementos para su prevención. http://www.mineducacion.gov.co/sistemasdeinformacion/1735/articles- 254702_libro_desercion.pdfHan, J., Kamber, M., & Pei, J. (2012). Data mining concepts and techniques. https://doi.org/10.1109/ICMIRA.2013.45Hasbun, T., Araya, A., & Villalon, J. (2016). Extracurricular activities as dropout prediction factors in higher education using decision trees. Proceedings - IEEE 16th International Conference on Advanced Learning Technologies, ICALT 2016, 242–244. https://doi.org/10.1109/ICALT.2016.66Heredia, D., Amaya, Y., & Barrientos, E. (2015). Student Dropout Predictive Model Using Data Mining Techniques. IEEE Latin America Transactions, 13(9), 3127–3134. https://doi.org/10.1109/TLA.2015.7350068Hernandez Gonzalez, A. G., Melendez Armenta, R. A., Morales Rosales, L. A., Garcia Barrientos, A., Tecpanecatl Xihuitl, J. L., & Algredo, I. (2016). Comparative Study of Algorithms to Predict the Desertion in the Students at the ITSM-Mexico. IEEE Latin America Transactions, 14(11), 4573–4578. https://doi.org/10.1109/TLA.2016.7795831Hoffait, A. S., & Schyns, M. (2017). Early detection of university students with potential difficulties. Decision Support Systems, 101, 1–11. https://doi.org/10.1016/j.dss.2017.05.003Huber, S., Wiemer, H., Schneider, D., & Ihlenfeldt, S. (2019). DMME: Data mining methodology for engineering applications - A holistic extension to the CRISP-DM model. Procedia CIRP, 79, 403–408. https://doi.org/10.1016/j.procir.2019.02.106Jin, Q., Imbrie, P. K., Lin, J. J. J., & Chen, X. (2011). A multi-outcome hybrid model for predicting student success in engineering. ASEE Annual Conference and Exposition, Conference Proceedings.Kabakchieva, D., Stefanova, K., & Kisimov, V. (2011). Analyzing university data for determining student profiles and predicting performance. EDM 2011 - Proceedings of the 4th International Conference on Educational Data Mining.Kalles, D., & Pierrakeas, C. (2006a). Analyzing student performance in distance learning with genetic algorithms and decision trees. Applied Artificial Intelligence, 20(8), 655–674. https://doi.org/10.1080/08839510600844946Kalles, D., & Pierrakeas, C. (2006b). Using genetic algorithms and decision trees for a posteriori analysis and evaluation of tutoring practices based on student failure models. IFIP International Federation for Information Processing, 204(August), 9–18. https://doi.org/10.1007/0-387-34224-9_2Kingsford, C., & Salzberg, S. L. (2008). What are decision trees. Nat Biotechnol, 23(1), 1–7. https://doi.org/10.1038/nbt0908-1011Kotsiantis, S. B., & Pintelas, P. E. (2005). Predicting students’ marks in Hellenic Open University. Proceedings - 5th IEEE International Conference on Advanced Learning Technologies, ICALT 2005, 2005, 664–668. https://doi.org/10.1109/ICALT.2005.223Kotsiantis, S., Pierrakeas, C., & Pintelas, P. (2004). Predicting students’ performance in distance learning using machine learning techniques. Applied Artificial Intelligence, 18(5), 411–426. https://doi.org/10.1080/08839510490442058Krishna Kishore, K. V., Venkatramaphanikumar, S., & Alekhya, S. (2014). Prediction of student academic progression: A case study on Vignan University. 2014 International Conference on Computer Communication and Informatics: Ushering in Technologies of Tomorrow, Today, ICCCI 2014, 2, 1–6. https://doi.org/10.1109/ICCCI.2014.6921731Kumar Baradwaj, B., & Pal, S. (2011). Mining Educational Data to Analyze Students’ Performance. JACSA) International Journal of Advanced Computer Science and Applications, 02.Lee, S., & Chung, J. Y. (2019). The Machine Learning-Based Dropout Early Warning System for Improving the Performance of Dropout Prediction. Applied Sciences, 9(15), 3093. https://doi.org/10.3390/app9153093Lesinski, G., Corns, S., & Dagli, C. (2016). Application of an Artificial Neural Network to Predict Graduation Success at the United States Military Academy. Procedia Computer Science, 95, 375–382. https://doi.org/10.1016/j.procs.2016.09.348López de Ullibarri, G. I., & Píta Fernández, S. (1998). Curvas ROC. Cad Aten Primaria, 5(4), 229–235.Lykourentzou, I., Giannoukos, I., Nikolopoulos, V., Mpardis, G., & Loumos, V. (2009). Dropout prediction in e-learning courses through the combination of machine learning techniques. Computers and Education, 53(3), 950–965. https://doi.org/10.1016/j.compedu.2009.05.010Manhães, L. M. B., Da Cruz, S. M. S., & Zimbrão, G. (2014). The impact of high dropout rates in a large public brazilian university a quantitative approach using educational data mining. CSEDU 2014 - Proceedings of the 6th International Conference on Computer Supported Education, 3, 124–129. https://doi.org/10.5220/0004958601240129Márquez-Vera, C., Cano, A., Romero, C., Noaman, A. Y. M., Mousa Fardoun, H., & Ventura, S. (2016). Early dropout prediction using data mining: A case study with high school students. Expert Systems, 33(1), 107–124. https://doi.org/10.1111/exsy.12135Márquez-Vera, C., Romero Morales, C., & Ventura Soto, S. (2013). Predicting school failure and dropout by using data mining techniques. Revista Iberoamericana de Tecnologias Del Aprendizaje, 8(1), 7–14. https://doi.org/10.1109/RITA.2013.2244695Mayilvaganan, M., & Kalpanadevi, D. (2015). Comparison of classification techniques for predicting the performance of students academic environment. 2014 International Conference on Communication and Network Technologies, ICCNT 2014, 2015-March, 113–118. https://doi.org/10.1109/CNT.2014.7062736Ministerio de Educación de Colombia. (2006). La Revolución Educativa 2002 – 2006. Media, 1– 6.Ministerio de Educación de Colombia. (2019). Qué es el SPADIES. https://www.mineducacion.gov.co/sistemasinfo/spadies/Informacion- Institucional/254648:Que-es-el-SPADIESMiranda, M. A., & Guzmán, J. (2017). Análisis de la deserción de estudiantes universitarios usando técnicas de minería de datos. Formacion Universitaria, 10(3), 61–68. https://doi.org/10.4067/S0718-50062017000300007Mishra, A. (2018). Metrics to Evaluate your Machine Learning Algorithm. https://towardsdatascience.com/metrics-to-evaluate-your-machine-learning-algorithmf10ba6e38234Mishra, T., Kumar, D., & Gupta, S. (2014). Mining students’ data for prediction performance. International Conference on Advanced Computing and Communication Technologies, ACCT, 255–262. https://doi.org/10.1109/ACCT.2014.105Mitchell, T. M. (1997). Machine Learning. In McGraw-Hill Science/Engineering/Math.Moseley, L. G., & Mead, D. M. (2008). Predicting who will drop out of nursing courses: A machine learning exercise. Nurse Education Today, 28(4), 469–475. https://doi.org/10.1016/j.nedt.2007.07.012Mustafa, M. N., Chowdhury, L., & Kamal, M. S. (2012). Students dropout prediction for intelligent system from tertiary level in developing country. 2012 International Conference on Informatics, Electronics and Vision, ICIEV 2012, 113–118. https://doi.org/10.1109/ICIEV.2012.6317441Oskouei, R. J., & Askari, M. (2014). Predicting Academic Performance with Applying Data Mining Techniques (Generalizing the results of two Different Case Studies). Computer Engineering and Applications Journal, 3(2), 79–88. https://doi.org/10.18495/comengapp.v3i2.81Osmanbegovi, E. (2012). Data Mining Approach for Predicting Student Performance. Economic Review : Journal of Economics and Business, X(1), 3–12.Peralta, B., Poblete, T., & Caro, L. (2017). Automatic feature selection for desertion and graduation prediction: A chilean case. Proceedings - International Conference of the Chilean Computer Science Society, SCCC. https://doi.org/10.1109/SCCC.2016.7836055Pereira, R. T., Romero, A. C., & Toledo, J. J. (2013). Extraction student dropout patterns with data mining techniques in undergraduate programs. IC3K 2013; KDIR 2013 - 5th International Conference on Knowledge Discovery and Information Retrieval and KMIS 2013 - 5th International Conference on Knowledge Management and Information Sharing, Proc., 136–142. https://doi.org/10.5220/0004543001360142Pérez, A., Grandón, E. E., Caniupán, M., & Vargas, G. (2019). Comparative Analysis of Prediction Techniques to Determine Student Dropout: Logistic Regression vs Decision Trees. Proceedings - International Conference of the Chilean Computer Science Society, SCCC, 2018-Novem. https://doi.org/10.1109/SCCC.2018.8705262Perez, B., Castellanos, C., & Correal, D. (2018). Applying Data Mining Techniques to Predict Student Dropout: A Case Study. 2018 IEEE 1st Colombian Conference on Applications in Computational Intelligence, ColCACI 2018 - Proceedings, 1–6. https://doi.org/10.1109/ColCACI.2018.8484847Perez, M. (2014). Minería de datos a treves de ejemplos. 22. http://www.rclibros.es/pdf/capitulo_mineria.pdfPicard, R. W., Papert, S., Bender, W., Blumberg, B., Breazeal, C., Cavallo, D., Machover, T., Resnick, M., Roy, D., & Strohecker, C. (2004). Affective learning - a manifesto. BT Technology Journal, 22(4), 253–269. https://doi.org/10.1023/B:BTTJ.0000047603.37042.33Pradeep, A., Das, S., & Kizhekkethottam, J. J. (2015). Students dropout factor prediction using EDM techniques. Proceedings of the IEEE International Conference on Soft-Computing and Network Security, ICSNS 2015, 1–7. https://doi.org/10.1109/ICSNS.2015.7292372Quadri, M., & Kalyankar, D. (2010). Drop out feature of student data for academic performance using decision tree techniques. Global Journal of Computer, 10(2), 2–5. http://computerresearch.org/stpr/index.php/gjcst/article/viewArticle/128Quinlan, J. R. (1986). Induction of decision trees. Machine Learning, 1(1), 81–106. https://doi.org/10.1007/bf00116251Recuero, P. (2018). Machine Learning a tu alcance: La matriz de confusión. https://empresas.blogthinkbig.com/ml-a-tu-alcance-matriz-confusion/Salazar, A., Gosálbez, J., Bosch, I., Miralles, R., & Vergara, L. (2004). A case study of knowledge discovery on academic achievement, student desertion and student retention. ITRE 2004 - 2nd International Conference on Information Technology: Research and Education - Proceedings, January, 150–154. https://doi.org/10.1109/itre.2004.1393665Sangodiah, A., Beleya, P., Muniandy, M., Heng, L. E., & Ramendran Spr, C. (2015). Minimizing student attrition in higher learning institutions in Malaysia using support vector machine. Journal of Theoretical and Applied Information Technology, 71(3), 377–385.Santana, M. A., Costa, E. B., Neto, B. F. S., Silva, I. C. L., & Rego, J. B. A. (2015). A predictive model for identifying students with dropout profiles in online courses. CEUR Workshop Proceedings, 1446.Şara, N. B., Halland, R., Igel, C., & Alstrup, S. (2015). High-school dropout prediction using machine learning: A Danish large-scale study. 23rd European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, ESANN 2015 - Proceedings, April, 319–324.Saravanan, R., & Sujatha, P. (2018). Algorithms : A Perspective of Supervised Learning Approaches in Data Classification. 2018 Second International Conference on Intelligent Computing and Control Systems (ICICCS), Iciccs, 945–949.Sarker, F., Tiropanis, T., & Davis, H. C. (2014). Linked data, data mining and external open data for better prediction of at-risk students. Proceedings - 2014 International Conference on Control, Decision and Information Technologies, CoDIT 2014, 652–657. https://doi.org/10.1109/CoDIT.2014.6996973Segura-Morales, M., & Loza-Aguirre, E. (2018). Using Decision Trees for Predicting Academic Performance Based on Socio-Economic Factors. Proceedings - 2017 International Conference on Computational Science and Computational Intelligence, CSCI 2017, 1132– 1136. https://doi.org/10.1109/CSCI.2017.197Shahiri, A. M., Husain, W., & Rashid, N. A. (2015). A Review on Predicting Student’s Performance Using Data Mining Techniques. Procedia Computer Science, 72(February 2016), 414–422. https://doi.org/10.1016/j.procs.2015.12.157Sharabiani, A., Karim, F., Sharabiani, A., Atanasov, M., & Darabi, H. (2014). An enhanced bayesian network model for prediction of students’ academic performance in engineering programs. IEEE Global Engineering Education Conference, EDUCON, April, 832–837. https://doi.org/10.1109/EDUCON.2014.6826192Siri, A. (2015). Predicting Students’ Dropout at University Using Artificial Neural Networks. Italian Journal of Sociology of Education, 7(2), 225–247.Solis, M., Moreira, T., Gonzalez, R., Fernandez, T., & Hernandez, M. (2018). Perspectives to Predict Dropout in University Students with Machine Learning. 2018 IEEE International Work Conference on Bioinspired Intelligence, IWOBI 2018 - Proceedings. https://doi.org/10.1109/IWOBI.2018.8464191Tair, M. M. A. (2015). Mining Educational Data to Improve Students ’ Performance : A Case Study Mining Educational Data t o Improve Students ’ Performance : A Case Study. October.Thomas, E. H., & Galambos, N. (2004). What satisfies students? Mining student-opinion data with regression and decision tree analysis. Research in Higher Education, 45(3), 251–269. Timarán Pereira, S. R., Hernández Arteaga, I., Caicedo Zambrano, S. J., Hidalgo Troya, A., & AlvaradoPérez, J. C. (2016). Descubrimiento de patrones de desempeño académico con árboles de decisión en las competencias genéricas de la formación profesional. Descubrimiento de Patrones de Desempeño Académico Con Árboles de Decisión En Las Competencias Genéricas de La Formación Profesional, 2016, 63–86. https://doi.org/10.16925/9789587600490Tsai, C. F., Tsai, C. T., Hung, C. S., & Hwang, P. Sen. (2011). Data mining techniques for identifying students at risk of failing a computer proficiency test required for graduation. Australasian Journal of Educational Technology, 27(3), 481–498. https://doi.org/10.14742/ajet.956Universidad Pedagógica y Tecnológica de Colombia. (2004). Unidad 1 Estadistica Descriptiva. https://virtual.uptc.edu.co/ova/estadistica/docs/libros/h_men_prob_est/lecciones_html/un1/1 _8_3.htmlVeitch, W. R. (2004). Identifying Characteristics of High School Dropouts: Data Mining with A Decision Tree Model. Online Submission, 1–11.Wirth, R. (2000). CRISP-DM : Towards a Standard Process Model for Data Mining. Proceedings of the Fourth International Conference on the Practical Application of Knowledge Discovery and Data Mining, 24959, 29–39. https://doi.org/10.1.1.198.5133Yehuala, M. A. (2015). Application Of Data Mining Techniques For Student Success And Failure Prediction The Case Of DebreMarkos University. International Journal of Scientific & Technology Research, 4(4), 91–94.Zaki, M., & Meira, W. J. (2013). Data Mining and Analysis: Fundamental Concepts and Algorithms. https://doi.org/10.1145/3054925Zeng, W., Chin, S.-C., Zeimet, B., Kuang, R., & Chi, C.-L. (2017). Dropout Prediction in Home Care Training. Proceedings of the 10th International Conference on Educational Data Mining, 442–447.Zhang, Y., & Oussena, S. (2010). USE DATA MINING TO IMPROVE STUDENT RETENTION IN HIGHER EDUCATION – A CASE STUDY. Middlesex University Research Repository.PublicationORIGINALMODELO PARA LA PREDICCIÓN DE LA DESERCIÓN DE ESTUDIANTES DE PREGADO, BASADO EN TÉCNICAS DE MINERÍA DE DATOS.pdfMODELO PARA LA PREDICCIÓN DE LA DESERCIÓN DE ESTUDIANTES DE PREGADO, BASADO EN TÉCNICAS DE MINERÍA DE DATOS.pdfapplication/pdf1809812https://repositorio.cuc.edu.co/bitstreams/837274c4-814e-440b-9668-ad287c3b15c5/download3fa8da0438279c3839c5d31a1d2864edMD51CC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-81031https://repositorio.cuc.edu.co/bitstreams/efa03807-b78b-4671-9df5-ba26d7a0e4e4/download934f4ca17e109e0a05eaeaba504d7ce4MD52LICENSElicense.txtlicense.txttext/plain; charset=utf-83196https://repositorio.cuc.edu.co/bitstreams/df3c7a58-9b94-40c0-b9e0-750669def4e3/downloade30e9215131d99561d40d6b0abbe9badMD53THUMBNAILMODELO PARA LA PREDICCIÓN DE LA DESERCIÓN DE ESTUDIANTES DE PREGADO, BASADO EN TÉCNICAS DE MINERÍA DE DATOS.pdf.jpgMODELO PARA LA PREDICCIÓN DE LA DESERCIÓN DE ESTUDIANTES DE PREGADO, BASADO EN TÉCNICAS DE MINERÍA DE DATOS.pdf.jpgimage/jpeg25282https://repositorio.cuc.edu.co/bitstreams/6a1ba468-2cce-486c-9ec5-76081059f2c2/download631709ed4e9bcd5e097b8300de6b4a43MD54THUMBNAILMODELO PARA LA PREDICCIÓN DE LA DESERCIÓN DE ESTUDIANTES DE PREGADO, BASADO EN TÉCNICAS DE MINERÍA DE DATOS.pdf.jpgMODELO PARA LA PREDICCIÓN DE LA DESERCIÓN DE ESTUDIANTES DE PREGADO, BASADO EN TÉCNICAS DE MINERÍA DE DATOS.pdf.jpgimage/jpeg25282https://repositorio.cuc.edu.co/bitstreams/d8c1cf3a-3005-4c49-b1dd-76ee57dd4a81/download631709ed4e9bcd5e097b8300de6b4a43MD54TEXTMODELO PARA LA PREDICCIÓN DE LA DESERCIÓN DE ESTUDIANTES DE PREGADO, BASADO EN TÉCNICAS DE MINERÍA DE DATOS.pdf.txtMODELO PARA LA PREDICCIÓN DE LA DESERCIÓN DE ESTUDIANTES DE PREGADO, BASADO EN TÉCNICAS DE MINERÍA DE DATOS.pdf.txttext/plain195277https://repositorio.cuc.edu.co/bitstreams/b303bf15-1e1b-4c23-b514-5a09cc7715b8/download883c081178af3b7ca2843441b5a3c256MD5511323/7077oai:repositorio.cuc.edu.co:11323/70772024-09-17 14:24:12.736http://creativecommons.org/licenses/by-nc-sa/4.0/Attribution-NonCommercial-ShareAlike 4.0 Internationalopen.accesshttps://repositorio.cuc.edu.coRepositorio de la Universidad de la Costa CUCrepdigital@cuc.edu.coQXV0b3Jpem8gKGF1dG9yaXphbW9zKSBhIGxhIEJpYmxpb3RlY2EgZGUgbGEgSW5zdGl0dWNpw7NuIHBhcmEgcXVlIGluY2x1eWEgdW5hIGNvcGlhLCBpbmRleGUgeSBkaXZ1bGd1ZSBlbiBlbCBSZXBvc2l0b3JpbyBJbnN0aXR1Y2lvbmFsLCBsYSBvYnJhIG1lbmNpb25hZGEgY29uIGVsIGZpbiBkZSBmYWNpbGl0YXIgbG9zIHByb2Nlc29zIGRlIHZpc2liaWxpZGFkIGUgaW1wYWN0byBkZSBsYSBtaXNtYSwgY29uZm9ybWUgYSBsb3MgZGVyZWNob3MgcGF0cmltb25pYWxlcyBxdWUgbWUobm9zKSBjb3JyZXNwb25kZShuKSB5IHF1ZSBpbmNsdXllbjogbGEgcmVwcm9kdWNjacOzbiwgY29tdW5pY2FjacOzbiBww7pibGljYSwgZGlzdHJpYnVjacOzbiBhbCBww7pibGljbywgdHJhbnNmb3JtYWNpw7NuLCBkZSBjb25mb3JtaWRhZCBjb24gbGEgbm9ybWF0aXZpZGFkIHZpZ2VudGUgc29icmUgZGVyZWNob3MgZGUgYXV0b3IgeSBkZXJlY2hvcyBjb25leG9zIHJlZmVyaWRvcyBlbiBhcnQuIDIsIDEyLCAzMCAobW9kaWZpY2FkbyBwb3IgZWwgYXJ0IDUgZGUgbGEgbGV5IDE1MjAvMjAxMiksIHkgNzIgZGUgbGEgbGV5IDIzIGRlIGRlIDE5ODIsIExleSA0NCBkZSAxOTkzLCBhcnQuIDQgeSAxMSBEZWNpc2nDs24gQW5kaW5hIDM1MSBkZSAxOTkzIGFydC4gMTEsIERlY3JldG8gNDYwIGRlIDE5OTUsIENpcmN1bGFyIE5vIDA2LzIwMDIgZGUgbGEgRGlyZWNjacOzbiBOYWNpb25hbCBkZSBEZXJlY2hvcyBkZSBhdXRvciwgYXJ0LiAxNSBMZXkgMTUyMCBkZSAyMDEyLCBsYSBMZXkgMTkxNSBkZSAyMDE4IHkgZGVtw6FzIG5vcm1hcyBzb2JyZSBsYSBtYXRlcmlhLg0KDQpBbCByZXNwZWN0byBjb21vIEF1dG9yKGVzKSBtYW5pZmVzdGFtb3MgY29ub2NlciBxdWU6DQoNCi0gTGEgYXV0b3JpemFjacOzbiBlcyBkZSBjYXLDoWN0ZXIgbm8gZXhjbHVzaXZhIHkgbGltaXRhZGEsIGVzdG8gaW1wbGljYSBxdWUgbGEgbGljZW5jaWEgdGllbmUgdW5hIHZpZ2VuY2lhLCBxdWUgbm8gZXMgcGVycGV0dWEgeSBxdWUgZWwgYXV0b3IgcHVlZGUgcHVibGljYXIgbyBkaWZ1bmRpciBzdSBvYnJhIGVuIGN1YWxxdWllciBvdHJvIG1lZGlvLCBhc8OtIGNvbW8gbGxldmFyIGEgY2FibyBjdWFscXVpZXIgdGlwbyBkZSBhY2Npw7NuIHNvYnJlIGVsIGRvY3VtZW50by4NCg0KLSBMYSBhdXRvcml6YWNpw7NuIHRlbmRyw6EgdW5hIHZpZ2VuY2lhIGRlIGNpbmNvIGHDsW9zIGEgcGFydGlyIGRlbCBtb21lbnRvIGRlIGxhIGluY2x1c2nDs24gZGUgbGEgb2JyYSBlbiBlbCByZXBvc2l0b3JpbywgcHJvcnJvZ2FibGUgaW5kZWZpbmlkYW1lbnRlIHBvciBlbCB0aWVtcG8gZGUgZHVyYWNpw7NuIGRlIGxvcyBkZXJlY2hvcyBwYXRyaW1vbmlhbGVzIGRlbCBhdXRvciB5IHBvZHLDoSBkYXJzZSBwb3IgdGVybWluYWRhIHVuYSB2ZXogZWwgYXV0b3IgbG8gbWFuaWZpZXN0ZSBwb3IgZXNjcml0byBhIGxhIGluc3RpdHVjacOzbiwgY29uIGxhIHNhbHZlZGFkIGRlIHF1ZSBsYSBvYnJhIGVzIGRpZnVuZGlkYSBnbG9iYWxtZW50ZSB5IGNvc2VjaGFkYSBwb3IgZGlmZXJlbnRlcyBidXNjYWRvcmVzIHkvbyByZXBvc2l0b3Jpb3MgZW4gSW50ZXJuZXQgbG8gcXVlIG5vIGdhcmFudGl6YSBxdWUgbGEgb2JyYSBwdWVkYSBzZXIgcmV0aXJhZGEgZGUgbWFuZXJhIGlubWVkaWF0YSBkZSBvdHJvcyBzaXN0ZW1hcyBkZSBpbmZvcm1hY2nDs24gZW4gbG9zIHF1ZSBzZSBoYXlhIGluZGV4YWRvLCBkaWZlcmVudGVzIGFsIHJlcG9zaXRvcmlvIGluc3RpdHVjaW9uYWwgZGUgbGEgSW5zdGl0dWNpw7NuLCBkZSBtYW5lcmEgcXVlIGVsIGF1dG9yKHJlcykgdGVuZHLDoW4gcXVlIHNvbGljaXRhciBsYSByZXRpcmFkYSBkZSBzdSBvYnJhIGRpcmVjdGFtZW50ZSBhIG90cm9zIHNpc3RlbWFzIGRlIGluZm9ybWFjacOzbiBkaXN0aW50b3MgYWwgZGUgbGEgSW5zdGl0dWNpw7NuIHNpIGRlc2VhIHF1ZSBzdSBvYnJhIHNlYSByZXRpcmFkYSBkZSBpbm1lZGlhdG8uDQoNCi0gTGEgYXV0b3JpemFjacOzbiBkZSBwdWJsaWNhY2nDs24gY29tcHJlbmRlIGVsIGZvcm1hdG8gb3JpZ2luYWwgZGUgbGEgb2JyYSB5IHRvZG9zIGxvcyBkZW3DoXMgcXVlIHNlIHJlcXVpZXJhIHBhcmEgc3UgcHVibGljYWNpw7NuIGVuIGVsIHJlcG9zaXRvcmlvLiBJZ3VhbG1lbnRlLCBsYSBhdXRvcml6YWNpw7NuIHBlcm1pdGUgYSBsYSBpbnN0aXR1Y2nDs24gZWwgY2FtYmlvIGRlIHNvcG9ydGUgZGUgbGEgb2JyYSBjb24gZmluZXMgZGUgcHJlc2VydmFjacOzbiAoaW1wcmVzbywgZWxlY3Ryw7NuaWNvLCBkaWdpdGFsLCBJbnRlcm5ldCwgaW50cmFuZXQsIG8gY3VhbHF1aWVyIG90cm8gZm9ybWF0byBjb25vY2lkbyBvIHBvciBjb25vY2VyKS4NCg0KLSBMYSBhdXRvcml6YWNpw7NuIGVzIGdyYXR1aXRhIHkgc2UgcmVudW5jaWEgYSByZWNpYmlyIGN1YWxxdWllciByZW11bmVyYWNpw7NuIHBvciBsb3MgdXNvcyBkZSBsYSBvYnJhLCBkZSBhY3VlcmRvIGNvbiBsYSBsaWNlbmNpYSBlc3RhYmxlY2lkYSBlbiBlc3RhIGF1dG9yaXphY2nDs24uDQoNCi0gQWwgZmlybWFyIGVzdGEgYXV0b3JpemFjacOzbiwgc2UgbWFuaWZpZXN0YSBxdWUgbGEgb2JyYSBlcyBvcmlnaW5hbCB5IG5vIGV4aXN0ZSBlbiBlbGxhIG5pbmd1bmEgdmlvbGFjacOzbiBhIGxvcyBkZXJlY2hvcyBkZSBhdXRvciBkZSB0ZXJjZXJvcy4gRW4gY2FzbyBkZSBxdWUgZWwgdHJhYmFqbyBoYXlhIHNpZG8gZmluYW5jaWFkbyBwb3IgdGVyY2Vyb3MgZWwgbyBsb3MgYXV0b3JlcyBhc3VtZW4gbGEgcmVzcG9uc2FiaWxpZGFkIGRlbCBjdW1wbGltaWVudG8gZGUgbG9zIGFjdWVyZG9zIGVzdGFibGVjaWRvcyBzb2JyZSBsb3MgZGVyZWNob3MgcGF0cmltb25pYWxlcyBkZSBsYSBvYnJhIGNvbiBkaWNobyB0ZXJjZXJvLg0KDQotIEZyZW50ZSBhIGN1YWxxdWllciByZWNsYW1hY2nDs24gcG9yIHRlcmNlcm9zLCBlbCBvIGxvcyBhdXRvcmVzIHNlcsOhbiByZXNwb25zYWJsZXMsIGVuIG5pbmfDum4gY2FzbyBsYSByZXNwb25zYWJpbGlkYWQgc2Vyw6EgYXN1bWlkYSBwb3IgbGEgaW5zdGl0dWNpw7NuLg0KDQotIENvbiBsYSBhdXRvcml6YWNpw7NuLCBsYSBpbnN0aXR1Y2nDs24gcHVlZGUgZGlmdW5kaXIgbGEgb2JyYSBlbiDDrW5kaWNlcywgYnVzY2Fkb3JlcyB5IG90cm9zIHNpc3RlbWFzIGRlIGluZm9ybWFjacOzbiBxdWUgZmF2b3JlemNhbiBzdSB2aXNpYmlsaWRhZA==