Comparación entre árboles de regresión CART y regresión lineal

Linear regression is the most widely used method in statistics to predict values of continuous variables due to its easy interpretation, but in many situations the suppositions to apply the model are not met and some users tend to force them leading them to erroneous conclusions. CART regression tre...

Full description

Autores:
Díaz Sepúlveda, Juan Felipe
Correa, Juan Carlos
Tipo de recurso:
Fecha de publicación:
2013
Institución:
Universidad Santo Tomás
Repositorio:
Repositorio Institucional USTA
Idioma:
spa
OAI Identifier:
oai:repository.usta.edu.co:11634/39582
Acceso en línea:
https://revistas.usantotomas.edu.co/index.php/estadistica/article/view/1101
http://hdl.handle.net/11634/39582
Palabra clave:
Simulación
Error de predicción
Regresión Lineal
Árboles de clasificación y Regresión CART
Rights
License
http://purl.org/coar/access_right/c_abf2
Description
Summary:Linear regression is the most widely used method in statistics to predict values of continuous variables due to its easy interpretation, but in many situations the suppositions to apply the model are not met and some users tend to force them leading them to erroneous conclusions. CART regression trees is a regression alternative that does not require suppositions on the data to be analyzed and is a method of easy interpretation of results. This work compares predictive levels of linear regression with CART through simulation. In general, it was found that when the correct linear regression model is adjusted to the data, the prediction error of linear regression is always lower than that of CART. It was also found that when linear regression model is erroneously adjusted to the data, the prediction error of CART is lower than that of linear regression only when it has a sufficiently large amount of data.