Measuring Representativeness Using Covering Array Principles

Representativeness is an important data quality characteristic in data science processes; a data sample is said to be representative when it reflects a larger group as accurately as possible. Having low representativeness indices in the data can lead to the generation of biased models. Hence, this s...

Authors:
Resource type:
Publication date:
2023
Institution:
Universidad Pedagógica y Tecnológica de Colombia
Repository:
RiUPTC: Repositorio Institucional UPTC
Language:
eng
OAI Identifier:
oai:repositorio.uptc.edu.co:001/14367
Online access:
https://revistas.uptc.edu.co/index.php/ingenieria/article/view/15314
https://repositorio.uptc.edu.co/handle/001/14367
Keywords:
classification algorithms
covering arrays
data quality
data sets
data representativeness
Rights
License
Copyright (c) 2023 Alexander Castro-Romero, Carlos-Alberto Cobos-Lozada