Spatial Quality Control Method for Surface Temperature Observations Based on Multiple Elements
Quality control can effectively improve the quality of surface meteorological observations. To ensure the stability and effectiveness of a quality control model under different terrain and climate conditions, it is necessary to structure a quality control model with strong generalization ability. Al...
- Autores:
-
Ye, Xiaoling
Yang, Xing
Xiong, Xiong
Yang, Shuai
Chen, Yang
- Tipo de recurso:
- Article of journal
- Fecha de publicación:
- 2017
- Institución:
- Universidad Nacional de Colombia
- Repositorio:
- Universidad Nacional de Colombia
- Idioma:
- spa
- OAI Identifier:
- oai:repositorio.unal.edu.co:unal/63583
- Acceso en línea:
- https://repositorio.unal.edu.co/handle/unal/63583
http://bdigital.unal.edu.co/64029/
- Palabra clave:
- 55 Ciencias de la tierra / Earth sciences and geology
Surface air temperature
Quality control
Random Forest
Principal component analysis
Temperatura el aire de la superficie
control de calidad
bosques aleatorios
análisis de componentes principales
- Rights
- openAccess
- License
- Atribución-NoComercial 4.0 Internacional
Summary: | Quality control can effectively improve the quality of surface meteorological observations. To ensure the stability and effectiveness of a quality control model under different terrain and climate conditions, it is necessary to structure a quality control model with strong generalization ability. Algorithms such as the Random Forest provide such generalization ability. However, machine learning algorithms are slower than traditional mathematical models. Therefore, a Random Forest quality control algorithm based on the Principal Component Analysis (PCA-RF) is proposed in this paper. Fifteen target stations under different climatic and geomorphological conditions were selected and tested using observations collected four times daily at neighboring stations from 2005-2014. The results show that using PCA to analyze the elemental composition and select elements with high correlation factors, as well as applying the Random Forest algorithm, can effectively reduce the run time and keep the accuracy of the model. The training sample dependence, model prediction accuracy and error detection rate of the PCA-RF model are superior to those of the Spatial Regression method. Therefore, the PCA-RF method is a better-quality control model for the spatial quality control of multiple elements of surface air temperature observations. |
---|