Evaluating supervised learning approaches for spatial-domain multi-focus image fusion

Image fusion is the generation of an image  that combines the most relevant information from a set of images of the same scene, acquired with different cameras or camera settings. Multi-Focus Image Fusion (MFIF) aims to generate an image with extended depth-of-field from a set of images taken at dif...

Full description

Autores:
Atencio Ortiz, Pedro
Sánchez torres, German
Branch Bedoya, John Willian
Tipo de recurso:
Article of journal
Fecha de publicación:
2017
Institución:
Universidad Nacional de Colombia
Repositorio:
Universidad Nacional de Colombia
Idioma:
spa
OAI Identifier:
oai:repositorio.unal.edu.co:unal/60367
Acceso en línea:
https://repositorio.unal.edu.co/handle/unal/60367
http://bdigital.unal.edu.co/58699/
Palabra clave:
62 Ingeniería y operaciones afines / Engineering
Multi-focus image fusion
image processing
supervised learning
machine learning
Fusión de imágenes mutifoco
procesamiento de imágenes
aprendizaje supervisado
aprendizaje de máquina
Rights
openAccess
License
Atribución-NoComercial 4.0 Internacional
Description
Summary:Image fusion is the generation of an image  that combines the most relevant information from a set of images of the same scene, acquired with different cameras or camera settings. Multi-Focus Image Fusion (MFIF) aims to generate an image with extended depth-of-field from a set of images taken at different focal distances or focal planes, and it proposes a solution to the typical limited depth-of-field problem in an optical system configuration. A broad variety of works presented in the literature address this problem. The primary approaches found there are domain transformations and block-of-pixels analysis. In this work, we evaluate different systems of supervised machine learning applied to MFIF, including k-nearest neighbors, linear discriminant analysis, neural networks, and support vector machines. We started from two images at different focal distances and divided them into rectangular regions. The main objective of the machine-learning-based classification system is to choose the parts of both images that must be in the fused image in order to obtain a completely focused image. For focus quantification, we used the most popular metrics proposed in the literature, such as: Laplacian energy, sum-modified Laplacian, and gradient energy, among others. The evaluation of the proposed method considered classifier testing and fusion quality metrics commonly used in research, such as visual information fidelity and mutual information feature. Our results strongly suggest that the automatic classification concept satisfactorily addresses the MFIF problem.