An artificial immune system based on information theory for keyword extraction from text documents

Authors:
Romero Rodríguez, Carlos Andrés
Niño Vásquez, Luis Fernando
Resource type:
Journal article
Publication date:
2007
Institution:
Universidad Nacional de Colombia
Repository:
Universidad Nacional de Colombia
Language:
spa
OAI Identifier:
oai:repositorio.unal.edu.co:unal/24120
Online access:
https://repositorio.unal.edu.co/handle/unal/24120
http://bdigital.unal.edu.co/15157/
Keywords:
Keyword Extraction
Artificial Immune Systems
Information Theory
Rights
openAccess
License
Attribution-NonCommercial 4.0 International
Description
Summary: This paper presents a model for keyword extraction that extends the basic concepts commonly used in this task in order to provide a formal background for determining how important each keyword is to the documents. The proposed model combines an artificial immune system with a mathematical background based on information theory. Its advantage is that it requires no domain knowledge, no stopword list, and no prior information about the content of the documents. The final result is a set of keywords for each category in the corpus used.