An automatic approach to generate corpus in Spanish
A corpus is an indispensable linguistic resource for any application of natural language processing. Some corpora have been created manually or semi-automatically for a specific domain. In this paper, we present an automatic approach to generate corpus from digital information sources such as Wikipe...
- Autores:
- Tipo de recurso:
- Fecha de publicación:
- 2018
- Institución:
- Universidad Tecnológica de Bolívar
- Repositorio:
- Repositorio Institucional UTB
- Idioma:
- eng
- OAI Identifier:
- oai:repositorio.utb.edu.co:20.500.12585/8916
- Acceso en línea:
- https://hdl.handle.net/20.500.12585/8916
- Palabra clave:
- Corpus
Knowledge extraction
Linguistic computational
Natural language processing
Text mining
Data mining
Extraction
Natural language processing systems
Tellurium compounds
Websites
Automatic approaches
Corpus
Digital information
Knowledge extraction
Linguistic resources
Propagation algorithm
Text mining
Wikipedia
Linguistics
- Rights
- restrictedAccess
- License
- http://creativecommons.org/licenses/by-nc-nd/4.0/