A method for detecting the profile of an author
- Authors:
- Silva, Jesus
García, Silvia
Binda, María Alejandra
Marin Gonzalez, Fredy
Barrios, Rosio
Leon Castro, Bellanit
- Resource type:
- Journal article
- Publication date:
- 2020
- Institution:
- Corporación Universidad de la Costa
- Repository:
- REDICUC - Repositorio CUC
- Language:
- eng
- OAI Identifier:
- oai:repositorio.cuc.edu.co:11323/7788
- Online access:
- https://hdl.handle.net/11323/7788
https://doi.org/10.1016/j.procs.2020.03.101
https://repositorio.cuc.edu.co/
- Keywords:
- Supervised Classification
PAN 2018
Gender
Age
Random forest
- Rights
- openAccess
- License
- Attribution-NonCommercial-NoDerivatives 4.0 International
Summary: This paper presents a method for detecting an author’s profile along two dimensions: gender and age. The method is based on a set of dialogues written in two languages, English and Spanish, provided for the Author Profiling competition within the evaluation forum "Uncovering Plagiarism, Authorship, and Social Software Misuse" (PAN2018). Counts of lexical, semantic, and syntactic characteristics are used to build a two-phase classification system, which first classifies gender and then age. The results show that, with the amount of data available, it is possible to characterize both the age and gender of an author with an accuracy greater than 50%. However, these values could be improved with more information in the training data.
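The two-phase idea in the summary (classify gender first, then age) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the `features` function uses only a few simple lexical counts, the toy texts and age labels are invented, a dependency-free nearest-centroid classifier stands in for the random forest named in the keywords, and the choice to train one age model per predicted gender is only one possible way to chain the two phases.

```python
import re


def features(text):
    """Hypothetical count-based features; the paper's real feature set is not listed here."""
    tokens = re.findall(r"[a-záéíóúñü']+", text.lower())
    n = max(len(tokens), 1)  # avoid division by zero on empty input
    return [
        len(tokens),                          # number of word tokens
        len(set(tokens)) / n,                 # type/token ratio
        sum(len(t) for t in tokens) / n,      # mean word length
        text.count("!") + text.count("?"),    # emphatic punctuation count
    ]


class NearestCentroid:
    """Tiny stand-in classifier (NOT a random forest): picks the closest class mean."""

    def fit(self, X, y):
        groups = {}
        for row, label in zip(X, y):
            groups.setdefault(label, []).append(row)
        self.centroids = {
            label: [sum(col) / len(rows) for col in zip(*rows)]
            for label, rows in groups.items()
        }
        return self

    def predict_one(self, row):
        def sq_dist(centroid):
            return sum((a - b) ** 2 for a, b in zip(row, centroid))
        return min(self.centroids, key=lambda label: sq_dist(self.centroids[label]))


class TwoPhaseProfiler:
    """Phase 1 predicts gender; phase 2 predicts age with a model chosen by that gender."""

    def fit(self, texts, genders, ages):
        X = [features(t) for t in texts]
        self.gender_model = NearestCentroid().fit(X, genders)
        self.age_models = {}
        for g in set(genders):
            idx = [i for i, gi in enumerate(genders) if gi == g]
            self.age_models[g] = NearestCentroid().fit(
                [X[i] for i in idx], [ages[i] for i in idx]
            )
        return self

    def predict(self, text):
        x = features(text)
        gender = self.gender_model.predict_one(x)       # phase 1
        age = self.age_models[gender].predict_one(x)    # phase 2
        return gender, age


# Invented toy data, for illustration only.
texts = [
    "omg that was so fun!!! see you soon?",
    "match tonight, anyone watching?",
    "The quarterly report is attached for your review.",
    "Please confirm the delivery schedule by Friday.",
]
genders = ["female", "male", "female", "male"]
ages = ["18-24", "18-24", "25-34", "25-34"]

profiler = TwoPhaseProfiler().fit(texts, genders, ages)
print(profiler.predict("are you coming to the party tonight?!"))
```

With data this small the predictions carry no meaning; the sketch only shows how the output of the gender phase can select the model used in the age phase.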