Detection of Sociolinguistic Features in Digital Social Networks for the Detection of Communities

Authors:
Puertas, Edwin
Moreno-Sandoval, Luis Gabriel
Redondo, Javier
Alvarado‑Valencia, Jorge Andres
Pomares Quimbaya, Alexandra
Resource type:
Publication date:
2020
Institution:
Universidad Tecnológica de Bolívar
Repository:
Repositorio Institucional UTB
Language:
eng
OAI Identifier:
oai:repositorio.utb.edu.co:20.500.12585/10325
Online access:
https://hdl.handle.net/20.500.12585/10325
https://doi.org/10.1007/s12559-021-09818-9
Keywords:
Sociolinguistic
Community discovery
Natural language processing
Social networks
Community detection
Rights
openAccess
License
http://creativecommons.org/licenses/by-nc-nd/4.0/
Description
Summary: The emergence of digital social networks has transformed how society, social groups, and institutions communicate and express their opinions. Determining how language variations allow the detection of communities, together with the relevance of specific vocabulary in digital assemblages, could lead to a better understanding of their dynamics and social foundations, and thus to better communication policies and intervention where necessary. The vocabulary considered is the one proposed by the National Council of Accreditation of Colombia (Consejo Nacional de Acreditación, CNA) to determine the quality evaluation parameters for universities in Colombia. The approach presented in this paper aims to determine the semantic spaces (sociolinguistic features) shared by social groups in digital social networks. It comprises five layers based on Design Science Research, which are integrated with Natural Language Processing (NLP), Computational Linguistics (CL), and Artificial Intelligence (AI) techniques. The approach is validated through a case study in which the semantic values of a series of institutional Twitter accounts belonging to Colombian universities are analyzed in terms of the 12 quality factors established by the CNA. In addition, the topics and the sociolect used by different actors in the university communities are analyzed. The approach makes it possible to determine the sociolinguistic features of social groups in digital social networks. Its application allows detecting the words or concepts to which each actor of a social group (university) gives more importance in terms of vocabulary.