Scalable influence maximization in social networks

Influence maximization is the problem of selecting a subset of individuals in a social network such that they maximize the spread of an influence. It is desirable to generate an idea diffusion cascade by making these initial influencers adopt it first. In other words, using the word-of-mouth "p...

Full description

Autores:
Campo Martínez, Carlos Andrés
Tipo de recurso:
Fecha de publicación:
2012
Institución:
Universidad de los Andes
Repositorio:
Séneca: repositorio Uniandes
Idioma:
eng
OAI Identifier:
oai:repositorio.uniandes.edu.co:1992/12495
Acceso en línea:
http://hdl.handle.net/1992/12495
Palabra clave:
Redes sociales - Escalabilidad
Comunidades virtuales - Investigaciones
Algoritmos (Computadores) - Investigaciones
Ingeniería
Rights
openAccess
License
http://creativecommons.org/licenses/by-nc-nd/4.0/
Description
Summary:Influence maximization is the problem of selecting a subset of individuals in a social network such that they maximize the spread of an influence. It is desirable to generate an idea diffusion cascade by making these initial influencers adopt it first. In other words, using the word-of-mouth "power" of these initial individuals should allow us to maximize the spread of an idea over the network. The problem of finding the most influential nodes in a network is NP-hard. Kempe, Kleinberg, and Tardos (2003) has proved that a Greedy algorithm strategy obtains a solution that is provably within 63% of optimal for several cases; However it is computationally prohibitive to run this algorithm in large social networks. Finding scalable solutions for this problem is critical for enabling applications in viral social marketing among other topics like detection sensor location or trend monitoring. In this thesis, a new heuristic algorithm is presented for mining top-K influencers based on a local influence region probability spread score. The proposed algorithm is composed of: 1) a bounded variation of breadth-first search traversal algorithm for calculating the mentioned score of each individual; and 2) an iterative greedy select-and-update algorithm for choosing top influencers. Experiment results show that this new algorithm can in several cases perform better (in influence spread and computational efficiency) than previous Works.