Privacy-enabled scalable recommender systems

Electronic content is ubiquitous in our daily lives. Several factors such as the development of Web 2.0 technologies, the increased access to mobile devices and the deployment of mobile networks has undoubtedly augmented the amount of information easily available to users. Given the limited attentio...

Full description

Autores:
Moreno Barbosa, Andrés Darío
Tipo de recurso:
Doctoral thesis
Fecha de publicación:
2015
Institución:
Universidad de los Andes
Repositorio:
Séneca: repositorio Uniandes
Idioma:
eng
OAI Identifier:
oai:repositorio.uniandes.edu.co:1992/7835
Acceso en línea:
http://hdl.handle.net/1992/7835
Palabra clave:
Sistemas de recomendación - Investigaciones
Recuperación de información - Investigaciones
Redes sociales en línea - Investigaciones
Seguridad en computadores - Investigaciones
Privacidad de los datos - Investigaciones
Ingeniería
Rights
openAccess
License
https://repositorio.uniandes.edu.co/static/pdf/aceptacion_uso_es.pdf
id UNIANDES2_bc4a6928bbf3743c6f8009f80aa2f2a1
oai_identifier_str oai:repositorio.uniandes.edu.co:1992/7835
network_acronym_str UNIANDES2
network_name_str Séneca: repositorio Uniandes
repository_id_str
dc.title.es_CO.fl_str_mv Privacy-enabled scalable recommender systems
title Privacy-enabled scalable recommender systems
spellingShingle Privacy-enabled scalable recommender systems
Sistemas de recomendación - Investigaciones
Recuperación de información - Investigaciones
Redes sociales en línea - Investigaciones
Seguridad en computadores - Investigaciones
Privacidad de los datos - Investigaciones
Ingeniería
title_short Privacy-enabled scalable recommender systems
title_full Privacy-enabled scalable recommender systems
title_fullStr Privacy-enabled scalable recommender systems
title_full_unstemmed Privacy-enabled scalable recommender systems
title_sort Privacy-enabled scalable recommender systems
dc.creator.fl_str_mv Moreno Barbosa, Andrés Darío
dc.contributor.advisor.none.fl_str_mv Castro Barrera, Harold Enrique
Riveill, Michel
Jiménez Guarín, Claudia Lucía
Masseglia, Florent
dc.contributor.author.none.fl_str_mv Moreno Barbosa, Andrés Darío
dc.subject.keyword.es_CO.fl_str_mv Sistemas de recomendación - Investigaciones
Recuperación de información - Investigaciones
Redes sociales en línea - Investigaciones
Seguridad en computadores - Investigaciones
Privacidad de los datos - Investigaciones
topic Sistemas de recomendación - Investigaciones
Recuperación de información - Investigaciones
Redes sociales en línea - Investigaciones
Seguridad en computadores - Investigaciones
Privacidad de los datos - Investigaciones
Ingeniería
dc.subject.themes.none.fl_str_mv Ingeniería
description Electronic content is ubiquitous in our daily lives. Several factors such as the development of Web 2.0 technologies, the increased access to mobile devices and the deployment of mobile networks has undoubtedly augmented the amount of information easily available to users. Given the limited attention span of the user and the extensiveness of the available streams of information ready to be consumed, automatic systems must be available for the user to prioritize, suggest or screen content suitable for the user interests and situation. One of the most popular initiatives created to solve the information overload problem are Recommender Systems [Adomavicius 2005]. Recommender Systems are information filtering systems that use the historical information about the user (what the user has considered relevant or irrelevant on the past, among other information) to build and accurate representation of the user's interests that is used to predict the relevance of a large collection of available items for a specific user. Recommendation systems are used by several online retailers, online content streaming services and social networking sites to improve the user's experience of their services by automatically filtering their content or offers of items to the ones most likely to interest the user. Generally speaking, recommender systems can be classified into two categories: Content Based and Collaborative Filtering. The former category relies on the definition of explicit features that describe the item domain and assigns them weights to describe the affinity between the feature and the item. For example in the movie domain, items can be described by features such as the genre to which they belong, the director, writer and actors that take part in the movie. On the other hand Collaborative Filtering is content-agnostic and relies on correlations between users and items based on the historical consumption patterns between the users and the items. It has been shown generally that Collaborative Filtering methods present better results than Content-Based [Pil'aszy 2009], however due to inherent shortcomings of single approaches, a better predictive performance is achieved by developing a model that integrates different paradigms (Hybrid approaches). To keep their users satisfied, personalization services that operate recommendation methods should present relevant recommendations even when the number of users, items and user-item interactions in the system increase
publishDate 2015
dc.date.issued.none.fl_str_mv 2015
dc.date.accessioned.none.fl_str_mv 2018-09-27T16:39:37Z
dc.date.available.none.fl_str_mv 2018-09-27T16:39:37Z
dc.type.spa.fl_str_mv Trabajo de grado - Doctorado
dc.type.coarversion.fl_str_mv http://purl.org/coar/version/c_970fb48d4fbd8a85
dc.type.driver.spa.fl_str_mv info:eu-repo/semantics/doctoralThesis
dc.type.coar.spa.fl_str_mv http://purl.org/coar/resource_type/c_db06
dc.type.content.spa.fl_str_mv Text
dc.type.redcol.spa.fl_str_mv http://purl.org/redcol/resource_type/TD
format http://purl.org/coar/resource_type/c_db06
dc.identifier.uri.none.fl_str_mv http://hdl.handle.net/1992/7835
dc.identifier.doi.none.fl_str_mv 10.57784/1992/7835
dc.identifier.pdf.none.fl_str_mv u703737.pdf
dc.identifier.instname.spa.fl_str_mv instname:Universidad de los Andes
dc.identifier.reponame.spa.fl_str_mv reponame:Repositorio Institucional Séneca
dc.identifier.repourl.spa.fl_str_mv repourl:https://repositorio.uniandes.edu.co/
url http://hdl.handle.net/1992/7835
identifier_str_mv 10.57784/1992/7835
u703737.pdf
instname:Universidad de los Andes
reponame:Repositorio Institucional Séneca
repourl:https://repositorio.uniandes.edu.co/
dc.language.iso.es_CO.fl_str_mv eng
language eng
dc.rights.uri.*.fl_str_mv https://repositorio.uniandes.edu.co/static/pdf/aceptacion_uso_es.pdf
dc.rights.accessrights.spa.fl_str_mv info:eu-repo/semantics/openAccess
dc.rights.coar.spa.fl_str_mv http://purl.org/coar/access_right/c_abf2
rights_invalid_str_mv https://repositorio.uniandes.edu.co/static/pdf/aceptacion_uso_es.pdf
http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv openAccess
dc.format.extent.es_CO.fl_str_mv 122 hojas
dc.format.mimetype.es_CO.fl_str_mv application/pdf
dc.publisher.es_CO.fl_str_mv Uniandes
dc.publisher.program.es_CO.fl_str_mv Doctorado en Ingeniería
dc.publisher.faculty.es_CO.fl_str_mv Facultad de Ingeniería
dc.source.es_CO.fl_str_mv instname:Universidad de los Andes
reponame:Repositorio Institucional Séneca
instname_str Universidad de los Andes
institution Universidad de los Andes
reponame_str Repositorio Institucional Séneca
collection Repositorio Institucional Séneca
bitstream.url.fl_str_mv https://repositorio.uniandes.edu.co/bitstreams/0f092453-b6a6-40cd-b0f1-293f9d7a244a/download
https://repositorio.uniandes.edu.co/bitstreams/04de4dbd-15f6-49c2-affb-c8453546b56b/download
https://repositorio.uniandes.edu.co/bitstreams/08c4dec0-7a79-451a-8069-8203eb6b0e60/download
bitstream.checksum.fl_str_mv 5e4602b9f8d5570da658fc5546971316
e755bd1afc49364bee01ee14e34d4a3d
455b1d25faa658ff4889705b757da88e
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
repository.name.fl_str_mv Repositorio institucional Séneca
repository.mail.fl_str_mv adminrepositorio@uniandes.edu.co
_version_ 1812134023371161600
spelling Al consultar y hacer uso de este recurso, está aceptando las condiciones de uso establecidas por los autores.https://repositorio.uniandes.edu.co/static/pdf/aceptacion_uso_es.pdfinfo:eu-repo/semantics/openAccesshttp://purl.org/coar/access_right/c_abf2Castro Barrera, Harold Enriquef1b9a768-386d-4096-b133-6a56626118e8400Riveill, Michel10e5eccc-abf6-471f-8626-246c0caad675500Jiménez Guarín, Claudia Lucía9d5444d2-10f7-4701-bba6-d33ae54cba69400Masseglia, Florent91836313-eafb-4a41-ac29-10de9ec51808500Moreno Barbosa, Andrés Daríodc73c7d4-83ed-44e6-b08b-75035edc09575002018-09-27T16:39:37Z2018-09-27T16:39:37Z2015http://hdl.handle.net/1992/783510.57784/1992/7835u703737.pdfinstname:Universidad de los Andesreponame:Repositorio Institucional Sénecarepourl:https://repositorio.uniandes.edu.co/Electronic content is ubiquitous in our daily lives. Several factors such as the development of Web 2.0 technologies, the increased access to mobile devices and the deployment of mobile networks has undoubtedly augmented the amount of information easily available to users. Given the limited attention span of the user and the extensiveness of the available streams of information ready to be consumed, automatic systems must be available for the user to prioritize, suggest or screen content suitable for the user interests and situation. One of the most popular initiatives created to solve the information overload problem are Recommender Systems [Adomavicius 2005]. Recommender Systems are information filtering systems that use the historical information about the user (what the user has considered relevant or irrelevant on the past, among other information) to build and accurate representation of the user's interests that is used to predict the relevance of a large collection of available items for a specific user. Recommendation systems are used by several online retailers, online content streaming services and social networking sites to improve the user's experience of their services by automatically filtering their content or offers of items to the ones most likely to interest the user. Generally speaking, recommender systems can be classified into two categories: Content Based and Collaborative Filtering. The former category relies on the definition of explicit features that describe the item domain and assigns them weights to describe the affinity between the feature and the item. For example in the movie domain, items can be described by features such as the genre to which they belong, the director, writer and actors that take part in the movie. On the other hand Collaborative Filtering is content-agnostic and relies on correlations between users and items based on the historical consumption patterns between the users and the items. It has been shown generally that Collaborative Filtering methods present better results than Content-Based [Pil'aszy 2009], however due to inherent shortcomings of single approaches, a better predictive performance is achieved by developing a model that integrates different paradigms (Hybrid approaches). To keep their users satisfied, personalization services that operate recommendation methods should present relevant recommendations even when the number of users, items and user-item interactions in the system increaseAs it will be shown in Chapter 2, the computational complexity of Collaborative Filtering methods for keeping a user profile up to date depends directly on the number of users and items available in the system and the amount of registered user-item interactions. Current large scale personalization systems such as Netflix [Netflix 2013] (a content streaming service of movies and series) has an estimated number of users of 44 million while the number of available items to watch fluctuates around 13000 titles. The number explicit user-item interactions (assigned ratings) is estimated at 5 billion ratings [Schelter 2013]. To account for these large numbers, Recommender System's adopters employ the support of cloud computing frameworks. Recommender Systems are now highly scalable solutions that are able to: (1) gather and store as much information as possible about users and items supported by the current availability of cheap storage, (2) apply computational intensive algorithms to train recommendation models that scale up to the size of the collected data and (3) use the trained models to adequately answer to a large amount of recommendation requests. However, as it will be shown in Chapter 3 the current architecture of data gathering and processing of recommender systems places a conflict with users. Following the definition given by [Foner 1999], privacy can be defined as the ability of an individual to protect the disclosure of personal information to third parties who are not intended recipients of the information. While users trust recommender engines to use their information for filtering or personalization purposes, it will be argued that the centralized consolidation of information increases the likelihood of misuse of the user information, misplacing user trust. A question that arises after this claim is: why information gathered and processed by a recommendation system is privacy-sensitive? After all, due to the availability of personal micro blogging and social networks, users seem avid to share their information with others. Opinions given by users on items reveal at a great extend the personality of the user, opinions on items might reveal political inclination, sexual orientation, physical or mental treatments the user is taking or religious inclination of a particular user. An iconic case of how important are personal opinions on items in recommender systems came with the Doe Vs Netflix class action lawsuit [Singel 2009], when ratings from users were de-anonymized after being made public by the Netflix Prize Competition [Netflix 2009]. The plaintiff claimed that: "...information tending to identify or permit inference of her sexual orientation constitutes sensitive and personal information. She believes that, were her sexual orientation public knowledge, it would negatively affect her ability to pursue her livelihood and support her family and would hinder her and her childrens? ability to live peaceful lives within ...(her)...community." The aim of privacy-enabled recommendation systems is to give users tools to protect their privacy and keep the choice to themselves if they want to reveal their informationDoctor en IngenieríaDoctorado122 hojasapplication/pdfengUniandesDoctorado en IngenieríaFacultad de Ingenieríainstname:Universidad de los Andesreponame:Repositorio Institucional SénecaPrivacy-enabled scalable recommender systemsTrabajo de grado - Doctoradoinfo:eu-repo/semantics/doctoralThesishttp://purl.org/coar/resource_type/c_db06http://purl.org/coar/version/c_970fb48d4fbd8a85Texthttp://purl.org/redcol/resource_type/TDSistemas de recomendación - InvestigacionesRecuperación de información - InvestigacionesRedes sociales en línea - InvestigacionesSeguridad en computadores - InvestigacionesPrivacidad de los datos - InvestigacionesIngenieríaPublicationTHUMBNAILu703737.pdf.jpgu703737.pdf.jpgIM Thumbnailimage/jpeg12771https://repositorio.uniandes.edu.co/bitstreams/0f092453-b6a6-40cd-b0f1-293f9d7a244a/download5e4602b9f8d5570da658fc5546971316MD55ORIGINALu703737.pdfapplication/pdf2187753https://repositorio.uniandes.edu.co/bitstreams/04de4dbd-15f6-49c2-affb-c8453546b56b/downloade755bd1afc49364bee01ee14e34d4a3dMD51TEXTu703737.pdf.txtu703737.pdf.txtExtracted texttext/plain290599https://repositorio.uniandes.edu.co/bitstreams/08c4dec0-7a79-451a-8069-8203eb6b0e60/download455b1d25faa658ff4889705b757da88eMD541992/7835oai:repositorio.uniandes.edu.co:1992/78352024-08-26 15:25:37.363https://repositorio.uniandes.edu.co/static/pdf/aceptacion_uso_es.pdfopen.accesshttps://repositorio.uniandes.edu.coRepositorio institucional Sénecaadminrepositorio@uniandes.edu.co