A Web-Forum Free of Disguised Profanity by Means of Sequence Alignment

Profanity is the use of offensive, obscene or abusive words or expressions in public conversations. Digital media such as forums, blogs and social networks are now a major source of text conversations, and malicious users take advantage of their worldwide reach to diss...


Authors:
Mogollón, Christian
Rojas-Galeano, Sergio A.
Resource type:
Journal article
Publication date:
2016
Institution:
Pontificia Universidad Javeriana
Repository:
Repositorio Universidad Javeriana
Language:
eng
spa
OAI Identifier:
oai:repository.javeriana.edu.co:10554/25529
Online access:
http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811
http://hdl.handle.net/10554/25529
Keywords:
Rights
openAccess
License
Attribution-NonCommercial-NoDerivatives 4.0 International
id JAVERIANA2_6a176d62020939c97b0417d59d1f964a
oai_identifier_str oai:repository.javeriana.edu.co:10554/25529
network_acronym_str JAVERIANA2
network_name_str Repositorio Universidad Javeriana
repository_id_str
dc.title.spa.fl_str_mv A Web-Forum Free of Disguised Profanity by Means of Sequence Alignment
dc.creator.fl_str_mv Mogollón, Christian
Rojas-Galeano, Sergio A.
dc.contributor.author.none.fl_str_mv Mogollón, Christian
Rojas-Galeano, Sergio A.
description Profanity is the use of offensive, obscene or abusive words or expressions in public conversations. Digital media such as forums, blogs and social networks are now a major source of text conversations, and malicious users take advantage of their worldwide reach to disseminate profanity aimed at insulting or denigrating opinions, names or trademarks. Lexicon-based exact comparison is the most common filtering technique in these media; however, ingenious users disguise profanity by transliterating or masking the original word while still conveying its intended meaning (e.g. by writing piss as P!55 or p.i.s.s), thereby defeating the filter. Recent approaches to this problem, inspired by the sequence alignment methods of comparative genomics in bioinformatics, have shown promise in catching such disguises. Building upon those results, we have developed an experimental Web forum where user comments are screened for disguised profanity. In this paper we introduce the software (ForumForte) and briefly describe the technique and engineering behind it, together with some empirical evidence of its filtering performance. Our software is open-source under the New BSD License and is available at: http://tinyurl.com/ForumForte.
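The idea in the description, matching a lexicon word against a disguised token by sequence alignment, can be illustrated with a minimal sketch. This is not the ForumForte implementation: the look-alike table, the separator set, the scores and the threshold below are all assumptions chosen for illustration, and the alignment is a plain Smith-Waterman style local alignment rather than whatever variant the paper uses.

```python
# Hypothetical look-alike table (assumption, not taken from the paper):
# maps a disguising character to the letter it imitates.
LOOKALIKES = {"!": "i", "1": "i", "5": "s", "$": "s", "0": "o", "@": "a", "3": "e"}

# Hypothetical set of masking characters that may be skipped at no cost.
SEPARATORS = set(".-_* ")


def char_score(a, b):
    """Return +2 for an exact or look-alike match, -1 for a mismatch."""
    a = LOOKALIKES.get(a, a)
    b = LOOKALIKES.get(b, b)
    return 2 if a == b else -1


def alignment_score(word, text):
    """Best local alignment score of `word` against `text` (case-insensitive)."""
    word, text = word.lower(), text.lower()
    prev = [0] * (len(text) + 1)
    best = 0
    for i in range(1, len(word) + 1):
        curr = [0] * (len(text) + 1)
        for j in range(1, len(text) + 1):
            # Align word[i-1] with text[j-1].
            diag = prev[j - 1] + char_score(word[i - 1], text[j - 1])
            # Skip a character of the text: free for separators, -1 otherwise.
            skip_text = curr[j - 1] - (0 if text[j - 1] in SEPARATORS else 1)
            # Skip a character of the lexicon word: always costs 1.
            skip_word = prev[j] - 1
            curr[j] = max(0, diag, skip_text, skip_word)
            best = max(best, curr[j])
        prev = curr
    return best


def looks_like(word, text, threshold=0.8):
    """Flag `text` if its score reaches `threshold` of a perfect match (2 per letter)."""
    return alignment_score(word, text) >= threshold * 2 * len(word)


if __name__ == "__main__":
    for token in ["P!55", "p.i.s.s", "hello", "pistachio"]:
        print(token, looks_like("piss", token))
```

With these assumed scores, both disguises from the description (P!55 and p.i.s.s) reach the perfect score of 8 for the four-letter lexicon entry, while an exact lexicon comparison would miss them. The threshold trades false positives against false negatives and would need tuning on real forum data.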
publishDate 2016
dc.date.created.none.fl_str_mv 2016-06-20
dc.date.accessioned.none.fl_str_mv 2020-04-16T17:27:21Z
dc.date.available.none.fl_str_mv 2020-04-16T17:27:21Z
dc.type.coar.fl_str_mv http://purl.org/coar/resource_type/c_2df8fbb1
dc.type.hasversion.none.fl_str_mv http://purl.org/coar/version/c_970fb48d4fbd8a85
dc.type.local.spa.fl_str_mv Artículo de revista
dc.type.coar.none.fl_str_mv http://purl.org/coar/resource_type/c_6501
dc.type.driver.none.fl_str_mv info:eu-repo/semantics/article
dc.type.other.none.fl_str_mv Peer-reviewed Article
format http://purl.org/coar/resource_type/c_6501
dc.identifier.none.fl_str_mv http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811
10.11144/Javeriana.iyu20-2.wffd
dc.identifier.issn.none.fl_str_mv 2011-2769
0123-2126
dc.identifier.uri.none.fl_str_mv http://hdl.handle.net/10554/25529
url http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811
http://hdl.handle.net/10554/25529
identifier_str_mv 10.11144/Javeriana.iyu20-2.wffd
2011-2769
0123-2126
dc.language.iso.none.fl_str_mv eng
spa
language eng
spa
dc.relation.uri.none.fl_str_mv http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/13766
http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/18626
http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/18627
http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/18628
http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/18629
dc.relation.citationissue.eng.fl_str_mv Ingenieria y Universidad; Vol 20 No 2 (2016): July-December; 239-266
dc.relation.citationissue.spa.fl_str_mv Ingenieria y Universidad; Vol. 20 Núm. 2 (2016): Julio-Diciembre; 239-266
dc.rights.eng.fl_str_mv Copyright (c) 2016 Sergio A. Rojas-Galeano, Christian Mogollón
dc.rights.licence.*.fl_str_mv Atribución-NoComercial-SinDerivadas 4.0 Internacional
dc.rights.uri.eng.fl_str_mv http://creativecommons.org/licenses/by/4.0
dc.rights.accessrights.none.fl_str_mv info:eu-repo/semantics/openAccess
dc.rights.coar.spa.fl_str_mv http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv openAccess
dc.format.spa.fl_str_mv PDF
dc.format.mimetype.spa.fl_str_mv application/pdf
application/vnd.openxmlformats-officedocument.wordprocessingml.document
application/zip
application/zip
application/pdf
dc.publisher.eng.fl_str_mv Pontificia Universidad Javeriana
institution Pontificia Universidad Javeriana
repository.name.fl_str_mv Repositorio Institucional - Pontificia Universidad Javeriana
repository.mail.fl_str_mv repositorio@javeriana.edu.co
_version_ 1808389553373839360