A Web-Forum Free of Disguised Profanity by Means of Sequence Alignment
Profanity is the use of offensive, obscene or abusive vocables or expressions in public conversations. A big source of conversations in text format nowadays are digital media such as forums, blogs or social networks where malicious users are taking advantage of their ample worldwide coverage to diss...
- Autores:
- Tipo de recurso:
- article
- Fecha de publicación:
- 2016
- Institución:
- Pontificia Universidad Javeriana
- Repositorio:
- Repositorio Universidad Javeriana
- Idioma:
- eng
spa
- OAI Identifier:
- oai:repository.javeriana.edu.co:10554/25529
- Acceso en línea:
- http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811
http://hdl.handle.net/10554/25529
- Palabra clave:
- Rights
- openAccess
- License
- Copyright (c) 2016 Sergio A. Rojas-Galeano, Christian Mogollón
id |
JAVERIANA_6a176d62020939c97b0417d59d1f964a |
---|---|
oai_identifier_str |
oai:repository.javeriana.edu.co:10554/25529 |
network_acronym_str |
JAVERIANA |
network_name_str |
Repositorio Universidad Javeriana |
repository_id_str |
|
spelling |
A Web-Forum Free of Disguised Profanity by Means of Sequence AlignmentMogollón, ChristianRojas-Galeano, Sergio A.Profanity is the use of offensive, obscene or abusive vocables or expressions in public conversations. A big source of conversations in text format nowadays are digital media such as forums, blogs or social networks where malicious users are taking advantage of their ample worldwide coverage to disseminate undesired profanity aimed at insulting or denigrating opinions, names or trademarks. Lexicon-based exact comparisons are the most common filter technique within these media; however, ingenious users are disguising profanity using transliteration or masking of the original vocable while still conveying its intended semantic (e.g. by writing piss as P!55 or p.i.s.s), hence defeating the filter. Recent approaches to this problem inspired in the sequence alignment methods from comparative genomics in bioinformatics, have shown promise in preventing overlooking such guises. Building upon those results we have developed an experimental Web forum where user comments are screened against disguised profanity. In this paper we introduce the software (ForumForte) and describe briefly the technique and engineering behind it, as well as some empirical evidence of its filtering performance. Our software is open-source under the New BSD License and is available at: http://tinyurl.com/ForumForte.Pontificia Universidad Javeriana2020-04-16T17:27:21Z2020-04-16T17:27:21Z2016-06-20http://purl.org/coar/version/c_970fb48d4fbd8a85Artículo de revistahttp://purl.org/coar/resource_type/c_6501info:eu-repo/semantics/articlePeer-reviewed Articleinfo:eu-repo/semantics/publishedVersionPDFapplication/pdfapplication/vnd.openxmlformats-officedocument.wordprocessingml.documentapplication/zipapplication/zipapplication/pdfhttp://revistas.javeriana.edu.co/index.php/iyu/article/view/1481110.11144/Javeriana.iyu20-2.wffd2011-27690123-2126http://hdl.handle.net/10554/25529engspahttp://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/13766http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/18626http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/18627http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/18628http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/18629Ingenieria y Universidad; Vol 20 No 2 (2016): July-December; 239-266Ingenieria y Universidad; Vol. 20 Núm. 2 (2016): Julio-Diciembre; 239-266Copyright (c) 2016 Sergio A. Rojas-Galeano, Christian MogollónAtribución-NoComercial-SinDerivadas 4.0 Internacionalhttp://creativecommons.org/licenses/by/4.0info:eu-repo/semantics/openAccesshttp://purl.org/coar/access_right/c_abf2reponame:Repositorio Universidad Javerianainstname:Pontificia Universidad Javerianainstacron:Pontificia Universidad Javeriana2023-03-29T17:44:16Z |
dc.title.none.fl_str_mv |
A Web-Forum Free of Disguised Profanity by Means of Sequence Alignment |
title |
A Web-Forum Free of Disguised Profanity by Means of Sequence Alignment |
spellingShingle |
A Web-Forum Free of Disguised Profanity by Means of Sequence Alignment Mogollón, Christian |
title_short |
A Web-Forum Free of Disguised Profanity by Means of Sequence Alignment |
title_full |
A Web-Forum Free of Disguised Profanity by Means of Sequence Alignment |
title_fullStr |
A Web-Forum Free of Disguised Profanity by Means of Sequence Alignment |
title_full_unstemmed |
A Web-Forum Free of Disguised Profanity by Means of Sequence Alignment |
title_sort |
A Web-Forum Free of Disguised Profanity by Means of Sequence Alignment |
dc.creator.none.fl_str_mv |
Mogollón, Christian Rojas-Galeano, Sergio A. |
author |
Mogollón, Christian |
author_facet |
Mogollón, Christian Rojas-Galeano, Sergio A. |
author_role |
author |
author2 |
Rojas-Galeano, Sergio A. |
author2_role |
author |
description |
Profanity is the use of offensive, obscene or abusive vocables or expressions in public conversations. A big source of conversations in text format nowadays are digital media such as forums, blogs or social networks where malicious users are taking advantage of their ample worldwide coverage to disseminate undesired profanity aimed at insulting or denigrating opinions, names or trademarks. Lexicon-based exact comparisons are the most common filter technique within these media; however, ingenious users are disguising profanity using transliteration or masking of the original vocable while still conveying its intended semantic (e.g. by writing piss as P!55 or p.i.s.s), hence defeating the filter. Recent approaches to this problem inspired in the sequence alignment methods from comparative genomics in bioinformatics, have shown promise in preventing overlooking such guises. Building upon those results we have developed an experimental Web forum where user comments are screened against disguised profanity. In this paper we introduce the software (ForumForte) and describe briefly the technique and engineering behind it, as well as some empirical evidence of its filtering performance. Our software is open-source under the New BSD License and is available at: http://tinyurl.com/ForumForte. |
publishDate |
2016 |
dc.date.none.fl_str_mv |
2016-06-20 2020-04-16T17:27:21Z 2020-04-16T17:27:21Z |
dc.type.none.fl_str_mv |
http://purl.org/coar/version/c_970fb48d4fbd8a85 Artículo de revista http://purl.org/coar/resource_type/c_6501 info:eu-repo/semantics/article Peer-reviewed Article info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.none.fl_str_mv |
http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811 10.11144/Javeriana.iyu20-2.wffd 2011-2769 0123-2126 http://hdl.handle.net/10554/25529 |
url |
http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811 http://hdl.handle.net/10554/25529 |
identifier_str_mv |
10.11144/Javeriana.iyu20-2.wffd 2011-2769 0123-2126 |
dc.language.none.fl_str_mv |
eng spa |
language |
eng spa |
dc.relation.none.fl_str_mv |
http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/13766 http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/18626 http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/18627 http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/18628 http://revistas.javeriana.edu.co/index.php/iyu/article/view/14811/18629 Ingenieria y Universidad; Vol 20 No 2 (2016): July-December; 239-266 Ingenieria y Universidad; Vol. 20 Núm. 2 (2016): Julio-Diciembre; 239-266 |
dc.rights.none.fl_str_mv |
Copyright (c) 2016 Sergio A. Rojas-Galeano, Christian Mogollón Atribución-NoComercial-SinDerivadas 4.0 Internacional http://creativecommons.org/licenses/by/4.0 info:eu-repo/semantics/openAccess http://purl.org/coar/access_right/c_abf2 |
rights_invalid_str_mv |
Copyright (c) 2016 Sergio A. Rojas-Galeano, Christian Mogollón Atribución-NoComercial-SinDerivadas 4.0 Internacional http://creativecommons.org/licenses/by/4.0 http://purl.org/coar/access_right/c_abf2 |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
PDF application/pdf application/vnd.openxmlformats-officedocument.wordprocessingml.document application/zip application/zip application/pdf |
dc.publisher.none.fl_str_mv |
Pontificia Universidad Javeriana |
publisher.none.fl_str_mv |
Pontificia Universidad Javeriana |
dc.source.none.fl_str_mv |
reponame:Repositorio Universidad Javeriana instname:Pontificia Universidad Javeriana instacron:Pontificia Universidad Javeriana |
instname_str |
Pontificia Universidad Javeriana |
instacron_str |
Pontificia Universidad Javeriana |
institution |
Pontificia Universidad Javeriana |
reponame_str |
Repositorio Universidad Javeriana |
collection |
Repositorio Universidad Javeriana |
_version_ |
1803712877871235072 |