Revealing non-alphabetical guises of spam-trigger vocables

Unsolicited bulk email (spam) now accounts for nearly 75% of daily email traffic, a figure that underscores the need for better protection mechanisms against its dissemination. A trick recently exploited by spammers to circumvent text-based filters invol...


Authors:
Rojas Galeano, Sergio Andres
Resource type:
Journal article
Publication date:
2013
Institution:
Universidad Nacional de Colombia
Repository:
Universidad Nacional de Colombia
Language:
spa
OAI Identifier:
oai:repositorio.unal.edu.co:unal/42641
Online access:
https://repositorio.unal.edu.co/handle/unal/42641
http://bdigital.unal.edu.co/32738/
Keywords:
Uncovering of spam vocables
approximate string matching algorithm
Rights
openAccess
License
Attribution-NonCommercial 4.0 International
id UNACIONAL2_83e0664d6e83d604609e931d30bb103b
oai_identifier_str oai:repositorio.unal.edu.co:unal/42641
network_acronym_str UNACIONAL2
network_name_str Universidad Nacional de Colombia
repository_id_str
dc.title.spa.fl_str_mv Revealing non-alphabetical guises of spam-trigger vocables
title Revealing non-alphabetical guises of spam-trigger vocables
spellingShingle Revealing non-alphabetical guises of spam-trigger vocables
Uncovering of spam vocables
approximate string matching algorithm
title_short Revealing non-alphabetical guises of spam-trigger vocables
title_full Revealing non-alphabetical guises of spam-trigger vocables
title_fullStr Revealing non-alphabetical guises of spam-trigger vocables
title_full_unstemmed Revealing non-alphabetical guises of spam-trigger vocables
title_sort Revealing non-alphabetical guises of spam-trigger vocables
dc.creator.fl_str_mv Rojas Galeano, Sergio Andres
dc.contributor.author.spa.fl_str_mv Rojas Galeano, Sergio Andres
dc.subject.proposal.spa.fl_str_mv Uncovering of spam vocables
approximate string matching algorithm
topic Uncovering of spam vocables
approximate string matching algorithm
description Unsolicited bulk email (spam) now accounts for nearly 75% of daily email traffic, a figure that underscores the need for better protection mechanisms against its dissemination. A trick recently exploited by spammers to circumvent text-based filters is to obfuscate blacklisted words with visually equivalent substitutions drawn from non-alphabetic symbols, so that the disguised text still conveys the original word to the human eye (e.g. masking viagra as v1@gr@ or as v-i-a-g-r-a). In this paper we discuss how a simple yet effective adaptation of a classical string-matching algorithm can meet this challenge and reveal the similarity between genuine spam-trigger terms and their disguised alphanumeric variants.
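The abstract does not say which classical algorithm is adapted or how. As an illustration only, the sketch below assumes a Levenshtein-style edit distance in which look-alike substitutions and separator characters cost nothing. The LOOKALIKES table, the SEPARATORS set, and the function names are hypothetical and are not taken from the paper.

```python
# Illustrative sketch only: NOT the algorithm from the paper.
# Assumption: a Levenshtein-style edit distance where look-alike symbols
# and separator characters are free, so disguised spellings score 0.

# Hypothetical look-alike table (plain letter -> symbols that may replace it).
LOOKALIKES = {"a": "@4", "i": "1!|", "e": "3", "o": "0", "s": "$5"}
# Hypothetical separators a spammer might interleave between letters.
SEPARATORS = set("-_.* ")


def sub_cost(plain, seen):
    """Cost of reading the token character `seen` as the term letter `plain`."""
    if plain == seen or seen in LOOKALIKES.get(plain, ""):
        return 0
    return 1


def disguise_distance(term, token):
    """Edit distance from a blacklisted term to a possibly disguised token."""
    term, token = term.lower(), token.lower()
    # First row: cost of matching an empty term prefix against token prefixes.
    prev = [0]
    for ch in token:
        prev.append(prev[-1] + (0 if ch in SEPARATORS else 1))
    for t in term:
        cur = [prev[0] + 1]  # term letter with no token character to match
        for j, ch in enumerate(token, start=1):
            extra = cur[j - 1] + (0 if ch in SEPARATORS else 1)  # extra token char
            missing = prev[j] + 1                                # term letter absent
            swap = prev[j - 1] + sub_cost(t, ch)                 # match or substitute
            cur.append(min(extra, missing, swap))
        prev = cur
    return prev[-1]


if __name__ == "__main__":
    for candidate in ["v1@gr@", "v-i-a-g-r-a", "visa"]:
        print(candidate, disguise_distance("viagra", candidate))
```

Under these assumptions, the two disguised forms quoted in the abstract sit at distance 0 from viagra, while an unrelated word such as visa stays at a positive distance. A filter built this way would flag tokens whose distance to a blacklisted term falls below a chosen threshold.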
publishDate 2013
dc.date.issued.spa.fl_str_mv 2013
dc.date.accessioned.spa.fl_str_mv 2019-06-28T11:01:41Z
dc.date.available.spa.fl_str_mv 2019-06-28T11:01:41Z
dc.type.spa.fl_str_mv Journal article
dc.type.coar.fl_str_mv http://purl.org/coar/resource_type/c_2df8fbb1
dc.type.driver.spa.fl_str_mv info:eu-repo/semantics/article
dc.type.version.spa.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.coar.spa.fl_str_mv http://purl.org/coar/resource_type/c_6501
dc.type.coarversion.spa.fl_str_mv http://purl.org/coar/version/c_970fb48d4fbd8a85
dc.type.content.spa.fl_str_mv Text
dc.type.redcol.spa.fl_str_mv http://purl.org/redcol/resource_type/ART
format http://purl.org/coar/resource_type/c_6501
status_str publishedVersion
dc.identifier.uri.none.fl_str_mv https://repositorio.unal.edu.co/handle/unal/42641
dc.identifier.eprints.spa.fl_str_mv http://bdigital.unal.edu.co/32738/
url https://repositorio.unal.edu.co/handle/unal/42641
http://bdigital.unal.edu.co/32738/
dc.language.iso.spa.fl_str_mv spa
language spa
dc.relation.spa.fl_str_mv http://revistas.unal.edu.co/index.php/dyna/article/view/32319
dc.relation.ispartof.spa.fl_str_mv Universidad Nacional de Colombia Revistas electrónicas UN Dyna
Dyna
dc.relation.ispartofseries.none.fl_str_mv Dyna; Vol. 80, No. 182 (2013); 50-57 DYNA; Vol. 80, No. 182 (2013); 50-57 2346-2183 0012-7353
dc.relation.references.spa.fl_str_mv Rojas Galeano, Sergio Andres (2013) Revealing non-alphabetical guises of spam-trigger vocables. Dyna; Vol. 80, No. 182 (2013); 50-57 DYNA; Vol. 80, No. 182 (2013); 50-57 2346-2183 0012-7353.
dc.rights.spa.fl_str_mv All rights reserved - Universidad Nacional de Colombia
dc.rights.coar.fl_str_mv http://purl.org/coar/access_right/c_abf2
dc.rights.license.spa.fl_str_mv Attribution-NonCommercial 4.0 International
dc.rights.uri.spa.fl_str_mv http://creativecommons.org/licenses/by-nc/4.0/
dc.rights.accessrights.spa.fl_str_mv info:eu-repo/semantics/openAccess
rights_invalid_str_mv Attribution-NonCommercial 4.0 International
All rights reserved - Universidad Nacional de Colombia
http://creativecommons.org/licenses/by-nc/4.0/
http://purl.org/coar/access_right/c_abf2
eu_rights_str_mv openAccess
dc.format.mimetype.spa.fl_str_mv application/pdf
dc.publisher.spa.fl_str_mv Universidad Nacional de Colombia Sede Medellín
institution Universidad Nacional de Colombia
bitstream.url.fl_str_mv https://repositorio.unal.edu.co/bitstream/unal/42641/1/32319-188326-1-PB.pdf
https://repositorio.unal.edu.co/bitstream/unal/42641/2/32319-119507-1-SP.doc
https://repositorio.unal.edu.co/bitstream/unal/42641/3/32319-188326-1-PB.pdf.jpg
bitstream.checksum.fl_str_mv c87830ed0ba7ca8b4ae342a9faf8d424
c2863541cfdbd89e2eb5be6cdecb8d43
24e125ad59142a034f936dc4cdd0a515
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
repository.name.fl_str_mv Repositorio Institucional Universidad Nacional de Colombia
repository.mail.fl_str_mv repositorio_nal@unal.edu.co
_version_ 1814089744199450624