Revealing non-alphabetical guises of spam-trigger vocables
Unsolicited bulk email (spam) nowadays accounts for nearly 75% of daily email traffic, a figure that speaks strongly for the need of finding better protection mechanisms against its dissemination. A clever trick recently exploited by email spammers in order to circumvent textual-based filters, invol...
- Autores:
-
Rojas Galeano, Sergio Andres
- Tipo de recurso:
- Article of journal
- Fecha de publicación:
- 2013
- Institución:
- Universidad Nacional de Colombia
- Repositorio:
- Universidad Nacional de Colombia
- Idioma:
- spa
- OAI Identifier:
- oai:repositorio.unal.edu.co:unal/42641
- Acceso en línea:
- https://repositorio.unal.edu.co/handle/unal/42641
http://bdigital.unal.edu.co/32738/
- Palabra clave:
- Uncovering of spam vocables
approximate string matching algorithm
- Rights
- openAccess
- License
- Atribución-NoComercial 4.0 Internacional
id |
UNACIONAL2_83e0664d6e83d604609e931d30bb103b |
---|---|
oai_identifier_str |
oai:repositorio.unal.edu.co:unal/42641 |
network_acronym_str |
UNACIONAL2 |
network_name_str |
Universidad Nacional de Colombia |
repository_id_str |
|
spelling |
Atribución-NoComercial 4.0 InternacionalDerechos reservados - Universidad Nacional de Colombiahttp://creativecommons.org/licenses/by-nc/4.0/info:eu-repo/semantics/openAccesshttp://purl.org/coar/access_right/c_abf2Rojas Galeano, Sergio Andresdb56f59c-e0f5-419d-a95a-50df2598a01e3002019-06-28T11:01:41Z2019-06-28T11:01:41Z2013https://repositorio.unal.edu.co/handle/unal/42641http://bdigital.unal.edu.co/32738/Unsolicited bulk email (spam) nowadays accounts for nearly 75% of daily email traffic, a figure that speaks strongly for the need of finding better protection mechanisms against its dissemination. A clever trick recently exploited by email spammers in order to circumvent textual-based filters, involves obfuscation of black-listed words with visually equivalent text substitutions from non-alphabetic symbols, in such a way it still conveys the semantics of the original word to the human eye (e.g. masking viagra as v1@gr@ or as v-i-a-g-r-a). In this paper we discuss how a simple-yet-effective adaptation of a classical algorithm for string matching may meet this stylish challenge to effectively reveal the similarity between genuine spam-trigger terms with their disguised alpha-numeric variants.application/pdfspaUniversidad Nacional de Colombia Sede Medellínhttp://revistas.unal.edu.co/index.php/dyna/article/view/32319Universidad Nacional de Colombia Revistas electrónicas UN DynaDynaDyna; Vol. 80, núm. 182 (2013); 50-57 DYNA; Vol. 80, núm. 182 (2013); 50-57 2346-2183 0012-7353Rojas Galeano, Sergio Andres (2013) Revealing non-alphabetical guises of spam-trigger vocables. Dyna; Vol. 80, núm. 182 (2013); 50-57 DYNA; Vol. 80, núm. 182 (2013); 50-57 2346-2183 0012-7353 .Revealing non-alphabetical guises of spam-trigger vocablesArtículo de revistainfo:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501http://purl.org/coar/resource_type/c_2df8fbb1http://purl.org/coar/version/c_970fb48d4fbd8a85Texthttp://purl.org/redcol/resource_type/ARTUncovering of spam vocablesapproximate string matching algorithmORIGINAL32319-188326-1-PB.pdfapplication/pdf2035060https://repositorio.unal.edu.co/bitstream/unal/42641/1/32319-188326-1-PB.pdfc87830ed0ba7ca8b4ae342a9faf8d424MD5132319-119507-1-SP.docapplication/msword38400https://repositorio.unal.edu.co/bitstream/unal/42641/2/32319-119507-1-SP.docc2863541cfdbd89e2eb5be6cdecb8d43MD52THUMBNAIL32319-188326-1-PB.pdf.jpg32319-188326-1-PB.pdf.jpgGenerated Thumbnailimage/jpeg9493https://repositorio.unal.edu.co/bitstream/unal/42641/3/32319-188326-1-PB.pdf.jpg24e125ad59142a034f936dc4cdd0a515MD53unal/42641oai:repositorio.unal.edu.co:unal/426412023-02-08 23:05:06.661Repositorio Institucional Universidad Nacional de Colombiarepositorio_nal@unal.edu.co |
dc.title.spa.fl_str_mv |
Revealing non-alphabetical guises of spam-trigger vocables |
title |
Revealing non-alphabetical guises of spam-trigger vocables |
spellingShingle |
Revealing non-alphabetical guises of spam-trigger vocables Uncovering of spam vocables approximate string matching algorithm |
title_short |
Revealing non-alphabetical guises of spam-trigger vocables |
title_full |
Revealing non-alphabetical guises of spam-trigger vocables |
title_fullStr |
Revealing non-alphabetical guises of spam-trigger vocables |
title_full_unstemmed |
Revealing non-alphabetical guises of spam-trigger vocables |
title_sort |
Revealing non-alphabetical guises of spam-trigger vocables |
dc.creator.fl_str_mv |
Rojas Galeano, Sergio Andres |
dc.contributor.author.spa.fl_str_mv |
Rojas Galeano, Sergio Andres |
dc.subject.proposal.spa.fl_str_mv |
Uncovering of spam vocables approximate string matching algorithm |
topic |
Uncovering of spam vocables approximate string matching algorithm |
description |
Unsolicited bulk email (spam) nowadays accounts for nearly 75% of daily email traffic, a figure that speaks strongly for the need of finding better protection mechanisms against its dissemination. A clever trick recently exploited by email spammers in order to circumvent textual-based filters, involves obfuscation of black-listed words with visually equivalent text substitutions from non-alphabetic symbols, in such a way it still conveys the semantics of the original word to the human eye (e.g. masking viagra as v1@gr@ or as v-i-a-g-r-a). In this paper we discuss how a simple-yet-effective adaptation of a classical algorithm for string matching may meet this stylish challenge to effectively reveal the similarity between genuine spam-trigger terms with their disguised alpha-numeric variants. |
publishDate |
2013 |
dc.date.issued.spa.fl_str_mv |
2013 |
dc.date.accessioned.spa.fl_str_mv |
2019-06-28T11:01:41Z |
dc.date.available.spa.fl_str_mv |
2019-06-28T11:01:41Z |
dc.type.spa.fl_str_mv |
Artículo de revista |
dc.type.coar.fl_str_mv |
http://purl.org/coar/resource_type/c_2df8fbb1 |
dc.type.driver.spa.fl_str_mv |
info:eu-repo/semantics/article |
dc.type.version.spa.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.coar.spa.fl_str_mv |
http://purl.org/coar/resource_type/c_6501 |
dc.type.coarversion.spa.fl_str_mv |
http://purl.org/coar/version/c_970fb48d4fbd8a85 |
dc.type.content.spa.fl_str_mv |
Text |
dc.type.redcol.spa.fl_str_mv |
http://purl.org/redcol/resource_type/ART |
format |
http://purl.org/coar/resource_type/c_6501 |
status_str |
publishedVersion |
dc.identifier.uri.none.fl_str_mv |
https://repositorio.unal.edu.co/handle/unal/42641 |
dc.identifier.eprints.spa.fl_str_mv |
http://bdigital.unal.edu.co/32738/ |
url |
https://repositorio.unal.edu.co/handle/unal/42641 http://bdigital.unal.edu.co/32738/ |
dc.language.iso.spa.fl_str_mv |
spa |
language |
spa |
dc.relation.spa.fl_str_mv |
http://revistas.unal.edu.co/index.php/dyna/article/view/32319 |
dc.relation.ispartof.spa.fl_str_mv |
Universidad Nacional de Colombia Revistas electrónicas UN Dyna Dyna |
dc.relation.ispartofseries.none.fl_str_mv |
Dyna; Vol. 80, núm. 182 (2013); 50-57 DYNA; Vol. 80, núm. 182 (2013); 50-57 2346-2183 0012-7353 |
dc.relation.references.spa.fl_str_mv |
Rojas Galeano, Sergio Andres (2013) Revealing non-alphabetical guises of spam-trigger vocables. Dyna; Vol. 80, núm. 182 (2013); 50-57 DYNA; Vol. 80, núm. 182 (2013); 50-57 2346-2183 0012-7353 . |
dc.rights.spa.fl_str_mv |
Derechos reservados - Universidad Nacional de Colombia |
dc.rights.coar.fl_str_mv |
http://purl.org/coar/access_right/c_abf2 |
dc.rights.license.spa.fl_str_mv |
Atribución-NoComercial 4.0 Internacional |
dc.rights.uri.spa.fl_str_mv |
http://creativecommons.org/licenses/by-nc/4.0/ |
dc.rights.accessrights.spa.fl_str_mv |
info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Atribución-NoComercial 4.0 Internacional Derechos reservados - Universidad Nacional de Colombia http://creativecommons.org/licenses/by-nc/4.0/ http://purl.org/coar/access_right/c_abf2 |
eu_rights_str_mv |
openAccess |
dc.format.mimetype.spa.fl_str_mv |
application/pdf |
dc.publisher.spa.fl_str_mv |
Universidad Nacional de Colombia Sede Medellín |
institution |
Universidad Nacional de Colombia |
bitstream.url.fl_str_mv |
https://repositorio.unal.edu.co/bitstream/unal/42641/1/32319-188326-1-PB.pdf https://repositorio.unal.edu.co/bitstream/unal/42641/2/32319-119507-1-SP.doc https://repositorio.unal.edu.co/bitstream/unal/42641/3/32319-188326-1-PB.pdf.jpg |
bitstream.checksum.fl_str_mv |
c87830ed0ba7ca8b4ae342a9faf8d424 c2863541cfdbd89e2eb5be6cdecb8d43 24e125ad59142a034f936dc4cdd0a515 |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 |
repository.name.fl_str_mv |
Repositorio Institucional Universidad Nacional de Colombia |
repository.mail.fl_str_mv |
repositorio_nal@unal.edu.co |
_version_ |
1814089744199450624 |