Prototipo de sistema de filtrado por contenido para la diseminación de información contenida en la web

El reciente y dramático crecimiento del Internet, es un claro signo de que la computación está entrando en una nueva era. Por esto es necesario empezar a reconocer nuevos conceptos como el filtrado y recuperación de información que nos permiten mostrar los documentos más relevantes de acuerdo con lo...

Full description

Autores:: Amaya Díaz, Javier Enrique
Cañate Celedón, Jair José
Carvajal Pineda, Carlos Fernando

Tipo de recurso:: Trabajo de grado de pregrado

Fecha de publicación:: 2002

Institución:: Universidad Autónoma de Bucaramanga - UNAB

Repositorio:: Repositorio UNAB

Idioma:: spa

id	UNAB2_37eb363d3539a4eda1d8ce591a9b09f8
oai_identifier_str	oai:repository.unab.edu.co:20.500.12749/27025
network_acronym_str	UNAB2
network_name_str	Repositorio UNAB
repository_id_str
dc.title.spa.fl_str_mv	Prototipo de sistema de filtrado por contenido para la diseminación de información contenida en la web
dc.title.translated.spa.fl_str_mv	Prototype of content filtering system for the dissemination of information contained on the web
title	Prototipo de sistema de filtrado por contenido para la diseminación de información contenida en la web
spellingShingle	Prototipo de sistema de filtrado por contenido para la diseminación de información contenida en la web Systems engineer Technological innovations Selective dissemination Storage systems Information retrieval Information retrieval Artificial intelligence Neural networks (Computer science) Ingeniería de sistemas Innovaciones tecnológicas Recuperación de información Inteligencia artificial Redes neuronales (Computadores) Diseminación selectiva Sistemas de almacenamiento Recuperación de información
title_short	Prototipo de sistema de filtrado por contenido para la diseminación de información contenida en la web
title_full	Prototipo de sistema de filtrado por contenido para la diseminación de información contenida en la web
title_fullStr	Prototipo de sistema de filtrado por contenido para la diseminación de información contenida en la web
title_full_unstemmed	Prototipo de sistema de filtrado por contenido para la diseminación de información contenida en la web
title_sort	Prototipo de sistema de filtrado por contenido para la diseminación de información contenida en la web
dc.creator.fl_str_mv	Amaya Díaz, Javier Enrique Cañate Celedón, Jair José Carvajal Pineda, Carlos Fernando
dc.contributor.advisor.none.fl_str_mv	Pérez Alcázar, José de Jesús
dc.contributor.author.none.fl_str_mv	Amaya Díaz, Javier Enrique Cañate Celedón, Jair José Carvajal Pineda, Carlos Fernando
dc.contributor.cvlac.spa.fl_str_mv	Amaya Díaz, Javier Enrique [0000164326]
dc.subject.keywords.spa.fl_str_mv	Systems engineer Technological innovations Selective dissemination Storage systems Information retrieval Information retrieval Artificial intelligence Neural networks (Computer science)
topic	Systems engineer Technological innovations Selective dissemination Storage systems Information retrieval Information retrieval Artificial intelligence Neural networks (Computer science) Ingeniería de sistemas Innovaciones tecnológicas Recuperación de información Inteligencia artificial Redes neuronales (Computadores) Diseminación selectiva Sistemas de almacenamiento Recuperación de información
dc.subject.lemb.spa.fl_str_mv	Ingeniería de sistemas Innovaciones tecnológicas Recuperación de información Inteligencia artificial Redes neuronales (Computadores)
dc.subject.proposal.spa.fl_str_mv	Diseminación selectiva Sistemas de almacenamiento Recuperación de información
description	El reciente y dramático crecimiento del Internet, es un claro signo de que la computación está entrando en una nueva era. Por esto es necesario empezar a reconocer nuevos conceptos como el filtrado y recuperación de información que nos permiten mostrar los documentos más relevantes de acuerdo con los perfiles de sus usuarios. Existen diversos modelos para dicha labor, entre los que se encuentran los modelos clásicos como el modelo Booleano y Vectorial, los cuales tiene un formalismo simple porque la relevancia de los documentos recuperados se basa simplemente en la igualación parcial de los términos indexados en los documentos y las consultas. Otros modelos como el LSI, (indexación semántico latente), toma un paso adelante y además de trabajar con términos indexados, trabaja con “conceptos”, es decir, recupera documentos cuyos términos indexados no se encuentran en la consulta del usuario pero que también son relevantes. El modelo de redes neuronales tiene una función similar pero trabaja algoritmos de aprendizaje.
publishDate	2002
dc.date.issued.none.fl_str_mv	2002-01-20
dc.date.accessioned.none.fl_str_mv	2024-10-21T14:36:13Z
dc.date.available.none.fl_str_mv	2024-10-21T14:36:13Z
dc.type.driver.none.fl_str_mv	info:eu-repo/semantics/bachelorThesis
dc.type.local.spa.fl_str_mv	Trabajo de Grado
dc.type.coar.none.fl_str_mv	http://purl.org/coar/resource_type/c_7a1f
dc.type.hasversion.none.fl_str_mv	info:eu-repo/semantics/acceptedVersion
dc.type.redcol.none.fl_str_mv	http://purl.org/redcol/resource_type/TP
format	http://purl.org/coar/resource_type/c_7a1f
status_str	acceptedVersion
dc.identifier.uri.none.fl_str_mv	http://hdl.handle.net/20.500.12749/27025
dc.identifier.instname.spa.fl_str_mv	instname:Universidad Autónoma de Bucaramanga - UNAB
dc.identifier.reponame.spa.fl_str_mv	reponame:Repositorio Institucional UNAB
dc.identifier.repourl.spa.fl_str_mv	repourl:https://repository.unab.edu.co
url	http://hdl.handle.net/20.500.12749/27025
identifier_str_mv	instname:Universidad Autónoma de Bucaramanga - UNAB reponame:Repositorio Institucional UNAB repourl:https://repository.unab.edu.co
dc.language.iso.spa.fl_str_mv	spa
language	spa
dc.relation.references.spa.fl_str_mv	ARMSTRONG, R., FRIETAG, D., JOACHIMS, T. y MITCHELL, T„ WebWatcher: a learning apprentice for the world wide web. En Proceedings of the 1995 AAAI Spring Symposium of Information Gathering from Heterogeneous, Distributed Environments, Stanford, CA, 1995. AAAI Press. [ BAEZA Yates, Ricardo A. String Searching Algorithms. En FRAKES, William B. y BAEZA Yates, Ricardo A. Information Retrieval: Data Structures & Algorithms. UpperSaddle River, New Jersey: Prentice Hall PTR, 1992. BAEZA YATES, Ricardo y FRAKES, William B. Information Retrieval Data Structures & Algorithms. Prentice Hall PTR, Upper Saddle River, New Jersey. 1992. BAEZA YATES, Ricardo Y RIBEIRO NETO, Rerthier. Modern Information Retrieval. Addisson WesleyACM Press. 1992. BAKEL, Bas van. Modern classical document indexing: a linguistic contribution to knowledge-based IR. En Annual International ACM-SIGIR Conference on 12R research and Development in Information Retrieval (SIGIR’98) 1998. Melborne, AU. Proceedings. New York ACM Press, 1998. p.333-334. BELKIN Nicholas J. y CROFT W. Bruce. Information filtering and information retrieval: Two sides of the same coin? Communications of the ACM, 35(12):29-38. Diciembre 1992. BHARAT, K. y HENZINGER, M. Improved algorithms for topic distillation in a hyperlinked environment. In Proc. 21st International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 104-111, August 1998. BOOCH, Grady, JACOBSON, Ivar Y RUMBAUGH, James. The Unified Modeling Languaje. Addison Wesley Longman Inc. Rational Software Corporation. 1999. CORTHOUST, Jan. The DSI Service of VUBIS-Antwerpen of Antwerp. 1996. http:/143.169.20.1/MAN/SDIE/# corp-au DELGADO, J.A. Agent - Based Information Filtering and Recommender Systems on the Internet. PhD. Thesis, Nagoya Institute of Technology. Marzo 2000. FOX, Christopher. Lexical analysis and stoplists. En: FRAKES, William B. y BAEZA Yates, Ricardo A. Information Retrieval: Data Structures & Algorithms. UpperSaddle River, New Jersey: Prentice Hall PTR, 1992. p. 102-130. FRAKES, William B. Stemming Algorithms. En FRAKES, William B. BAEZA Yates, Ricardo A. Information Retrieval: Data Structures & Algorithms. Upper Saddle River, New Jersey: Prentice Hall PTR, 1992. GILES, L„ BOLLACKER, K. y LAWRENCE, S. CiteSeer An Automatic Citation Indexing System. En Proceedings of the 3rd ACM Conference on Digital Librarles. KAUTZ, H., SELMAN, B. y SHAH, M. The Hidden Web. Al Magazine. Summer 1997. AAAI Press. KLEINBERG, J. Authoritative sources in a hyperlinked environment. Proc. 9th ACM-SIAM Symposium on Discrete Algorithms, 1998. To appear in Journal of the ACM. 1999. Also appears as IBM Research Report RJ 10076, May 1997. [ KORFHAGE, Robert R. Information Retrieval and Storage. New York: John Wiley & Sons, 1997. KOWALSKI, Gerald. Information Retrieval Systems: Theory and Implementation. Boston: Kluwer Academic Publishers, 1997. KRAAIJ, Wessel y POHLMANN, Renée. Viewing stemming as recall enhancement. En Annual International ACM-SIGIR Conference on research and Development in Information Retrieval (SIGIR’96), 1996, Zurich, Switzerland. Proceedings. New York: ACM Press, 1996. p.40-48. MEADOW, Charles T. Text Information Retrieval Systems. Academic Press, 1992 MLADENIC, Dunja y GROBELNIK, Marko. Feature Selection for Classification Based on Text Hierarchy. En: Conference on Automated Learning and Discovery (CONALD-98), 2000, Proceedings. Pittsburg: Carnegie Mellón University, 2000. p.6p. http://www.cs.cmu.edu/afs/cs/user/dunja/www/pww.html OARD W, Douglas. A conceptual Framework for Text Filtering. University of Maryland, College Park, Mayo, 1996. http://www.enee.umd.edu/medlab/filter/filter.html PAGE, L. Y BRIN, S.. The Anatomy of a Search Engine. The Seventh International VWVW Conference (WWW’98). Brisbane, Australia, April 14-18, 1998. 129 [23] Pérez, Claudia. Agentes Móviles en Bibliotecas Digitales, [online]. [citado 17 mar., 2001], Disponible de <http://ict.pue.udlap.mx/pubs/claudia/cap1.html> PERKOWITZ, M. y ETZIONI, O. Adaptive Web Sites: Automatically Synthesizing Web Pages. En Proceedings of the American National Conference on Artificial Intelligence AAAI-98. RILOFF, Ellen. Little words can make big difference for text classification. En Annual International ACM-SIGIR Conference on research and Development in Information Retrieval (SIGIR’95), 1995, Seattle, USA. Proceedings. New York: ACM Press, 1995. SALTON, Gerard y BUCKLEY, Chris. Improving Retrieval Performance by Relevance Feedback. Ithaca, New York. Department of computer science, Cornell University, 1987. (Technical Report). SALTON, Gerard. MACGILL, Michael J. Introduction to Modern Information Retrieval. New York: McGRAW-Híll, 1983. Scott Deerwester, Susan T. Dumais, George W. Fumas, Thomas K. Laundauer, Richard Harshman. Indexing by Latent Semantic Analysis. WILBUR, J. W. y SIROTKIN, K. The Automatic Identification of Stop Words. Journal of Information Society, v.18, , p.45-55. 1992. VAN RIJSBERGEN, C. J. Information retrieval. Butterworths YANG, Yiming y PEDERSEN, Jan O. A comparative study on features selection in text categorization. School of Computer Science, Carnegie Mellón University, 1997.
dc.rights.coar.fl_str_mv	http://purl.org/coar/access_right/c_abf2
dc.rights.uri.*.fl_str_mv	http://creativecommons.org/licenses/by-nc-nd/2.5/co/
dc.rights.local.spa.fl_str_mv	Abierto (Texto Completo)
dc.rights.creativecommons.*.fl_str_mv	Atribución-NoComercial-SinDerivadas 2.5 Colombia
rights_invalid_str_mv	http://creativecommons.org/licenses/by-nc-nd/2.5/co/ Abierto (Texto Completo) Atribución-NoComercial-SinDerivadas 2.5 Colombia http://purl.org/coar/access_right/c_abf2
dc.format.mimetype.spa.fl_str_mv	application/pdf
dc.coverage.spatial.spa.fl_str_mv	Bucaramanga (Santander, Colombia)
dc.coverage.campus.spa.fl_str_mv	UNAB Campus Bucaramanga
dc.publisher.grantor.spa.fl_str_mv	Universidad Autónoma de Bucaramanga UNAB
dc.publisher.faculty.spa.fl_str_mv	Facultad Ingeniería
dc.publisher.program.spa.fl_str_mv	Pregrado Ingeniería de Sistemas
dc.publisher.programid.none.fl_str_mv	ISI-1791
institution	Universidad Autónoma de Bucaramanga - UNAB
bitstream.url.fl_str_mv	https://repository.unab.edu.co/bitstream/20.500.12749/27025/1/2002_Amaya_Diaz_Javier.pdf https://repository.unab.edu.co/bitstream/20.500.12749/27025/2/license.txt https://repository.unab.edu.co/bitstream/20.500.12749/27025/3/2002_Amaya_Diaz_Javier.pdf.jpg
bitstream.checksum.fl_str_mv	0c0b99dd10ea396fc5358f0333e06c01 3755c0cfdb77e29f2b9125d7a45dd316 7986b199722a7e6408274756e8dd599e
bitstream.checksumAlgorithm.fl_str_mv	MD5 MD5 MD5
repository.name.fl_str_mv	Repositorio Institucional \| Universidad Autónoma de Bucaramanga - UNAB
repository.mail.fl_str_mv	repositorio@unab.edu.co
_version_	1851051925976383488
spelling	Pérez Alcázar, José de Jesús38f31005-c259-48e5-845c-ac95c39cc2b9Amaya Díaz, Javier Enriqued0138279-f799-459a-9a1c-c112d0b0f487Cañate Celedón, Jair José0ed7eaf2-4c60-4352-8075-c31b0559c581Carvajal Pineda, Carlos Fernando6cde9629-acf2-4d81-a9c3-804fd4c94ef0Amaya Díaz, Javier Enrique [0000164326]Bucaramanga (Santander, Colombia)UNAB Campus Bucaramanga2024-10-21T14:36:13Z2024-10-21T14:36:13Z2002-01-20http://hdl.handle.net/20.500.12749/27025instname:Universidad Autónoma de Bucaramanga - UNABreponame:Repositorio Institucional UNABrepourl:https://repository.unab.edu.coEl reciente y dramático crecimiento del Internet, es un claro signo de que la computación está entrando en una nueva era. Por esto es necesario empezar a reconocer nuevos conceptos como el filtrado y recuperación de información que nos permiten mostrar los documentos más relevantes de acuerdo con los perfiles de sus usuarios. Existen diversos modelos para dicha labor, entre los que se encuentran los modelos clásicos como el modelo Booleano y Vectorial, los cuales tiene un formalismo simple porque la relevancia de los documentos recuperados se basa simplemente en la igualación parcial de los términos indexados en los documentos y las consultas. Otros modelos como el LSI, (indexación semántico latente), toma un paso adelante y además de trabajar con términos indexados, trabaja con “conceptos”, es decir, recupera documentos cuyos términos indexados no se encuentran en la consulta del usuario pero que también son relevantes. El modelo de redes neuronales tiene una función similar pero trabaja algoritmos de aprendizaje.Introducción 1. Generalidades 5 1.1. Diseminación selectiva de información 5 1.2. Filtrado de información 6 1.2.1. Filtrado social o colaborativo. 6 1.2.1.1. Usuarios de un sistema de filtrado. 7 1.2.2. Filtrado basado en eventos 7 1.2.3 filtrado basado en reputación 10 1.2.4. Técnica de filtrado cognitivo o basado en contenido 12 2. Filtrado y recuperación de información 2.1. Conceptos básicos18 2.1.1. La tarea del usuario 2.1.2. La vista lógica del documento 2.2. El perfil en filtrado por contenido 20 3. Representación del documento 22 3.1. Estructura de almacenamiento de datos 23 3.1.1. Listas o archivos invertidos 23 3.2. Indexación automática 3.2.1. Identificación de términos. 25 3.2.2. Remoción de “stopwords”. 26 3.2.3. Normalización morfológica. 27 3.2.4. Calculo de relevancia. 30 3.2.5. Selección de términos. 4. Modelos de recuperación 34 4.1. Modelos de recuperación 4.2. Características de los modelos clásicos 36 4.2.1. Modelo booleano 36 4.2.2. Modelo vectorial 37 4.2.3. Modelo probabilístico 38 4.3. Modelo vectorial en recuperación de Información 4.4. Modelos algebraicos alternativos 43 4.4.1. Modelo de indexación semántico latente 43 4.4.2. Modelo de redes neuronales 47 4.4.2.1. Definiciones de una red neuronal 49 4.4.2.2. Ventajas que ofrecen las redes neuronale 4.4.2.2.1. Aprendizaje adaptativo 51 4.4.2.2.2. Auto-organización 52 4.4.2.2.3. Tolerancia a fallos. 53 4.4.2.2.4. Operación en tiempo real. 54 4.4.2.2.5. Fácil inserción dentro de la tecnología existente 55 4.4.2.3. Niveles o capas de una red neuronal 57 4.4.2.4. Mecanismos de aprendizaje 58 4.4.2.4.1. Aprendizaje por corrección de error 4.4.2.4.2. Aprendizaje por refuerzo 63 4.4.2.4.3. Aprendizaje estocástico 64 4.4.2.4.4. Aprendizaje no supervisado 65 4.4.2.5. Modelo de redes neuronales para la recuperación de 66 Información 5. Comparación y evaluación de los modelos de 71 Filtrado por contenido 5.1. Pruebas en el modelo booleano 73 5.2. Pruebas en el modelo vectorial 76 5.3. Pruebas en el modelo de redes neuronales 78 5.4. Pruebas en el modelo de indexación 80 Semántico latente 6. Prototipo de sistema de filtrado de 83 Información basada en contenido 6.1. Preanálisis 83 6.1.1. Casos de uso descripción 83 6.1.1.1. Validar usuario 84 6.1.1.2. Suscribirse al sistema 85 6.1.1.3. Activar proceso de filtrado 86 6.1.1.4. Definir perfil 87 6.1.1.5. Consultar información 88 6.2. Análisis. 89 6.2.1. Diagramas de secuencia y colaboración 6.2.2. Diagrama de clases 99 6.2.3. Diccionario de datos para el diagrama de clases 100 6.2.4. Diagrama de actividades 102 6.2.4.1. Usuario 102 6.2.4.2. Método 103 6.2.4.3. Vector espacial 105 6.3. Diseño 106 6.3.1. Diagramas correspondientes a la ingeniería de casos de 106 Uso. 6.3.2 descripción procedimental de objetos. 109 6.3.3. Descripción de pantallas 113 6.3.4. Arquitectura del sistema 117 6.3.5 desarrollo del sistema 120 7. Conclusiones 123 8. Recomendaciones 125 9. Bibliografía 126PregradoThe recent and dramatic growth of the Internet is a clear sign that computing is entering a new era. For this reason, it is necessary to begin to recognize new concepts such as filtering and information retrieval that allow us to display the most relevant documents according to the profiles of their users. There are various models for this task, among which are the classic models such as the Boolean and Vector models, which have a simple formalism because the relevance of the retrieved documents is based simply on the partial matching of the indexed terms in the documents and the queries. Other models such as LSI (latent semantic indexing) take a step forward and, in addition to working with indexed terms, work with “concepts”, that is, retrieve documents whose indexed terms are not found in the user’s query but are also relevant. The neural network model has a similar function but works with learning algorithms.Modalidad Presencialapplication/pdfspahttp://creativecommons.org/licenses/by-nc-nd/2.5/co/Abierto (Texto Completo)Atribución-NoComercial-SinDerivadas 2.5 Colombiahttp://purl.org/coar/access_right/c_abf2Prototipo de sistema de filtrado por contenido para la diseminación de información contenida en la webPrototype of content filtering system for the dissemination of information contained on the webIngeniero de SistemasUniversidad Autónoma de Bucaramanga UNABFacultad IngenieríaPregrado Ingeniería de SistemasISI-1791info:eu-repo/semantics/bachelorThesisTrabajo de Gradohttp://purl.org/coar/resource_type/c_7a1finfo:eu-repo/semantics/acceptedVersionhttp://purl.org/redcol/resource_type/TPSystems engineerTechnological innovationsSelective disseminationStorage systemsInformation retrievalInformation retrievalArtificial intelligenceNeural networks (Computer science)Ingeniería de sistemasInnovaciones tecnológicasRecuperación de informaciónInteligencia artificialRedes neuronales (Computadores)Diseminación selectivaSistemas de almacenamientoRecuperación de informaciónARMSTRONG, R., FRIETAG, D., JOACHIMS, T. y MITCHELL, T„ WebWatcher: a learning apprentice for the world wide web. En Proceedings of the 1995 AAAI Spring Symposium of Information Gathering from Heterogeneous, Distributed Environments, Stanford, CA, 1995. AAAI Press. [BAEZA Yates, Ricardo A. String Searching Algorithms. En FRAKES, William B. y BAEZA Yates, Ricardo A. Information Retrieval: Data Structures & Algorithms. UpperSaddle River, New Jersey: Prentice Hall PTR, 1992.BAEZA YATES, Ricardo y FRAKES, William B. Information Retrieval Data Structures & Algorithms. Prentice Hall PTR, Upper Saddle River, New Jersey. 1992.BAEZA YATES, Ricardo Y RIBEIRO NETO, Rerthier. Modern Information Retrieval. Addisson WesleyACM Press. 1992.BAKEL, Bas van. Modern classical document indexing: a linguistic contribution to knowledge-based IR. En Annual International ACM-SIGIR Conference on 12R research and Development in Information Retrieval (SIGIR’98) 1998. Melborne, AU. Proceedings. New York ACM Press, 1998. p.333-334.BELKIN Nicholas J. y CROFT W. Bruce. Information filtering and information retrieval: Two sides of the same coin? Communications of the ACM, 35(12):29-38. Diciembre 1992.BHARAT, K. y HENZINGER, M. Improved algorithms for topic distillation in a hyperlinked environment. In Proc. 21st International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 104-111, August 1998.BOOCH, Grady, JACOBSON, Ivar Y RUMBAUGH, James. The Unified Modeling Languaje. Addison Wesley Longman Inc. Rational Software Corporation. 1999.CORTHOUST, Jan. The DSI Service of VUBIS-Antwerpen of Antwerp. 1996. http:/143.169.20.1/MAN/SDIE/# corp-auDELGADO, J.A. Agent - Based Information Filtering and Recommender Systems on the Internet. PhD. Thesis, Nagoya Institute of Technology. Marzo 2000.FOX, Christopher. Lexical analysis and stoplists. En: FRAKES, William B. y BAEZA Yates, Ricardo A. Information Retrieval: Data Structures & Algorithms. UpperSaddle River, New Jersey: Prentice Hall PTR, 1992. p. 102-130.FRAKES, William B. Stemming Algorithms. En FRAKES, William B. BAEZA Yates, Ricardo A. Information Retrieval: Data Structures & Algorithms. Upper Saddle River, New Jersey: Prentice Hall PTR, 1992.GILES, L„ BOLLACKER, K. y LAWRENCE, S. CiteSeer An Automatic Citation Indexing System. En Proceedings of the 3rd ACM Conference on Digital Librarles.KAUTZ, H., SELMAN, B. y SHAH, M. The Hidden Web. Al Magazine. Summer 1997. AAAI Press.KLEINBERG, J. Authoritative sources in a hyperlinked environment. Proc. 9th ACM-SIAM Symposium on Discrete Algorithms, 1998. To appear in Journal of the ACM. 1999. Also appears as IBM Research Report RJ 10076, May 1997. [KORFHAGE, Robert R. Information Retrieval and Storage. New York: John Wiley & Sons, 1997.KOWALSKI, Gerald. Information Retrieval Systems: Theory and Implementation. Boston: Kluwer Academic Publishers, 1997.KRAAIJ, Wessel y POHLMANN, Renée. Viewing stemming as recall enhancement. En Annual International ACM-SIGIR Conference on research and Development in Information Retrieval (SIGIR’96), 1996, Zurich, Switzerland. Proceedings. New York: ACM Press, 1996. p.40-48.MEADOW, Charles T. Text Information Retrieval Systems. Academic Press, 1992MLADENIC, Dunja y GROBELNIK, Marko. Feature Selection for Classification Based on Text Hierarchy. En: Conference on Automated Learning and Discovery (CONALD-98), 2000, Proceedings. Pittsburg: Carnegie Mellón University, 2000. p.6p. http://www.cs.cmu.edu/afs/cs/user/dunja/www/pww.htmlOARD W, Douglas. A conceptual Framework for Text Filtering. University of Maryland, College Park, Mayo, 1996. http://www.enee.umd.edu/medlab/filter/filter.htmlPAGE, L. Y BRIN, S.. The Anatomy of a Search Engine. The Seventh International VWVW Conference (WWW’98). Brisbane, Australia, April 14-18, 1998. 129 [23] Pérez, Claudia. Agentes Móviles en Bibliotecas Digitales, [online]. [citado 17 mar., 2001], Disponible de <http://ict.pue.udlap.mx/pubs/claudia/cap1.html>PERKOWITZ, M. y ETZIONI, O. Adaptive Web Sites: Automatically Synthesizing Web Pages. En Proceedings of the American National Conference on Artificial Intelligence AAAI-98.RILOFF, Ellen. Little words can make big difference for text classification. En Annual International ACM-SIGIR Conference on research and Development in Information Retrieval (SIGIR’95), 1995, Seattle, USA. Proceedings. New York: ACM Press, 1995.SALTON, Gerard y BUCKLEY, Chris. Improving Retrieval Performance by Relevance Feedback. Ithaca, New York. Department of computer science, Cornell University, 1987. (Technical Report).SALTON, Gerard. MACGILL, Michael J. Introduction to Modern Information Retrieval. New York: McGRAW-Híll, 1983.Scott Deerwester, Susan T. Dumais, George W. Fumas, Thomas K. Laundauer, Richard Harshman. Indexing by Latent Semantic Analysis.WILBUR, J. W. y SIROTKIN, K. The Automatic Identification of Stop Words. Journal of Information Society, v.18, , p.45-55. 1992.VAN RIJSBERGEN, C. J. Information retrieval. ButterworthsYANG, Yiming y PEDERSEN, Jan O. A comparative study on features selection in text categorization. School of Computer Science, Carnegie Mellón University, 1997.ORIGINAL2002_Amaya_Diaz_Javier.pdf2002_Amaya_Diaz_Javier.pdfTesisapplication/pdf23575503https://repository.unab.edu.co/bitstream/20.500.12749/27025/1/2002_Amaya_Diaz_Javier.pdf0c0b99dd10ea396fc5358f0333e06c01MD51open accessLICENSElicense.txtlicense.txttext/plain; charset=utf-8829https://repository.unab.edu.co/bitstream/20.500.12749/27025/2/license.txt3755c0cfdb77e29f2b9125d7a45dd316MD52open accessTHUMBNAIL2002_Amaya_Diaz_Javier.pdf.jpg2002_Amaya_Diaz_Javier.pdf.jpgIM Thumbnailimage/jpeg7349https://repository.unab.edu.co/bitstream/20.500.12749/27025/3/2002_Amaya_Diaz_Javier.pdf.jpg7986b199722a7e6408274756e8dd599eMD53open access20.500.12749/27025oai:repository.unab.edu.co:20.500.12749/270252024-10-21 22:03:32.764open accessRepositorio Institucional \| Universidad Autónoma de Bucaramanga - UNABrepositorio@unab.edu.coRUwoTE9TKSBBVVRPUihFUyksIG1hbmlmaWVzdGEobWFuaWZlc3RhbW9zKSBxdWUgbGEgb2JyYSBvYmpldG8gZGUgbGEgcHJlc2VudGUgYXV0b3JpemFjacOzbiBlcyBvcmlnaW5hbCB5IGxhIHJlYWxpesOzIHNpbiB2aW9sYXIgbyB1c3VycGFyIGRlcmVjaG9zIGRlIGF1dG9yIGRlIHRlcmNlcm9zLCBwb3IgbG8gdGFudG8sIGxhIG9icmEgZXMgZGUgZXhjbHVzaXZhIGF1dG9yw61hIHkgdGllbmUgbGEgdGl0dWxhcmlkYWQgc29icmUgbGEgbWlzbWEuCgpFbiBjYXNvIGRlIHByZXNlbnRhcnNlIGN1YWxxdWllciByZWNsYW1hY2nDs24gbyBhY2Npw7NuIHBvciBwYXJ0ZSBkZSB1biB0ZXJjZXJvIGVuIGN1YW50byBhIGxvcyBkZXJlY2hvcyBkZSBhdXRvciBzb2JyZSBsYSBvYnJhIGVuIGN1ZXN0acOzbi4gRWwgQVVUT1IgYXN1bWlyw6EgdG9kYSBsYSByZXNwb25zYWJpbGlkYWQsIHkgc2FsZHLDoSBlbiBkZWZlbnNhIGRlIGxvcyBkZXJlY2hvcyBhcXXDrSBhdXRvcml6YWRvcywgcGFyYSB0b2RvcyBsb3MgZWZlY3RvcyBsYSBVTkFCIGFjdMO6YSBjb21vIHVuIHRlcmNlcm8gZGUgYnVlbmEgZmUuCgpFbCBBVVRPUiBhdXRvcml6YSBhIGxhIFVuaXZlcnNpZGFkIEF1dMOzbm9tYSBkZSBCdWNhcmFtYW5nYSBwYXJhIHF1ZSBlbiBsb3MgdMOpcm1pbm9zIGVzdGFibGVjaWRvcyBlbiBsYSBMZXkgMjMgZGUgMTk4MiwgTGV5IDQ0IGRlIDE5OTMsIERlY2lzacOzbiBBbmRpbmEgMzUxIGRlIDE5OTMgeSBkZW3DoXMgbm9ybWFzIGdlbmVyYWxlcyBzb2JyZSBsYSBtYXRlcmlhLCB1dGlsaWNlIGxhIG9icmEgb2JqZXRvIGRlIGxhIHByZXNlbnRlIGF1dG9yaXphY2nDs24uCg==

Prototipo de sistema de filtrado por contenido para la diseminación de información contenida en la web

Publicaciones similares