Generalized molecular descriptors derived from event-based discrete derivative

In the present study, a generalized approach for molecular structure characterization is introduced, based on the relation frequency matrix (F) representation of the molecular graph and the subsequent calculation of the corresponding discrete derivative (finite difference) over a pair of elements (a...

Full description

Autores:
Tipo de recurso:
Fecha de publicación:
2016
Institución:
Universidad Tecnológica de Bolívar
Repositorio:
Repositorio Institucional UTB
Idioma:
eng
OAI Identifier:
oai:repositorio.utb.edu.co:20.500.12585/9005
Acceso en línea:
https://hdl.handle.net/20.500.12585/9005
Palabra clave:
Atom-based derivative index
Discrete derivative
DIVATI
Event criterion
Fingerprint
Frequency matrix
PCA
QSAR/QSPR
TOMOCOMD-CARDD
Carbon
Halogen
Furan derivative
Algorithm
Atom
Chemical structure
Computer interface
Controlled study
Factor analysis
Hydrophobicity
Molecular biology
Priority journal
Quantitative structure property relation
Review
Algorithm
Chemical model
Chemistry
Software
Algorithms
Furans
Models, Chemical
Software
Rights
restrictedAccess
License
http://creativecommons.org/licenses/by-nc-nd/4.0/
id UTB2_cafbb83294716320365c343c9494e7c4
oai_identifier_str oai:repositorio.utb.edu.co:20.500.12585/9005
network_acronym_str UTB2
network_name_str Repositorio Institucional UTB
repository_id_str
dc.title.none.fl_str_mv Generalized molecular descriptors derived from event-based discrete derivative
title Generalized molecular descriptors derived from event-based discrete derivative
spellingShingle Generalized molecular descriptors derived from event-based discrete derivative
Atom-based derivative index
Discrete derivative
DIVATI
Event criterion
Fingerprint
Frequency matrix
PCA
QSAR/QSPR
TOMOCOMD-CARDD
Carbon
Halogen
Furan derivative
Algorithm
Atom
Chemical structure
Computer interface
Controlled study
Factor analysis
Hydrophobicity
Molecular biology
Priority journal
Quantitative structure property relation
Review
Algorithm
Chemical model
Chemistry
Software
Algorithms
Furans
Models, Chemical
Software
title_short Generalized molecular descriptors derived from event-based discrete derivative
title_full Generalized molecular descriptors derived from event-based discrete derivative
title_fullStr Generalized molecular descriptors derived from event-based discrete derivative
title_full_unstemmed Generalized molecular descriptors derived from event-based discrete derivative
title_sort Generalized molecular descriptors derived from event-based discrete derivative
dc.subject.keywords.none.fl_str_mv Atom-based derivative index
Discrete derivative
DIVATI
Event criterion
Fingerprint
Frequency matrix
PCA
QSAR/QSPR
TOMOCOMD-CARDD
Carbon
Halogen
Furan derivative
Algorithm
Atom
Chemical structure
Computer interface
Controlled study
Factor analysis
Hydrophobicity
Molecular biology
Priority journal
Quantitative structure property relation
Review
Algorithm
Chemical model
Chemistry
Software
Algorithms
Furans
Models, Chemical
Software
topic Atom-based derivative index
Discrete derivative
DIVATI
Event criterion
Fingerprint
Frequency matrix
PCA
QSAR/QSPR
TOMOCOMD-CARDD
Carbon
Halogen
Furan derivative
Algorithm
Atom
Chemical structure
Computer interface
Controlled study
Factor analysis
Hydrophobicity
Molecular biology
Priority journal
Quantitative structure property relation
Review
Algorithm
Chemical model
Chemistry
Software
Algorithms
Furans
Models, Chemical
Software
description In the present study, a generalized approach for molecular structure characterization is introduced, based on the relation frequency matrix (F) representation of the molecular graph and the subsequent calculation of the corresponding discrete derivative (finite difference) over a pair of elements (atoms). In earlier publications (22-24), an unique event, named connected subgraphs, (based on the Kier-Hall’s subgraphs) was systematically employed for the computation of the matrix F. The present report is a generalization of this notion, in which eleven additional events are introduced, classified in three categories, namely, topological (terminal paths, vertex path incidence, quantum subgraphs, walks of length k, Sach’s subgraphs), fingerprints (MACCs, E-state and substructure fingerprints) and atomic contributions (Ghose and Crippen atom-types for hydrophobicity and refractivity) for F generation. The events are intended to capture diverse information by the generation or search of different kinds of substructures from the graph representation of a molecule. The discrete derivative over duplex atom relations are calculated for each event, and the resulting derivatives, local vertex invariants (LOVIs) are finally obtained. These LOVIs are subsequently employed as the basis for the calculation of global and local indices over groups of atoms (heteroatoms, halogens, methyl carbons, etc.), by using norms, means, statistics and classical algorithms as aggregator (fusion) operators. These indices were implemented in our house software DIVATI (Derivative Type Indices, a new module of TOMOCOMDCARDD system). DIVATI provides a friendly and cross-platform graphical user interface, developed in the Java programming language and is freely available at: http: //www.tomocomd.com. Factor analysis shows that the presented events are rather orthogonal and collect diverse information about the chemical structure. Finally, QSPR models were built to describe the logP and logK of 34 furylethylenes derivatives using the eleven events. Generally, the equations obtained according to these events showed high correlations, with the Sach’s sub-graphs and Multiplicity events showing the best behavior in the description of logK (Q2LOO value of 99.06%) and logP (Q2LOO value of 98.1%), respectively. These results show that these new eventbased indices constitute a powerful approach for chemoinformatics studies. © 2016 Bentham Science Publishers.
publishDate 2016
dc.date.issued.none.fl_str_mv 2016
dc.date.accessioned.none.fl_str_mv 2020-03-26T16:32:45Z
dc.date.available.none.fl_str_mv 2020-03-26T16:32:45Z
dc.type.coarversion.fl_str_mv http://purl.org/coar/version/c_970fb48d4fbd8a85
dc.type.coar.fl_str_mv http://purl.org/coar/resource_type/c_efa0
dc.type.driver.none.fl_str_mv info:eu-repo/semantics/review
dc.type.hasversion.none.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.spa.none.fl_str_mv Artículo de revisión
status_str publishedVersion
dc.identifier.citation.none.fl_str_mv Current Pharmaceutical Design; Vol. 22, Núm. 33; pp. 5095-5113
dc.identifier.issn.none.fl_str_mv 13816128
dc.identifier.uri.none.fl_str_mv https://hdl.handle.net/20.500.12585/9005
dc.identifier.doi.none.fl_str_mv 10.2174/1381612822666160610114148
dc.identifier.instname.none.fl_str_mv Universidad Tecnológica de Bolívar
dc.identifier.reponame.none.fl_str_mv Repositorio UTB
dc.identifier.orcid.none.fl_str_mv 24824951000
56506611500
55665599200
55363486500
36454896800
16025887100
14040865600
57192014301
7801534311
57192013138
57192004534
7004872108
identifier_str_mv Current Pharmaceutical Design; Vol. 22, Núm. 33; pp. 5095-5113
13816128
10.2174/1381612822666160610114148
Universidad Tecnológica de Bolívar
Repositorio UTB
24824951000
56506611500
55665599200
55363486500
36454896800
16025887100
14040865600
57192014301
7801534311
57192013138
57192004534
7004872108
url https://hdl.handle.net/20.500.12585/9005
dc.language.iso.none.fl_str_mv eng
language eng
dc.rights.coar.fl_str_mv http://purl.org/coar/access_right/c_16ec
dc.rights.uri.none.fl_str_mv http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.rights.accessrights.none.fl_str_mv info:eu-repo/semantics/restrictedAccess
dc.rights.cc.none.fl_str_mv Atribución-NoComercial 4.0 Internacional
rights_invalid_str_mv http://creativecommons.org/licenses/by-nc-nd/4.0/
Atribución-NoComercial 4.0 Internacional
http://purl.org/coar/access_right/c_16ec
eu_rights_str_mv restrictedAccess
dc.format.medium.none.fl_str_mv Recurso electrónico
dc.format.mimetype.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Bentham Science Publishers B.V.
publisher.none.fl_str_mv Bentham Science Publishers B.V.
dc.source.none.fl_str_mv https://www.scopus.com/inward/record.uri?eid=2-s2.0-84995975328&doi=10.2174%2f1381612822666160610114148&partnerID=40&md5=32f7bc1daa4fe35762dd2b97589470b4
institution Universidad Tecnológica de Bolívar
bitstream.url.fl_str_mv https://repositorio.utb.edu.co/bitstream/20.500.12585/9005/1/MiniProdInv.png
bitstream.checksum.fl_str_mv 0cb0f101a8d16897fb46fc914d3d7043
bitstream.checksumAlgorithm.fl_str_mv MD5
repository.name.fl_str_mv Repositorio Institucional UTB
repository.mail.fl_str_mv repositorioutb@utb.edu.co
_version_ 1814021557070069760
spelling 2020-03-26T16:32:45Z2020-03-26T16:32:45Z2016Current Pharmaceutical Design; Vol. 22, Núm. 33; pp. 5095-511313816128https://hdl.handle.net/20.500.12585/900510.2174/1381612822666160610114148Universidad Tecnológica de BolívarRepositorio UTB2482495100056506611500556655992005536348650036454896800160258871001404086560057192014301780153431157192013138571920045347004872108In the present study, a generalized approach for molecular structure characterization is introduced, based on the relation frequency matrix (F) representation of the molecular graph and the subsequent calculation of the corresponding discrete derivative (finite difference) over a pair of elements (atoms). In earlier publications (22-24), an unique event, named connected subgraphs, (based on the Kier-Hall’s subgraphs) was systematically employed for the computation of the matrix F. The present report is a generalization of this notion, in which eleven additional events are introduced, classified in three categories, namely, topological (terminal paths, vertex path incidence, quantum subgraphs, walks of length k, Sach’s subgraphs), fingerprints (MACCs, E-state and substructure fingerprints) and atomic contributions (Ghose and Crippen atom-types for hydrophobicity and refractivity) for F generation. The events are intended to capture diverse information by the generation or search of different kinds of substructures from the graph representation of a molecule. The discrete derivative over duplex atom relations are calculated for each event, and the resulting derivatives, local vertex invariants (LOVIs) are finally obtained. These LOVIs are subsequently employed as the basis for the calculation of global and local indices over groups of atoms (heteroatoms, halogens, methyl carbons, etc.), by using norms, means, statistics and classical algorithms as aggregator (fusion) operators. These indices were implemented in our house software DIVATI (Derivative Type Indices, a new module of TOMOCOMDCARDD system). DIVATI provides a friendly and cross-platform graphical user interface, developed in the Java programming language and is freely available at: http: //www.tomocomd.com. Factor analysis shows that the presented events are rather orthogonal and collect diverse information about the chemical structure. Finally, QSPR models were built to describe the logP and logK of 34 furylethylenes derivatives using the eleven events. Generally, the equations obtained according to these events showed high correlations, with the Sach’s sub-graphs and Multiplicity events showing the best behavior in the description of logK (Q2LOO value of 99.06%) and logP (Q2LOO value of 98.1%), respectively. These results show that these new eventbased indices constitute a powerful approach for chemoinformatics studies. © 2016 Bentham Science Publishers.carbon, 7440-44-0; FuransRecurso electrónicoapplication/pdfengBentham Science Publishers B.V.http://creativecommons.org/licenses/by-nc-nd/4.0/info:eu-repo/semantics/restrictedAccessAtribución-NoComercial 4.0 Internacionalhttp://purl.org/coar/access_right/c_16echttps://www.scopus.com/inward/record.uri?eid=2-s2.0-84995975328&doi=10.2174%2f1381612822666160610114148&partnerID=40&md5=32f7bc1daa4fe35762dd2b97589470b4Generalized molecular descriptors derived from event-based discrete derivativeinfo:eu-repo/semantics/reviewinfo:eu-repo/semantics/publishedVersionArtículo de revisiónhttp://purl.org/coar/version/c_970fb48d4fbd8a85http://purl.org/coar/resource_type/c_efa0Atom-based derivative indexDiscrete derivativeDIVATIEvent criterionFingerprintFrequency matrixPCAQSAR/QSPRTOMOCOMD-CARDDCarbonHalogenFuran derivativeAlgorithmAtomChemical structureComputer interfaceControlled studyFactor analysisHydrophobicityMolecular biologyPriority journalQuantitative structure property relationReviewAlgorithmChemical modelChemistrySoftwareAlgorithmsFuransModels, ChemicalSoftwareMartínez-Santiago O.Cabrera R.M.Marrero-Ponce Y.Barigye S.J.Le-Thi-Thu H.Torres, JavierZambrano C.H.Yaber Goenaga, IvánCruz-Monteagudo, M.López Y.M.Giménez F.P.Torrens, F.Todeschini, R., (2009) Consonni, 5. , Molecular Descriptors for Chemoinformatics. USA: Wiley-VCHCrum-Brown, A., On an application of mathematics to chemistry (1867) Proc Royal Soc, VI (73), pp. 89-90Kier, L.B., Hall, L.H., Molecular Structure Description (1999) The Electrotopological State, , San Diego: Academic PressCrum-Brown, A., Fraser, T.R., On the connection between chemical constitution and physiological action. Part 1. On the physiological action of salts of the ammonium bases, derived from strychnia, brucia, thebia, codeia, orphia and nicotia (1868) Trans Roy Soc, 25, pp. 151-203González-Díaz, H.-P.Y., Hernández, I., Bastida, I., 3DMEDNEs: An alternative “In Silico” technique for chemical research in toxicology. Prediction of chemically induced agranulocytosis (2003) Chem Res Toxicol, 16 (10), pp. 1318-1327Marrero-Ponce, Y.-D.H., Romero-Zaldivar, V., Torrens, F., Castro, E.A., 3D-chiral quadratic indices of the “molecular pseudograph’s atom adjacency matrix” and their application to central chirality codification: Classification of ACE inhibitors and prediction of σ-receptor antagonist activities (2004) Bioorg Med Chem, 12, pp. 5331-5342García-Jacas, C.R., Contreras-Torres, E., Marrero-Ponce, Y., Pupo-Meriño, M., Barigye, S.J., Cabrera-Leyva, L., Examining the predictive accuracy of the novel 3D N-linear algebraic molecular codifications on benchmark datasets (2016) J Cheminf, 8 (1), pp. 1-16Marrero-Ponce, Y., Optimum search strategies or novel 3D molecular descriptors: Is there a stalemate? (2015) Curr Bioinf, p. 10Fernández, N.R., Castelán, M.M., Pozos, S.G., A new type of quinoxalinone derivatives affects viability, invasion, and intracellular growth of Toxoplasma gondii tachyzoites in vitro (2016) Parasitol Res, pp. 1-16Marrero-Ponce, Y., R Garcia-Jacas, C., J Barigye, S., Optimum search strategies or novel 3D molecular descriptors: Is there a stalemate? (2015) Curr Bioinf, 10 (5), pp. 533-564González-Díaz, H., González-Díaz, Y., Santana, L., Ubeira, F.M., Uriarte, E., Proteomics, networks and connectivity indices (2008) Proteomics, 8 (4), pp. 750-778Puzyn, T., Leszczynski, J., Cronin, M., (2009) Recent Advances in QSAR Studies. Methods and Applications, , USA: SpringerGonzalez-Diaz, H., Vilar, S., Santana, L., Uriarte, E., Medicinal chemistry and bioinformatics-current trends in drugs discovery with networks topological indices (2007) Curr Top Med Chem, 7 (10), pp. 1015-1029Ponce, Y.M., Perez, M.C., Zaldivar, V.R., Diaz, H.G., Torrens, F., A new topological descriptors based model for predicting intestinal epithelial transport of drugs in Caco-2 cell culture (2004) J Pharm Pharm Sci, 7, pp. 186-199Vega, M.C., Montero-Torres, A., Marrero-Ponce, Y., New ligandbased approach for the discovery of antitrypanosomal compounds (2006) Bioorganic Med Chem Lett, 16 (7), pp. 1898-1904Marrero-Ponce, Y., Khan, M., Casañola-Martín, G.M., Bondbased 2D TOMOCOMD-CARDD approach for drug discovery: Aiding decision-making in ‘in silico’selection of new lead tyrosinase inhibitors (2007) J Computer-Aided Mol Des, 21 (4), pp. 167-188Marrero, Y., Torrens, F., García, R., Ortega, S.E., Romero, V., Novel 2D TOMOCOMD-CARDD molecular descriptors: Atom-based stochastic and non-stochastic bilinear indices and their QSPR applications (2008) J Math Chem, 44, pp. 650-673Estrada, E., Molina, E., Novel local (Fragment-based) topological molecular descriptors for QSPR/QSAR and molecular design (2001) J Mol Graph Model, 20, pp. 54-64Estrada, E., Uriarte, E., Recent advances on the role of topological indices in drug discovery research (2001) Curr Med Chem, 8, pp. 1573-1588Devillers, J., Balaban, A., (1999) Topological Indices and Related Descriptors in QSAR and QSPR, , Amsterdam: Gordon and BreachMarrero-Ponce, Y., Linear indices of the “molecular pseudograph’s atom adjacency matrix”: Definition, significance-interpretation, and application to QSAR analysis of flavone derivatives as HIV-1 integrase inhibitors (2004) J Chem Inf Comput Sci, 44, pp. 2010-2026Martínez-Santiago, O., Marrero-Ponce, Y., Millán Cabrera, R., Extending graph (Discrete) derivative descriptors to N-tuple atomrelations (2015) MATCH: Commun Math Comput Chem, 73, pp. 397-420Martínez-Santiago, O., Millán-Cabrera, R., Marrero-Ponce, Y., Discrete derivatives for atom-pairs as a novel Graph-theoretical invariant for generating new molecular descriptors: Orthogonality, interpretation and QSARs/QSPRs on benchmark databases (2014) Mol Informatics, 33, pp. 343-368Marrero Ponce, Y., Martínez-Santiago, O., Martínez López, Y., Barigye, S.J., Torrens, F., Derivatives in discrete mathematics: A novel graph-theoretical invariant for generating new 2/3D molecular descriptors. I Theory and QSPR application (2012) J Comput Aided Mol Des, 26, pp. 1229-1246Gorbátov, V.A., Fundamentos de la Matematica Discreta. Moscú (1988) URSS: MirBarigye, S.J., Marrero-Ponce, Y., Martínez Santiago, O., Martínez López, Y., Torrens, F., Shannon’s, Mutual, conditional and joint entropy-based information indices (2013) Generalization of Global Indices Defined from Local Vertex Invariants. Curr Comput-Aided Drug Des, 9, pp. 164-183Marrero-Ponce, Y., Martínez López, Y., Martínez-Santiago, O., Barigye Jones, S., TOMOCOMD (TOpological MOlecular COMputer Design) (2010) Unit of Computer-Aided Molecular “Biosilico” Discovery and Bioinformatic Research (CAMD-BIR Unit)Martínez-Santiago, O., Marrero-Ponce, Y., Martínez-López, Y., Barigye, J.S., (2011) DIVATI (Discrete Derivative Type Indices) Software, Version 1.0 Santa Clara, , Villa Clara, Cuba: Unit of Computer-Aided Molecular “Bio-silico” Discovery and Bioinformatic Research (CAMD-BIR UnitJantschi, L., Graph Theory. 1. Fragmentation of structural graphs (2002) Leonardo Electron J Pract Technol, 1, pp. 19-36Janezic, A.M., Nikolic, S., Trinajstic, N., (2007) Graph Theoretical Matrices in Chemistry. University of Kragujevac, Faculty of Science, , Kragujevac, SerbiaSkvortsova, M.I., Stankevich, I.V., Eigenvectors of weighted graphs: A supplement to Sachs’ theorem (2005) J Mol Struct, 719, pp. 213-223Sachs, H., Beziehungen zwischen den in einem graphen enthaltenen Kreisen und seinem characteristischen Polynom (1964) Publ Math, 11, pp. 119-134Marrero-Ponce, Y., Castillo-Garit, J.A., Torrens, F., Romero-Zaldivar, V., Castro, E., Atom, atom-type, and total linear indices of the “molecular pseudogragh´s atom adjacency matrix”: Application to QSPR/QSAR studies of organic compounds (2004) Molecules, 9, pp. 1100-1123Jun-Ichi, A., General rules for constructing huckel molecular orbital characteristic polynomials (1975) J am Chem Soc, 98, pp. 6840-6844Gutman, I., Impact of the Sachs Theorem on Theoretical Chemistry (2003) A Participant’s Testimony. MATCH Commun Math Comput Chem, 48, pp. 17-34Durant, J.L., Leland, B.A., Henry, D.R., Nourse, J.G., Reoptimization of MDL keys for use in drug discovery (2002) J Chem Inf Comput Sci, 42, pp. 1273-1280Kier, L.B., Hall, L.H., An electrotopological-state index for atoms in molecules (1990) Pharm Res, 7, pp. 801-807Kier, L.B., Electrotopological state indices for atom types: A novel combination of electronic, topological, and valence state information (1995) J Chem Inf Comput Sci, 35, pp. 1039-1045Kier, L.B., Hall, L.H., (1999) The Electrotopological State: Structure Modeling for QSAR and Data Base Analysis. In: Topological Indices and Related Descriptors in QSAR and QSPR, pp. 491-562. , Amsterdam: Gordon and Breach Sci PubGhose, A.K., Atomic physicochemical parameters for three-dimensional-structure-directed quantitative structure-activity relationships. 2. Modeling Dispersive and Hydrophobic Interactions (1987) J Chem Inf Comput Sci, 27, pp. 21-35Diudea, M.V., (2001) QSPR/QSAR Studies by Molecular Descriptors, , Huntington, N.Y: Nova ScienceMoldovan, C.D., Diudea, M.V., Costescu, A., Katona, G., Applicationt o QSAR studies of 2-furylethylene derivatives (2008) J Math ChemEstrada, E., (2001) 3D Connectivity Indices in QSPR/QSAR Studies. J Chem Inf Comput Sci, 41, pp. 791-797(2004) Mobydigs, , 1.0 ed. Milano, ItalyTulsa, I., STATISTICA. Version. 8.0 Ed: StatsoftMarrero-Ponce, Y., Martinez-Albelo, E.R., Casanola-Martin, G.M., Bond-based linear indices of the non-stochasitic and stochastic edge-adjacency matrix. 1. Theory and modeling of chemphys properties of organic molecules (2009) Mol Divers, 14, pp. 731-753Basilevsky, A., (1994) Statistical Factor Analysis and Related MethodsFrank, I.E., Friedman, J.H., (1993) Technometrics, 35, pp. 109-135http://purl.org/coar/resource_type/c_dcae04bcTHUMBNAILMiniProdInv.pngMiniProdInv.pngimage/png23941https://repositorio.utb.edu.co/bitstream/20.500.12585/9005/1/MiniProdInv.png0cb0f101a8d16897fb46fc914d3d7043MD5120.500.12585/9005oai:repositorio.utb.edu.co:20.500.12585/90052023-05-26 13:49:18.217Repositorio Institucional UTBrepositorioutb@utb.edu.co