Crossref journal-article
Wiley
Molecular Informatics (311)
Abstract

AbstractHere, the utility of Generative Topographic Maps (GTM) for data visualization, structure‐activity modeling and database comparison is evaluated, on hand of subsets of the Database of Useful Decoys (DUD). Unlike other popular dimensionality reduction approaches like Principal Component Analysis, Sammon Mapping or Self‐Organizing Maps, the great advantage of GTMs is providing data probability distribution functions (PDF), both in the high‐dimensional space defined by molecular descriptors and in 2D latent space. PDFs for the molecules of different activity classes were successfully used to build classification models in the framework of the Bayesian approach. Because PDFs are represented by a mixture of Gaussian functions, the Bhattacharyya kernel has been proposed as a measure of the overlap of datasets, which leads to an elegant method of global comparison of chemical libraries.

Bibliography

Kireeva, N., Baskin, I. I., Gaspar, H. A., Horvath, D., Marcou, G., & Varnek, A. (2012). Generative Topographic Mapping (GTM): Universal Tool for Data Visualization, Structure‐Activity Modeling and Dataset Comparison. Molecular Informatics, 31(3–4), 301–312. Portico.

Authors 6
  1. N. Kireeva (first)
  2. I. I. Baskin (additional)
  3. H. A. Gaspar (additional)
  4. D. Horvath (additional)
  5. G. Marcou (additional)
  6. A. Varnek (additional)
References 39 Referenced 113
  1. 10.1021/cc0000388
  2. 10.1002/minf.201000100
  3. 10.1007/978-0-387-39351-3
  4. {'key': 'e_1_2_7_4_2', 'volume-title': 'Principal Manifolds for Data Visualisation and Dimension Reduction', 'author': 'Gorban A. N.', 'year': '2007'} / Principal Manifolds for Data Visualisation and Dimension Reduction by Gorban A. N. (2007)
  5. 10.1070/RC2009v078n05ABEH004030
  6. 10.1016/j.drudis.2009.05.016
  7. {'key': 'e_1_2_7_7_2', 'volume-title': 'Pharmaceutical Data Mining. Approaches and Applications for Drug Discovery', 'author': 'Balakin K. V.', 'year': '2010'} / Pharmaceutical Data Mining. Approaches and Applications for Drug Discovery by Balakin K. V. (2010)
  8. {'key': 'e_1_2_7_8_2', 'volume-title': 'Principal Component Analysis', 'author': 'Jolliffe I. T.', 'year': '2002'} / Principal Component Analysis by Jolliffe I. T. (2002)
  9. 10.1007/BF02289565
  10. 10.1007/BF02289694
  11. 10.1007/978-3-642-56927-2
  12. 10.1109/T-C.1969.222678
  13. 10.1073/pnas.242424399
  14. 10.1002/jcc.10234
  15. 10.1016/S1093-3263(03)00155-4
  16. {'key': 'e_1_2_7_16_2', 'first-page': '833', 'volume-title': 'Advances in Neural Information Processing Systems,', 'author': 'Hinton G. E.', 'year': '2002'} / Advances in Neural Information Processing Systems, by Hinton G. E. (2002)
  17. 10.1002/ange.201105156
  18. 10.1021/ci050471a
  19. 10.1007/978-0-387-45528-0
  20. 10.1007/978-3-540-45080-1_49
  21. 10.1162/089976698300017953
  22. {'key': 'e_1_2_7_22_2', 'author': 'Bishop C. M.', 'year': '1997', 'journal-title': 'Tech. Report. Neural Comput. Res. Group.'} / Tech. Report. Neural Comput. Res. Group. by Bishop C. M. (1997)
  23. 10.1007/BF00201801
  24. J. F. M. Svensén PhD Thesis Aston University (UK)1998.
  25. 10.1016/S0925-2312(98)00043-5
  26. 10.1021/ci1004042
  27. 10.1007/978-3-540-45167-9_6
  28. {'key': 'e_1_2_7_28_2', 'first-page': '819', 'volume': '5', 'author': 'Jebara T.', 'year': '2004', 'journal-title': 'J. Mach. Learn. Res.'} / J. Mach. Learn. Res. by Jebara T. (2004)
  29. 10.1021/jm0608356
  30. 10.1017/CBO9780511790423
  31. 10.1111/j.2517-6161.1977.tb01600.x / J. Roy. Stat. Soc. B Met. by Dempster A. P. (1977)
  32. {'key': 'e_1_2_7_32_2', 'first-page': '340', 'volume': '41', 'author': 'Kullback S.', 'year': '1987', 'journal-title': 'Am. Statistician'} / Am. Statistician by Kullback S. (1987)
  33. 10.1021/ci049714
  34. Chemaxon Standardizer http://www.chemaxon.com/library/scientific‐presentations/standardizer/.
  35. Instant JChem www.chemaxon.com/products/instant‐jchem/.
  36. 10.1002/minf.201000099
  37. http://www1.aston.ac.uk/eas/research/groups/ncrg/resources/netlab/.
  38. {'key': 'e_1_2_7_38_2', 'volume-title': 'Algorithms for Pattern Recognition', 'author': 'Nabney I.', 'year': '2002'} / Algorithms for Pattern Recognition by Nabney I. (2002)
  39. 10.1145/1656274.1656278
Dates
Type When
Created 13 years, 4 months ago (April 4, 2012, 11:40 a.m.)
Deposited 1 year, 3 months ago (April 22, 2024, 3:56 p.m.)
Indexed 3 hours, 33 minutes ago (Aug. 21, 2025, 12:42 p.m.)
Issued 13 years, 4 months ago (April 1, 2012)
Published 13 years, 4 months ago (April 1, 2012)
Published Online 13 years, 4 months ago (April 4, 2012)
Published Print 13 years, 4 months ago (April 1, 2012)
Funders 0

None

@article{Kireeva_2012, title={Generative Topographic Mapping (GTM): Universal Tool for Data Visualization, Structure‐Activity Modeling and Dataset Comparison}, volume={31}, ISSN={1868-1751}, url={http://dx.doi.org/10.1002/minf.201100163}, DOI={10.1002/minf.201100163}, number={3–4}, journal={Molecular Informatics}, publisher={Wiley}, author={Kireeva, N. and Baskin, I. I. and Gaspar, H. A. and Horvath, D. and Marcou, G. and Varnek, A.}, year={2012}, month=apr, pages={301–312} }