Abstract
Here, the utility of Generative Topographic Maps (GTM) for data visualization, structure‐activity modeling and database comparison is evaluated using subsets of the Database of Useful Decoys (DUD). Unlike other popular dimensionality reduction approaches such as Principal Component Analysis, Sammon Mapping or Self‐Organizing Maps, GTMs have the great advantage of providing data probability distribution functions (PDFs), both in the high‐dimensional space defined by molecular descriptors and in the 2D latent space. PDFs for the molecules of different activity classes were successfully used to build classification models within a Bayesian framework. Because the PDFs are represented by mixtures of Gaussian functions, the Bhattacharyya kernel is proposed as a measure of the overlap between datasets, which leads to an elegant method for the global comparison of chemical libraries.
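As a rough illustration of the overlap measure mentioned in the abstract, the sketch below computes the discrete Bhattacharyya coefficient between two probability distributions defined over the same set of map nodes. This is an assumption-laden simplification, not the paper's implementation: the article works with Gaussian-mixture PDFs, whereas this example uses plain per-node probabilities, and the function name and sample values are hypothetical.

```python
import numpy as np


def bhattacharyya_coefficient(p, q):
    """Discrete Bhattacharyya coefficient: sum_k sqrt(p_k * q_k).

    p and q are non-negative weights over the same nodes; they are
    normalised to sum to 1 before the overlap is computed.
    Returns a value in [0, 1]; 1 means identical distributions.
    """
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    if p.shape != q.shape:
        raise ValueError("p and q must be defined over the same nodes")
    if (p < 0).any() or (q < 0).any():
        raise ValueError("probabilities must be non-negative")
    p = p / p.sum()
    q = q / q.sum()
    return float(np.sum(np.sqrt(p * q)))


# Hypothetical node probabilities for two libraries on a 2x2 latent grid.
lib_a = [0.7, 0.2, 0.1, 0.0]
lib_b = [0.1, 0.2, 0.3, 0.4]

overlap_same = bhattacharyya_coefficient(lib_a, lib_a)
overlap_diff = bhattacharyya_coefficient(lib_a, lib_b)
print(overlap_same, overlap_diff)
```

A library compared with itself gives the maximum overlap, while libraries that occupy different regions of the map give a smaller value.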
References
39
Referenced
113
Dates
Type | When |
---|---|
Created | April 4, 2012, 11:40 a.m. |
Deposited | April 22, 2024, 3:56 p.m. |
Indexed | Aug. 21, 2025, 12:42 p.m. |
Issued | April 1, 2012 |
Published | April 1, 2012 |
Published Online | April 4, 2012 |
Published Print | April 1, 2012 |
@article{Kireeva_2012,
  title={Generative Topographic Mapping (GTM): Universal Tool for Data Visualization, Structure‐Activity Modeling and Dataset Comparison},
  volume={31},
  ISSN={1868-1751},
  url={http://dx.doi.org/10.1002/minf.201100163},
  DOI={10.1002/minf.201100163},
  number={3–4},
  journal={Molecular Informatics},
  publisher={Wiley},
  author={Kireeva, N. and Baskin, I. I. and Gaspar, H. A. and Horvath, D. and Marcou, G. and Varnek, A.},
  year={2012},
  month=apr,
  pages={301–312}
}