10.1146/annurev-matsci-070214-020844
Crossref journal-article
Annual Reviews
Annual Review of Materials Research (22)
Abstract

The field of materials science and engineering is on the cusp of a digital data revolution. After reviewing the nature of data science and Big Data, we discuss the features of materials data that distinguish them from data in other fields. We introduce the concept of process-structure-property (PSP) linkages and illustrate how the determination of PSPs is one of the main objectives of materials data science. Then we review a selection of materials databases, as well as important aspects of materials data management, such as storage hardware, archiving strategies, and data access strategies. We introduce the emerging field of materials data analytics, which focuses on data-driven approaches to extract and curate materials knowledge from available data sets. The critical need for materials e-collaboration platforms is highlighted, and we conclude the article with a number of suggestions regarding the near-term future of the materials data science field.

Bibliography

Kalidindi, S. R., & De Graef, M. (2015). Materials Data Science: Current Status and Future Outlook. Annual Review of Materials Research, 45(1), 171–193.

Authors 2
  1. Surya R. Kalidindi (first)
  2. Marc De Graef (additional)
References 148 Referenced 217
  1. {'key': 'B1', 'first-page': '21', 'volume': '69', 'author': 'Cleveland WS', 'year': '2001', 'journal-title': 'ISI Rev.'} / ISI Rev. by Cleveland WS (2001)
  2. 10.1145/2500499
  3. {'volume-title': 'The Fourth Paradigm: Data-Intensive Scientific Discovery.', 'year': '2009', 'author': 'Hey T', 'key': 'B3'} / The Fourth Paradigm: Data-Intensive Scientific Discovery. by Hey T (2009)
  4. {'key': 'B4', 'first-page': '16.07', 'volume': '16', 'author': 'Anderson C', 'year': '2008', 'journal-title': 'Wired Mag.'} / Wired Mag. by Anderson C (2008)
  5. 10.1109/MIC.2003.1167344
  6. 6. Li I, Dey A, Forlizzi J. 2010. A stage-based model of personal informatics systems. InProc. SIGCHI Conference on Human Factors in Computing Systems, pp. 557–66. New York: ACM (10.1145/1753326.1753409)
  7. 10.1016/j.drudis.2008.11.015
  8. 10.1109/TSMCC.2003.809345
  9. 10.1007/s10916-006-7397-9
  10. 10. Dropbox, Inc. 2014.http://www.dropbox.com
  11. 11. GitHub. 2014.https://github.com
  12. 12. HUBzero. 2014.https://hubzero.org
  13. 13. nanoHUB. 2014.https://nanohub.org
  14. 14. National Science Board. 2005.Long-lived digital data collections: enabling research and education in the 21st century. Rep. NSB-05-40, National Science Board.http://www.nsf.gov/pubs/2005/nsb0540
  15. 10.1080/14685248.2012.674643
  16. 16. CERN Open Data Portal. 2014.http://opendata.cern.ch
  17. 17. Hubble Legacy Archive. 2014.http://hla.stsci.edu
  18. 18. CERN. 2014.http://home.web.cern.ch/about
  19. 19. Fermi National Accelerator Laboratory. 2014.http://www.fnal.gov
  20. 20. Relativistic Heavy Ion Collider. 2014.http://www.bnl.gov/rhic
  21. 21. Fuhrmann P. 2014.dCache, the overview. White Pap., dCache.http://www.dcache.org/manuals/dcache-whitepaper-light.pdf
  22. 22. dCache. 2014.http://www.dcache.org
  23. 23. DESY (Deutsches Elektronen-Synchrotron). 2014.http://www.desy.de
  24. 24. CERN: The Worldwide LHC Computing Grid. 2014.http://home.web.cern.ch/about/computing/worldwide-lhc-computing-grid
  25. 25. National Center for Biotechnology Information. 2014.http://www.ncbi.nlm.nih.gov
  26. 26. McDonald E, Brown C. 2014.Working with Big Data in bioinformatics.http://www.aosabook.org/en/posa/working-with-big-data-in-bioinformatics.html
  27. 27. NOAA (National Oceanic and Atmospheric Administration). 2014.http://www.nesdis.noaa.gov
  28. 28. NOAA View Data Exploration Tool. 2014.http://www.nnvl.noaa.gov/view
  29. 29. NOAA: National Operational Model Archive and Distribution System. 2014.http://nomads.ncdc.noaa.gov/data.php
  30. 30. GrADS Data Server. 2014.http://grads.iges.org/grads/gds/index.html
  31. 31. OPeNDAP. 2014.http://opendap.org
  32. 32. Earth Observing System Data and Information System. 2014.https://earthdata.nasa.gov/about-eosdis
  33. 33. National Snow and Ice Data Center. 2014.http://nsidc.org/daac/data-sets.html
  34. 10.1557/mrs.2013.187
  35. {'volume-title': 'Integrated Computational Materials Engineering: A Transformational Discipline for Improved Competitiveness and National Security', 'year': '2008', 'author': 'Committee on Integrated Computational Materials Engineering, National Research Council', 'key': 'B35'} / Integrated Computational Materials Engineering: A Transformational Discipline for Improved Competitiveness and National Security by Committee on Integrated Computational Materials Engineering, National Research Council (2008)
  36. 36. National Science and Technology Council, Executive Office of the President. 2011.Materials genome initiative for global competitiveness.http://www.whitehouse.gov/sites/default/files/microsites/ostp/materials_genome_initiative-final.pdf
  37. 10.1002/9783527641864
  38. 10.1179/1743280414Y.0000000043
  39. 39. McNulty E. 2014.Understanding Big Data: the seven V's.http://dataconomy.com/seven-vs-big-data/
  40. 10.1007/s11837-011-0116-0
  41. 41. Advanced Photon Source, Argonne National Laboratory. 2014.https://www1.aps.anl.gov
  42. 10.1557/mrs.2013.234
  43. 10.1002/9780470172551.ch17
  44. 44. Citrine Informatics. 2014.http://www.citrination.com
  45. 45. Clean Energy Project. 2014.http://cleanenergy.molecularspace.org
  46. 46. The Materials Project. 2014.http://www.materialsproject.org
  47. 47. Automatic-FLOW for Materials Discovery. 2014.http://www.aflowlib.org
  48. 48. CALPHAD (Computer Coupling of Phase Diagrams and Thermochemistry). 2014.http://www.calphad.org
  49. 49. Open Quantum Materials Database. 2014.http://oqmd.org
  50. 50. NIST (National Institute of Standards and Technology) Data Gateway. 2014.http://srdata.nist.gov/gateway/gateway?dblist=1
  51. 51. NIST Material Measurement Laboratory. 2014.http://www.ctcms.nist.gov/potentials/
  52. 52. MatWeb. 2014.http://www.matweb.com/
  53. 53. Granta. 2014.http://www.grantadesign.com/products/ces/
  54. 54. MatNavi (NIMS Materials Database). 2014.http://mits.nims.go.jp/index_en.html
  55. {'key': 'B55', 'first-page': '28', 'volume': '90', 'author': 'Freiman S', 'year': '2011', 'journal-title': 'Am. Ceram. Soc. Bull.'} / Am. Ceram. Soc. Bull. by Freiman S (2011)
  56. {'key': 'B56', 'first-page': '159', 'volume': '1', 'author': 'Kaufman J', 'year': '1986', 'journal-title': 'Mater. Prop. Data'} / Mater. Prop. Data by Kaufman J (1986)
  57. {'volume-title': 'Microstructure Sensitive Design for Performance Optimization', 'year': '2012', 'author': 'Adams BL', 'key': 'B57'} / Microstructure Sensitive Design for Performance Optimization by Adams BL (2012)
  58. {'volume-title': 'The Theory of Composites', 'year': '2001', 'author': 'Milton GW', 'key': 'B58'} / The Theory of Composites by Milton GW (2001)
  59. 10.1007/978-1-4757-6355-3
  60. 10.1016/j.cad.2012.06.006
  61. 61. DREAM.3D. 2014.http://dream3d.bluequartz.net
  62. 62. Materials Atlas. 2014.https://cosmicweb.mse.iastate.edu/wiki/display/home/materials+atlas+home
  63. 63. Computational Materials Data Network. 2014.http://www.asminternational.org/web/cmdnetwork/about
  64. 10.1186/2193-9772-3-5
  65. 65. Material Data Management Consortium. 2014.http://www.mdmc.net
  66. 66. TMS (The Minerals, Metals and Materials Society). 2014.http://www.tms.org/
  67. 67. The Materials Cyberinfrastructure Portal. 2014.http://www.tms.org/cyberportal/
  68. 68. Patterson DA, Gibson G, Katz RH. 1988. A case for redundant arrays of inexpensive disks (RAID). InProc. 1988 ACM SIGMOD International Conference on Management of Data, pp. 109–16. Chicago: ACM (10.1145/971701.50214)
  69. 69. Schroeder B, Gibson G. 2007. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you?InProc. 5th USENIX Conference on File and Storage Technologies(FAST'07). San Jose, CA: USENIX
  70. 70. Ghemawat S, Gobioff H, Leung ST. 2003. The Google File System. InProc. 19th ACM Symposium on Operating System Principles.Bolton Landing, NY: ACM (10.1145/945445.945450)
  71. 71. Healey CG. 2014.CSC541: advanced data structures. Course Notes, Dep. Comput. Sci., NC State Univ.http://www.csc.ncsu.edu/faculty/healey/csc541/notes/file_sys.pdf
  72. 72. Rodeh O, Teperman A. 2003. zFS—a scalable distributed file system using object disks. InProc. 20th IEEE Conference on Mass Storage Systems and Technology, pp. 207–18. San Diego, CA: IEEE (10.1109/MASS.2003.1194858)
  73. 73. Shvachko K, Hairong K, Radia S, Chansler R. 2010. The Hadoop distributed file system. InProc. 26th IEEE Symposium on Mass Storage Systems and Technologies, pp. 1–10. Incline Village, NY: IEEE (10.1109/MSST.2010.5496972)
  74. 10.1186/2193-9772-3-4
  75. 10.1088/0965-0393/18/6/065008
  76. 76. The HDF Group. 2014.http://www.hdfgroup.org/
  77. 77. NetCDF. 2014.http://www.unidata.ucar.edu/software/netcdf/index.html
  78. 78. PDB (Protein Data Bank). 2014.http://www.pdb.org/pdb/home/home.do
  79. 79. FITS Support Office (NASA/Goddard Space Flight Center). 2014.http://fits.gsfc.nasa.gov/
  80. 80. DICOM. 2014.http://medical.nema.org/
  81. 81. National Digital Information Infrastructure and Preservation Program. 2014.http://www.digitalpreservation.gov
  82. 82. Community Owned Digital Preservation Tool Registry. 2014.http://coptr.digipres.org/main_page
  83. 83. ISO (International Organization for Standardization) 26234:2012. 2014.http://www.iso.org/iso/catalogue_detail.htm?csnumber=43506
  84. 84. DOI. 2014.http://www.doi.org/
  85. 85. Handle.Net. 2014.http://www.handle.net/index.html
  86. 86. Corporation for National Research Initiatives. 2014.http://www.cnri.reston.va.us
  87. 87. Globus Online GridFTP. 2014.http://toolkit.globus.org/toolkit/docs/latest-stable/gridftp/
  88. 88. Internet Engineering Task Force. 2014.https://www.ietf.org
  89. 89. Globus Online. 2014.https://www.globus.org
  90. 90. Kumar S, Edwards J, Bremer PT, Knoll A, Christensen C, et al. 2014. Efficient I/O and storage of adaptive-resolution data. InProc. International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 413–23. Piscataway, NJ: IEEE (10.1109/SC.2014.39)
  91. 10.1080/14685240802376389
  92. 92. ASTM Int. 2013.Standard test methods for determining average grain size. ASTM E112, ASTM Int.
  93. 93. ASTM Int. 2008.Standard test methods for characterizing duplex grain sizes. ASTM E1181, ASTM Int.
  94. {'volume-title': 'The Image Processing Handbook', 'year': '1992', 'author': 'Russ J', 'key': 'B94'} / The Image Processing Handbook by Russ J (1992)
  95. 10.1088/0370-1301/64/9/303
  96. {'key': 'B96', 'first-page': '25', 'volume': '174', 'author': 'Petch N', 'year': '1953', 'journal-title': 'Iron Steel Inst. J.'} / Iron Steel Inst. J. by Petch N (1953)
  97. {'volume-title': 'Strengthening Mechanisms in Crystal Plasticity', 'year': '2008', 'author': 'Argon A', 'key': 'B97'} / Strengthening Mechanisms in Crystal Plasticity by Argon A (2008)
  98. {'key': 'B98', 'volume-title': 'Physical Metallurgy Principles', 'author': 'Reed-Hill R', 'year': '1994', 'edition': '3'} / Physical Metallurgy Principles by Reed-Hill R (1994)
  99. 10.1016/S1369-7021(05)71123-8
  100. 10.1016/S1367-5931(00)00091-0
  101. 10.1103/PhysRevLett.91.135503
  102. 10.1126/science.280.5366.1099
  103. 10.1002/adfm.201301744
  104. 10.1557/mrs2006.229
  105. 10.1007/s10851-005-3616-0
  106. 10.1109/TIP.2013.2284071
  107. 10.1007/s11837-011-0113-3
  108. 108. EM/MPM Workbench. 2014.http://www.bluequartz.net/?page_id=97
  109. 10.1088/0965-0393/17/2/025002
  110. 10.1016/j.actamat.2007.09.039
  111. 10.1088/0965-0393/16/4/045008
  112. 10.1063/1.1830512
  113. 10.1038/nature05745
  114. {'volume-title': 'Statistical Analysis of Microstructures in Materials Science', 'year': '2000', 'author': 'Ohser J', 'key': 'B114'} / Statistical Analysis of Microstructures in Materials Science by Ohser J (2000)
  115. 10.1016/j.actamat.2005.03.052
  116. 10.1016/j.actamat.2008.07.005
  117. 10.1016/j.actamat.2007.10.044
  118. 10.1016/j.pmatsci.2009.08.002
  119. {'key': 'B119', 'first-page': '79', 'volume': '14', 'author': 'Niezgoda SR', 'year': '2009', 'journal-title': 'Comput. Mater. Contin.'} / Comput. Mater. Contin. by Niezgoda SR (2009)
  120. 10.1016/j.actamat.2010.04.041
  121. 10.1016/j.jpowsour.2011.09.035
  122. 10.1016/j.actamat.2012.06.026
  123. 10.1016/j.msea.2007.10.087
  124. 10.1016/j.commatsci.2004.01.038
  125. 10.1103/PhysRevE.56.3203
  126. 10.1186/2193-9772-2-3
  127. 10.1007/s11837-011-0057-7
  128. 10.5402/2012/305692
  129. 10.1016/j.actamat.2010.10.008
  130. 10.1016/j.actamat.2011.04.005
  131. 10.1016/j.actamat.2010.01.007
  132. 10.1016/j.actamat.2014.08.022
  133. 10.1186/s40192-014-0024-6
  134. 134. Google Docs. 2014.https://docs.google.com/
  135. 135. Authorea. 2014.https://www.authorea.com/
  136. 136. ShareLaTeX. 2014.https://www.sharelatex.com/
  137. 137. Mendeley. 2014.http://www.mendeley.com/
  138. 138. ResearchGate. 2014.http://www.researchgate.net/
  139. 139. Sourceforge. 2014.http://sourceforge.net/
  140. 140. Plotly. 2014.https://plot.ly/
  141. 141. Google+. 2014.https://plus.google.com/
  142. 142. LinkedIn. 2014.https://www.linkedin.com/
  143. 143. Materials Microcharacterization Collaboratory. 2014.http://web.ornl.gov/sci/doe2k/MICSReview/99/
  144. 144. TelePresence Microscopy Collaboratory. 2014.http://tpm.amc.anl.gov
  145. 145. MGI (Materials Genome Initiative) Digital Data Community. 2014.https://www.linkedin.com/groups/mgi-digital-data-community-7459917
  146. 146. The PRISMS Center: Materials Commons. 2014.http://prisms.engin.umich.edu/#/prisms
  147. 147. Dabbish L, Stuart C, Tsay J, Herbsleb J. 2012. Social coding in GitHub: transparency and collaboration in an open software repository. InProc. ACM 2012 Conference on Computer Supported Cooperative Work, pp. 1277–86. New York: ACM (10.1145/2145204.2145396)
  148. 148. maTIN. 2015.http://materials.gatech.edu/matin
Dates
Type When
Created 10 years, 2 months ago (July 1, 2015, 7:10 p.m.)
Deposited 3 months, 1 week ago (May 28, 2025, 7:57 p.m.)
Indexed 1 week ago (Aug. 28, 2025, 8:29 a.m.)
Issued 10 years, 2 months ago (July 1, 2015)
Published 10 years, 2 months ago (July 1, 2015)
Published Print 10 years, 2 months ago (July 1, 2015)
Funders 0

None

@article{Kalidindi_2015, title={Materials Data Science: Current Status and Future Outlook}, volume={45}, ISSN={1545-4118}, url={http://dx.doi.org/10.1146/annurev-matsci-070214-020844}, DOI={10.1146/annurev-matsci-070214-020844}, number={1}, journal={Annual Review of Materials Research}, publisher={Annual Reviews}, author={Kalidindi, Surya R. and De Graef, Marc}, year={2015}, month=jul, pages={171–193} }