Abstract
The field of materials science and engineering is on the cusp of a digital data revolution. After reviewing the nature of data science and Big Data, we discuss the features of materials data that distinguish them from data in other fields. We introduce the concept of process-structure-property (PSP) linkages and illustrate how the determination of PSPs is one of the main objectives of materials data science. Then we review a selection of materials databases, as well as important aspects of materials data management, such as storage hardware, archiving strategies, and data access strategies. We introduce the emerging field of materials data analytics, which focuses on data-driven approaches to extract and curate materials knowledge from available data sets. The critical need for materials e-collaboration platforms is highlighted, and we conclude the article with a number of suggestions regarding the near-term future of the materials data science field.
References
148
Referenced
217
{'key': 'B1', 'first-page': '21', 'volume': '69', 'author': 'Cleveland WS', 'year': '2001', 'journal-title': 'ISI Rev.'}
/ ISI Rev. by Cleveland WS (2001)10.1145/2500499
{'volume-title': 'The Fourth Paradigm: Data-Intensive Scientific Discovery.', 'year': '2009', 'author': 'Hey T', 'key': 'B3'}
/ The Fourth Paradigm: Data-Intensive Scientific Discovery. by Hey T (2009){'key': 'B4', 'first-page': '16.07', 'volume': '16', 'author': 'Anderson C', 'year': '2008', 'journal-title': 'Wired Mag.'}
/ Wired Mag. by Anderson C (2008)10.1109/MIC.2003.1167344
-
6. Li I, Dey A, Forlizzi J. 2010. A stage-based model of personal informatics systems. InProc. SIGCHI Conference on Human Factors in Computing Systems, pp. 557–66. New York: ACM
(
10.1145/1753326.1753409
) 10.1016/j.drudis.2008.11.015
10.1109/TSMCC.2003.809345
10.1007/s10916-006-7397-9
- 10. Dropbox, Inc. 2014.http://www.dropbox.com
- 11. GitHub. 2014.https://github.com
- 12. HUBzero. 2014.https://hubzero.org
- 13. nanoHUB. 2014.https://nanohub.org
- 14. National Science Board. 2005.Long-lived digital data collections: enabling research and education in the 21st century. Rep. NSB-05-40, National Science Board.http://www.nsf.gov/pubs/2005/nsb0540
10.1080/14685248.2012.674643
- 16. CERN Open Data Portal. 2014.http://opendata.cern.ch
- 17. Hubble Legacy Archive. 2014.http://hla.stsci.edu
- 18. CERN. 2014.http://home.web.cern.ch/about
- 19. Fermi National Accelerator Laboratory. 2014.http://www.fnal.gov
- 20. Relativistic Heavy Ion Collider. 2014.http://www.bnl.gov/rhic
- 21. Fuhrmann P. 2014.dCache, the overview. White Pap., dCache.http://www.dcache.org/manuals/dcache-whitepaper-light.pdf
- 22. dCache. 2014.http://www.dcache.org
- 23. DESY (Deutsches Elektronen-Synchrotron). 2014.http://www.desy.de
- 24. CERN: The Worldwide LHC Computing Grid. 2014.http://home.web.cern.ch/about/computing/worldwide-lhc-computing-grid
- 25. National Center for Biotechnology Information. 2014.http://www.ncbi.nlm.nih.gov
- 26. McDonald E, Brown C. 2014.Working with Big Data in bioinformatics.http://www.aosabook.org/en/posa/working-with-big-data-in-bioinformatics.html
- 27. NOAA (National Oceanic and Atmospheric Administration). 2014.http://www.nesdis.noaa.gov
- 28. NOAA View Data Exploration Tool. 2014.http://www.nnvl.noaa.gov/view
- 29. NOAA: National Operational Model Archive and Distribution System. 2014.http://nomads.ncdc.noaa.gov/data.php
- 30. GrADS Data Server. 2014.http://grads.iges.org/grads/gds/index.html
- 31. OPeNDAP. 2014.http://opendap.org
- 32. Earth Observing System Data and Information System. 2014.https://earthdata.nasa.gov/about-eosdis
- 33. National Snow and Ice Data Center. 2014.http://nsidc.org/daac/data-sets.html
10.1557/mrs.2013.187
{'volume-title': 'Integrated Computational Materials Engineering: A Transformational Discipline for Improved Competitiveness and National Security', 'year': '2008', 'author': 'Committee on Integrated Computational Materials Engineering, National Research Council', 'key': 'B35'}
/ Integrated Computational Materials Engineering: A Transformational Discipline for Improved Competitiveness and National Security by Committee on Integrated Computational Materials Engineering, National Research Council (2008)- 36. National Science and Technology Council, Executive Office of the President. 2011.Materials genome initiative for global competitiveness.http://www.whitehouse.gov/sites/default/files/microsites/ostp/materials_genome_initiative-final.pdf
10.1002/9783527641864
10.1179/1743280414Y.0000000043
- 39. McNulty E. 2014.Understanding Big Data: the seven V's.http://dataconomy.com/seven-vs-big-data/
10.1007/s11837-011-0116-0
- 41. Advanced Photon Source, Argonne National Laboratory. 2014.https://www1.aps.anl.gov
10.1557/mrs.2013.234
10.1002/9780470172551.ch17
- 44. Citrine Informatics. 2014.http://www.citrination.com
- 45. Clean Energy Project. 2014.http://cleanenergy.molecularspace.org
- 46. The Materials Project. 2014.http://www.materialsproject.org
- 47. Automatic-FLOW for Materials Discovery. 2014.http://www.aflowlib.org
- 48. CALPHAD (Computer Coupling of Phase Diagrams and Thermochemistry). 2014.http://www.calphad.org
- 49. Open Quantum Materials Database. 2014.http://oqmd.org
- 50. NIST (National Institute of Standards and Technology) Data Gateway. 2014.http://srdata.nist.gov/gateway/gateway?dblist=1
- 51. NIST Material Measurement Laboratory. 2014.http://www.ctcms.nist.gov/potentials/
- 52. MatWeb. 2014.http://www.matweb.com/
- 53. Granta. 2014.http://www.grantadesign.com/products/ces/
- 54. MatNavi (NIMS Materials Database). 2014.http://mits.nims.go.jp/index_en.html
{'key': 'B55', 'first-page': '28', 'volume': '90', 'author': 'Freiman S', 'year': '2011', 'journal-title': 'Am. Ceram. Soc. Bull.'}
/ Am. Ceram. Soc. Bull. by Freiman S (2011){'key': 'B56', 'first-page': '159', 'volume': '1', 'author': 'Kaufman J', 'year': '1986', 'journal-title': 'Mater. Prop. Data'}
/ Mater. Prop. Data by Kaufman J (1986){'volume-title': 'Microstructure Sensitive Design for Performance Optimization', 'year': '2012', 'author': 'Adams BL', 'key': 'B57'}
/ Microstructure Sensitive Design for Performance Optimization by Adams BL (2012){'volume-title': 'The Theory of Composites', 'year': '2001', 'author': 'Milton GW', 'key': 'B58'}
/ The Theory of Composites by Milton GW (2001)10.1007/978-1-4757-6355-3
10.1016/j.cad.2012.06.006
- 61. DREAM.3D. 2014.http://dream3d.bluequartz.net
- 62. Materials Atlas. 2014.https://cosmicweb.mse.iastate.edu/wiki/display/home/materials+atlas+home
- 63. Computational Materials Data Network. 2014.http://www.asminternational.org/web/cmdnetwork/about
10.1186/2193-9772-3-5
- 65. Material Data Management Consortium. 2014.http://www.mdmc.net
- 66. TMS (The Minerals, Metals and Materials Society). 2014.http://www.tms.org/
- 67. The Materials Cyberinfrastructure Portal. 2014.http://www.tms.org/cyberportal/
-
68. Patterson DA, Gibson G, Katz RH. 1988. A case for redundant arrays of inexpensive disks (RAID). InProc. 1988 ACM SIGMOD International Conference on Management of Data, pp. 109–16. Chicago: ACM
(
10.1145/971701.50214
) - 69. Schroeder B, Gibson G. 2007. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you?InProc. 5th USENIX Conference on File and Storage Technologies(FAST'07). San Jose, CA: USENIX
-
70. Ghemawat S, Gobioff H, Leung ST. 2003. The Google File System. InProc. 19th ACM Symposium on Operating System Principles.Bolton Landing, NY: ACM
(
10.1145/945445.945450
) - 71. Healey CG. 2014.CSC541: advanced data structures. Course Notes, Dep. Comput. Sci., NC State Univ.http://www.csc.ncsu.edu/faculty/healey/csc541/notes/file_sys.pdf
-
72. Rodeh O, Teperman A. 2003. zFS—a scalable distributed file system using object disks. InProc. 20th IEEE Conference on Mass Storage Systems and Technology, pp. 207–18. San Diego, CA: IEEE
(
10.1109/MASS.2003.1194858
) -
73. Shvachko K, Hairong K, Radia S, Chansler R. 2010. The Hadoop distributed file system. InProc. 26th IEEE Symposium on Mass Storage Systems and Technologies, pp. 1–10. Incline Village, NY: IEEE
(
10.1109/MSST.2010.5496972
) 10.1186/2193-9772-3-4
10.1088/0965-0393/18/6/065008
- 76. The HDF Group. 2014.http://www.hdfgroup.org/
- 77. NetCDF. 2014.http://www.unidata.ucar.edu/software/netcdf/index.html
- 78. PDB (Protein Data Bank). 2014.http://www.pdb.org/pdb/home/home.do
- 79. FITS Support Office (NASA/Goddard Space Flight Center). 2014.http://fits.gsfc.nasa.gov/
- 80. DICOM. 2014.http://medical.nema.org/
- 81. National Digital Information Infrastructure and Preservation Program. 2014.http://www.digitalpreservation.gov
- 82. Community Owned Digital Preservation Tool Registry. 2014.http://coptr.digipres.org/main_page
- 83. ISO (International Organization for Standardization) 26234:2012. 2014.http://www.iso.org/iso/catalogue_detail.htm?csnumber=43506
- 84. DOI. 2014.http://www.doi.org/
- 85. Handle.Net. 2014.http://www.handle.net/index.html
- 86. Corporation for National Research Initiatives. 2014.http://www.cnri.reston.va.us
- 87. Globus Online GridFTP. 2014.http://toolkit.globus.org/toolkit/docs/latest-stable/gridftp/
- 88. Internet Engineering Task Force. 2014.https://www.ietf.org
- 89. Globus Online. 2014.https://www.globus.org
-
90. Kumar S, Edwards J, Bremer PT, Knoll A, Christensen C, et al. 2014. Efficient I/O and storage of adaptive-resolution data. InProc. International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 413–23. Piscataway, NJ: IEEE
(
10.1109/SC.2014.39
) 10.1080/14685240802376389
- 92. ASTM Int. 2013.Standard test methods for determining average grain size. ASTM E112, ASTM Int.
- 93. ASTM Int. 2008.Standard test methods for characterizing duplex grain sizes. ASTM E1181, ASTM Int.
{'volume-title': 'The Image Processing Handbook', 'year': '1992', 'author': 'Russ J', 'key': 'B94'}
/ The Image Processing Handbook by Russ J (1992)10.1088/0370-1301/64/9/303
{'key': 'B96', 'first-page': '25', 'volume': '174', 'author': 'Petch N', 'year': '1953', 'journal-title': 'Iron Steel Inst. J.'}
/ Iron Steel Inst. J. by Petch N (1953){'volume-title': 'Strengthening Mechanisms in Crystal Plasticity', 'year': '2008', 'author': 'Argon A', 'key': 'B97'}
/ Strengthening Mechanisms in Crystal Plasticity by Argon A (2008){'key': 'B98', 'volume-title': 'Physical Metallurgy Principles', 'author': 'Reed-Hill R', 'year': '1994', 'edition': '3'}
/ Physical Metallurgy Principles by Reed-Hill R (1994)10.1016/S1369-7021(05)71123-8
10.1016/S1367-5931(00)00091-0
10.1103/PhysRevLett.91.135503
10.1126/science.280.5366.1099
10.1002/adfm.201301744
10.1557/mrs2006.229
10.1007/s10851-005-3616-0
10.1109/TIP.2013.2284071
10.1007/s11837-011-0113-3
- 108. EM/MPM Workbench. 2014.http://www.bluequartz.net/?page_id=97
10.1088/0965-0393/17/2/025002
10.1016/j.actamat.2007.09.039
10.1088/0965-0393/16/4/045008
10.1063/1.1830512
10.1038/nature05745
{'volume-title': 'Statistical Analysis of Microstructures in Materials Science', 'year': '2000', 'author': 'Ohser J', 'key': 'B114'}
/ Statistical Analysis of Microstructures in Materials Science by Ohser J (2000)10.1016/j.actamat.2005.03.052
10.1016/j.actamat.2008.07.005
10.1016/j.actamat.2007.10.044
10.1016/j.pmatsci.2009.08.002
{'key': 'B119', 'first-page': '79', 'volume': '14', 'author': 'Niezgoda SR', 'year': '2009', 'journal-title': 'Comput. Mater. Contin.'}
/ Comput. Mater. Contin. by Niezgoda SR (2009)10.1016/j.actamat.2010.04.041
10.1016/j.jpowsour.2011.09.035
10.1016/j.actamat.2012.06.026
10.1016/j.msea.2007.10.087
10.1016/j.commatsci.2004.01.038
10.1103/PhysRevE.56.3203
10.1186/2193-9772-2-3
10.1007/s11837-011-0057-7
10.5402/2012/305692
10.1016/j.actamat.2010.10.008
10.1016/j.actamat.2011.04.005
10.1016/j.actamat.2010.01.007
10.1016/j.actamat.2014.08.022
10.1186/s40192-014-0024-6
- 134. Google Docs. 2014.https://docs.google.com/
- 135. Authorea. 2014.https://www.authorea.com/
- 136. ShareLaTeX. 2014.https://www.sharelatex.com/
- 137. Mendeley. 2014.http://www.mendeley.com/
- 138. ResearchGate. 2014.http://www.researchgate.net/
- 139. Sourceforge. 2014.http://sourceforge.net/
- 140. Plotly. 2014.https://plot.ly/
- 141. Google+. 2014.https://plus.google.com/
- 142. LinkedIn. 2014.https://www.linkedin.com/
- 143. Materials Microcharacterization Collaboratory. 2014.http://web.ornl.gov/sci/doe2k/MICSReview/99/
- 144. TelePresence Microscopy Collaboratory. 2014.http://tpm.amc.anl.gov
- 145. MGI (Materials Genome Initiative) Digital Data Community. 2014.https://www.linkedin.com/groups/mgi-digital-data-community-7459917
- 146. The PRISMS Center: Materials Commons. 2014.http://prisms.engin.umich.edu/#/prisms
-
147. Dabbish L, Stuart C, Tsay J, Herbsleb J. 2012. Social coding in GitHub: transparency and collaboration in an open software repository. InProc. ACM 2012 Conference on Computer Supported Cooperative Work, pp. 1277–86. New York: ACM
(
10.1145/2145204.2145396
) - 148. maTIN. 2015.http://materials.gatech.edu/matin
Dates
Type | When |
---|---|
Created | 10 years, 2 months ago (July 1, 2015, 7:10 p.m.) |
Deposited | 3 months, 1 week ago (May 28, 2025, 7:57 p.m.) |
Indexed | 1 week ago (Aug. 28, 2025, 8:29 a.m.) |
Issued | 10 years, 2 months ago (July 1, 2015) |
Published | 10 years, 2 months ago (July 1, 2015) |
Published Print | 10 years, 2 months ago (July 1, 2015) |
@article{Kalidindi_2015, title={Materials Data Science: Current Status and Future Outlook}, volume={45}, ISSN={1545-4118}, url={http://dx.doi.org/10.1146/annurev-matsci-070214-020844}, DOI={10.1146/annurev-matsci-070214-020844}, number={1}, journal={Annual Review of Materials Research}, publisher={Annual Reviews}, author={Kalidindi, Surya R. and De Graef, Marc}, year={2015}, month=jul, pages={171–193} }