Crossref journal-article
Springer Science and Business Media LLC
Integrating Materials and Manufacturing Innovation (297)
Abstract

AbstractThis paper describes the use of data analytics tools for predicting the fatigue strength of steels. Several physics-based as well as data-driven approaches have been used to arrive at correlations between various properties of alloys and their compositions and manufacturing process parameters. Data-driven approaches are of significant interest to materials engineers especially in arriving at extreme value properties such as cyclic fatigue, where the current state-of-the-art physics based models have severe limitations. Unfortunately, there is limited amount of documented success in these efforts. In this paper, we explore the application of different data science techniques, including feature selection and predictive modeling, to the fatigue properties of steels, utilizing the data from the National Institute for Material Science (NIMS) public domain database, and present a systematic end-to-end framework for exploring materials informatics. Results demonstrate that several advanced data analytics techniques such as neural networks, decision trees, and multivariate polynomial regression can achieve significant improvement in the prediction accuracy over previous efforts, withR2values over 0.97. The results have successfully demonstrated the utility of such data mining tools for ranking the composition and process parameters in the order of their potential for predicting fatigue strength of steels, and actually develop predictive models for the same.

Bibliography

Agrawal, A., Deshpande, P. D., Cecen, A., Basavarsu, G. P., Choudhary, A. N., & Kalidindi, S. R. (2014). Exploration of data science techniques to predict fatigue strength of steel from composition and processing parameters. Integrating Materials and Manufacturing Innovation, 3(1), 90–108.

Authors 6
  1. Ankit Agrawal (first)
  2. Parijat D Deshpande (additional)
  3. Ahmet Cecen (additional)
  4. Gautham P Basavarsu (additional)
  5. Alok N Choudhary (additional)
  6. Surya R Kalidindi (additional)
References 44 Referenced 216
  1. Committee on Integrated Computational Materials Engineering N. R. C.: Integrated Computational Materials Engineering: A Transformational Discipline for Improved Competitiveness and National Security. 2008.http://www.nap.edu/openbook.php?record_id=12199 / Integrated Computational Materials Engineering: A Transformational Discipline for Improved Competitiveness and National Security by N. R. C. Committee on Integrated Computation (2008)
  2. National Science and Technology Council: Materials genome initiative for global competitiveness. Technical report, National Science and Technology Council. 2011.http://www.whitehouse.gov/sites/default/files/microsites/ostp/materials_genome_initiative-final.pdf / Materials genome initiative for global competitiveness. Technical report, National Science and Technology Council by National Science and Technology Council (2011)
  3. Kalidindi SR, Niezgoda SR, Salem AA: Microstructure informatics using higher-order statistics and efficient data-mining protocols. JOM - J Minerals, Met Mater Soc 2011, 63(4):40–41. / JOM - J Minerals, Met Mater Soc by SR Kalidindi (2011)
  4. Rajan K: Materials informatics. Materials Today 2005, 8(10):38–45. 10.1016/S1369-7021(05)71123-8 (10.1016/S1369-7021(05)71123-8) / Materials Today by K Rajan (2005)
  5. Hey T, Tansley S, Tolle K: The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research, 1st edition. 2009.http://research.microsoft.com/en-us/collaboration/fourthparadigm/ ISBN: 0982544200, URL: . / The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research, 1st edition by T Hey (2009)
  6. Linden G, Smith B, York J: Amazon.com recommendations: item-to-item collaborative filtering. Internet Comput IEEE 2003, 7(1):76–80. 10.1109/MIC.2003.1167344 (10.1109/MIC.2003.1167344) / Internet Comput IEEE by G Linden (2003)
  7. Mobasher B: Data mining for web personalization. In Brusilovsky P, Kobsa A, Nejdl W (eds) The adaptive web. Lecture Notes in Computer Science, vol. 4321. Berlin, Heidelberg: Springer-Verlag; 2007:90–135. / Brusilovsky P, Kobsa A, Nejdl W (eds) The adaptive web. Lecture Notes in Computer Science, vol. 4321 by B Mobasher (2007)
  8. Zhou Y, Wilkinson D, Schreiber R, Pan R: Large-scale parallel collaborative filtering for the netflix prize. In Proceedings of the 4th International Conference on Algorithmic Aspects in Information and Management. Berlin, Heidelberg: AAIM ’08, Springer; 2008:337–348. / Proceedings of the 4th International Conference on Algorithmic Aspects in Information and Management by Y Zhou (2008)
  9. Das AS, Datar M, Garg A, Rajaram S: Google news personalization: Scalable online collaborative filtering. In Proceedings of the 16th International Conference on World Wide Web. New York, NY, USA: WWW ’07, ACM; 2007:271–280. (10.1145/1242572.1242610) / Proceedings of the 16th International Conference on World Wide Web by AS Das (2007)
  10. URL: Walmart is making big data part of its DNA. Bigdata startups, 2013, . http://www.bigdata-startups.com/BigData-startup/walmart-making-big-data-part-dna/
  11. King M: URL: Data Mining the TARGET way. 2012.http://www.slideshare.net/ipullrank/datamining-the-target-way / URL: Data Mining the TARGET way by M King (2012)
  12. Rajan K, Suh C, Mendez P: Principal component analysis and dimensional analysis as materials informatics tools to reduce dimensionality in materials science and engineering. Stat Anal Data Min 2009, 1: 361–371. 10.1002/sam.10031 (10.1002/sam.10031) / Stat Anal Data Min by K Rajan (2009)
  13. Suh C, Rajan K: Virtual screening and qsar formulations for crystal chemistry. QSAR & Comb. Sci 2005, 24(1):114–119. 10.1002/qsar.200420057 (10.1002/qsar.200420057) / QSAR & Comb. Sci by C Suh (2005)
  14. Nowers JR, Broderick SR, Rajan K, Narasimhan B: Combinatorial methods and informatics provide insight into physical properties and structure relationships during ipn formation. Macromol Rapid Commun 2007, 28: 972–976. 10.1002/marc.200600780 (10.1002/marc.200600780) / Macromol Rapid Commun by JR Nowers (2007)
  15. Gadzuric S, Suh C, Gaune-Escard M, Rajan K: Extracting information from the molten salt database. Metallogr Mater Trans A 2006, 37(12):3411–3414. 10.1007/s11661-006-1034-6 (10.1007/s11661-006-1034-6) / Metallogr Mater Trans A by S Gadzuric (2006)
  16. George L, Hrubiak R, Rajan K, Saxena SK: Principal component analysis on properties of binary and ternary hydrides and a comparison of metal versus metal hydride properties. J Alloys Compounds 2009, 478(1–2):731–735. (10.1016/j.jallcom.2008.11.137) / J Alloys Compounds by L George (2009)
  17. Singh S, Bhadeshia H, MacKay D, Carey H, Martin I: Neural network analysis of steel plate processing. Iron-mak Steelmak 1998, 25: 355–365. / Iron-mak Steelmak by S Singh (1998)
  18. Fujii H, MacKay D, Bhadeshia H: Bayesian neural network analysis of fatigue crack growth rate in nickel base superalloys. ISIJ INT 1996, 36: 1373–1382. 10.2355/isijinternational.36.1373 (10.2355/isijinternational.36.1373) / ISIJ INT by H Fujii (1996)
  19. Hancheng Q, Bocai X, Shangzheng L, Fagen W: Fuzzy neural network modeling of material properties. J Mater Process Technol 2002, 122(2–3):196–200. (10.1016/S0924-0136(02)00019-5) / J Mater Process Technol by Q Hancheng (2002)
  20. Gopalakrishnan K, Ceylan H, Kim S, Khaitan SK: Natural selection of asphalt mix stiffness predictive models with genetic programming. ANNIE Int Eng Syst Artif Neural Netw 2010, 20: 10. / ANNIE Int Eng Syst Artif Neural Netw by K Gopalakrishnan (2010)
  21. Gopalakrishnan K, Manik A, Khaitan SK: Runway stiffness evaluation using an artificial neural systems approach. Int J Electrical Comput Eng 2006, 1(7):496–502. / Int J Electrical Comput Eng by K Gopalakrishnan (2006)
  22. Wen YF, Cai CZ, Liu XH, Pei JF, Zhu XJ, Xiao TT: Corrosion rate prediction of 3c steel under different seawater environment by using support vector regression. Corrosion Sci 2009, 51(2):349–355. 10.1016/j.corsci.2008.10.038 (10.1016/j.corsci.2008.10.038) / Corrosion Sci by YF Wen (2009)
  23. Rao BV, Gopalakrishna SJ: Hardgrove grindability index prediction using support vector regression. Int J Miner Process 2009, 91(1–2):55–59. / Int J Miner Process by BV Rao (2009)
  24. Gautham BP, Kumar R, Bothra S, Mohapatra G, Kulkarni N, Padmanabhan KA: More Efficient ICME through Materials Informatics and Process Modeling. In Proceedings of the 1st World Congress on Integrated Computational Materials Engineering (ICME) (eds J. Allison, P. Collins and G. Spanos). Hoboken, NJ, USA: John Wiley & Sons, Inc.; 2011. doi: 10.1002/9781118147726.ch5 doi: 10.1002/9781118147726.ch5 / Proceedings of the 1st World Congress on Integrated Computational Materials Engineering (ICME) (eds J. Allison, P. Collins and G. Spanos) by BP Gautham (2011)
  25. Dieter GE: Mechanical Metallurgy. Mc Graw-Hill Book Co. 3rd edition, ISBN: 0–07–016893–8 26. Deshpande PD, Gautham BP, Cecen A, Kalidindi S, Agrawal A, Choudhary (2013) Application of Statistical and Machine Learning Techniques for Correlating Properties to Composition and Manufacturing Processes of Steels. In 2nd World Congress on Integrated Computational Materials Engineering. John Wiley & Sons, Inc.; 1986:155–160. ISBN: 9781118767061 ISBN: 9781118767061 / 2nd World Congress on Integrated Computational Materials Engineering by GE Dieter (1986)
  26. Deshpande PD, Gautham BP, Cecen A, Kalidindi S, Agrawal A, Choudhary: Application of Statistical and Machine Learning Techniques for Correlating Properties to Composition and Manufacturing Processes of Steels. In 2nd World Congress on Integrated Computational Materials Engineering. John Wiley & Sons, Inc.; 2013:155–160. ISBN: 9781118767061 ISBN: 9781118767061 (10.1002/9781118767061.ch25) / 2nd World Congress on Integrated Computational Materials Engineering by PD Deshpande (2013)
  27. URL: National Institute of Materials Science . http://smds.nims.go.jp/fatigue/index_en.html .
  28. Weher E, Allen EL: An introduction to linear regression and correlation. (a series of books in psychology.) w. h. freeman and comp., San Francisco 1976. 213 s., tafelanh., s 7.00. Biom J 1977, 19(1):83–84. (10.1002/bimj.4710190121) / Biom J by E Weher (1977)
  29. Wang Y: A new approach to fitting linear models in high dimensional spaces. 2000.http://books.google.com/books?id=Z0OntgAACAAJ / A new approach to fitting linear models in high dimensional spaces by Y Wang (2000)
  30. Wang Y, Witten IH: Modeling for optimal probability prediction. In Proceedings of the Nineteenth International Conference on Machine Learning. Morgan Kaufmann Publishers; 2002:650–657. / Proceedings of the Nineteenth International Conference on Machine Learning by Y Wang (2002)
  31. URL: Robust Fit Regression, Mathworks . http://www.mathworks.in/help/stats/robustfit.html .
  32. Aha DW, Kibler D: Instance-based learning algorithms. Machine Learning Vol. 6. Boston,: Kluwer Academic Publishers; 1991. pp 37–66 pp 37–66 / Instance-based learning algorithms. Machine Learning Vol. 6 by DW Aha (1991)
  33. Cleary JG, Trigg LE: K*: An instance-based learner using an entropic distance measure. In Proceedings of the 12th International Conference on Machine Learning. Morgan Kaufmann Publishers; 1995:108–114. / Proceedings of the 12th International Conference on Machine Learning by JG Cleary (1995)
  34. Kohavi R: The power of decision tables. In Proceedings of the 8th European Conference on Machine Learning. London, UK: ECML ’95, Springer-Verlag; 1995:174–189. / Proceedings of the 8th European Conference on Machine Learning by R Kohavi (1995)
  35. Vapnik VN: The nature of statistical learning theory. Information Science and Statistics Series. New York, NY, USA: Springer-Verlag New York, Inc.; 1995. ISBN: 0–387–94559–8 ISBN: 0-387-94559-8 / The nature of statistical learning theory. Information Science and Statistics Series by VN Vapnik (1995)
  36. Bishop C: Neural Networks for Pattern Recognition. USA: Oxford University Press; 1995. ISBN: 0198538642 ISBN: 0198538642 (10.1093/oso/9780198538493.001.0001) / Neural Networks for Pattern Recognition by C Bishop (1995)
  37. Fausett L: Fundamentals of Neural Networks. New York: Prentice Hall, Pearson; 1994. ISBN: 0133341860 ISBN: 0133341860 / Fundamentals of Neural Networks by L Fausett (1994)
  38. Witten IH, Frank E: Data Mining: Practical Machine Learning Tools and Techniques. The Morgan Kaufmann Series in Data Management Systems,. Morgan Kaufmann Publishers; 2005. ISBN: 0120884070 ISBN: 0120884070 / Data Mining: Practical Machine Learning Tools and Techniques. The Morgan Kaufmann Series in Data Management Systems, by IH Witten (2005)
  39. Wang Y, Witten IH: Induction of model trees for predicting continuous classes. In Proc European Conference on Machine Learning Poster Papers. Prague, Czech Republic; 1997:128–137. / Proc European Conference on Machine Learning Poster Papers by Y Wang (1997)
  40. Quinlan JR: Learning with continuous classes. In 5th Australian Joint Conference on Artificial Intelligence. World Scientific; 1992:343–348. / 5th Australian Joint Conference on Artificial Intelligence by JR Quinlan (1992)
  41. Breiman L, Friedman J, Olshen R, Stone C: Classification and Regression Trees. Monterey, CA: Wadsworth and Brooks; 1984. / Classification and Regression Trees by L Breiman (1984)
  42. R Development Core Team: R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria: R Foundation for Statistical Computing; 2011. ISBN 3–900051–07–0. . http://www.R-project.org. / R: A Language and Environment for Statistical Computing by R Development Core Team (2011)
  43. MATLAB: Version 7.10.0 (R2010a). Natick, Massachusetts: The MathWorks Inc.; 2010. / Version 7.10.0 (R2010a) by MATLAB (2010)
  44. Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH: The weka data mining software: An update. SIGKDD Explorations Newsletter 2009, 11(1):10–18. doi:10.1145/1656274.1656278, URL: . http://doi.acm.org/10.1145/1656274.1656278 (10.1145/1656274.1656278) / SIGKDD Explorations Newsletter by M Hall (2009)
Dates
Type When
Created 11 years, 4 months ago (April 7, 2014, 3:41 p.m.)
Deposited 1 year, 2 months ago (May 25, 2024, 2:50 p.m.)
Indexed 4 hours, 8 minutes ago (Aug. 23, 2025, 1:10 a.m.)
Issued 11 years, 4 months ago (April 3, 2014)
Published 11 years, 4 months ago (April 3, 2014)
Published Online 11 years, 4 months ago (April 3, 2014)
Published Print 10 years, 8 months ago (Dec. 1, 2014)
Funders 0

None

@article{Agrawal_2014, title={Exploration of data science techniques to predict fatigue strength of steel from composition and processing parameters}, volume={3}, ISSN={2193-9772}, url={http://dx.doi.org/10.1186/2193-9772-3-8}, DOI={10.1186/2193-9772-3-8}, number={1}, journal={Integrating Materials and Manufacturing Innovation}, publisher={Springer Science and Business Media LLC}, author={Agrawal, Ankit and Deshpande, Parijat D and Cecen, Ahmet and Basavarsu, Gautham P and Choudhary, Alok N and Kalidindi, Surya R}, year={2014}, month=apr, pages={90–108} }