-
摘要: 在材料科學過去幾十年的發展過程中,經驗試錯法和基于密度泛函理論的方法等傳統的非晶合金開發方法,幫助研發人員探索出多種非晶合金體系。但是,這些方法由于開發周期長、效率低等缺點,目前已難以滿足研發人員的需求。而機器學習方法因其實驗成本低、性能強大以及開發周期短等優點,被越來越廣泛地應用到非晶合金材料的設計、分析和性能預測中。本文首先按照機器學習建模的主要流程闡述了各步驟的基本操作和發展情況。其次,著重介紹了數據預處理、模型構建以及模型驗證方面的研究工作,在數據預處理章節,簡述了數據收集、特征工程以及目前較為流行的數據預采樣方法;在模型構建章節,論述了四類在非晶合金開發中常用的機器學習算法,包括人工神經網絡、支持向量機、隨機森林以及極端梯度提升方法;在模型驗證章節,主要介紹了K折交叉驗證和留一法交叉驗證方法。最后,本文從多個角度對比分析了現有的機器學習應用,為后續的相關研究提供了可能的研究方向和思路。Abstract: Metallic glasses have received a lot of interest because of their excellent mechanical, physical, and chemical qualities. For example, they have a stronger resistivity than crystalline metals composed of the same elements and a lower viscosity coefficient. However, the difficulty in creating alloy compositions has been a concern for researchers. Traditional amorphous alloy systems design approaches, such as empirical trial-and-error methods and methods based on density functional theory (DFT), have assisted researchers in exploring numerous amorphous alloy systems during the growth of materials science over the last few decades. However, with the continuous development of materials science, these methods have been difficult to meet the needs of researchers due to their long development cycles and low efficiency. Additionally, the complex and long-range disordered structure of metallic glasses makes it difficult to understand their structure and nature in a comprehensive and clear way by conventional methods. Amorphous alloy composition design and property analysis are now often conducted using machine learning techniques because of their low experimental cost, short development cycle, strong data processing capability, and high predictive performance, among other advantages. They present new approaches and chances to address significant key bottlenecks in the field of metallic glass. In this study, the main processes of machine learning model building were introduced. Subsequently, the related studies on data pre-processing, model construction, and model validation were presented. For data pre-processing, data selection, feature engineering, and advanced data balancing methods were primarily described. In the feature engineering part, the model performance with various input features was examined, and it was shown that either employing physical properties or directly using the alloy compositions as the model input might result in high performance. Four machine learning algorithms were used to generate the machine learning model: artificial neural networks (ANN), support vector machines (SVM), random forest (RF), and extreme gradient boosting (XGBoost). A comparison indicates that SVM models work best with small data sets, whereas the performance of all other models tends to get better as the amount of training data increases. Generally, the XGBoost method outperforms several other methods and is, therefore, often used in machine learning competitions. Model validation approaches: K-fold cross-validation and leave-one-out cross-validation methods were presented. A good metallic glass performance prediction method needs to perform well in both validation methods. Finally, this study provides several possible future research directions on feature engineering, dataset construction, validation, and machine learning models.
-
Key words:
- metallic glasses /
- machine learning /
- alloy design /
- property prediction /
- data pre-processing /
- model selection /
- model validation
-
表 1 機器學習模型輸入特征和性能
Table 1. Input features and performance of machine learning model
Input features Machine learning models Features used Performance Physical properties ANN (MLP)[11] Pauling’s electronegativity difference (Δχ), atomic size difference (δ), theoretical density (ρ), ΔTx, parameter γm, fragility (m), enthalpy of mixing (Hmix), mismatch entropy (Sσ), configurational entropy (ΔSc), bulk modulus (K), and valence electron concentration (VEC). Accuracy (ACC) for ternary alloy prediction:77.8%~78.9%,
ACC for multicomponent alloys ( n > 3) prediction:83%.ANN (MLP)[34] Atomic properties (e.g., atomic radii, atomic radius mismatch), atomic packing properties, and thermodynamic properties (e.g., heat of mixing and configurational entropy of mixing), covering a comprehensive list used for general-purpose as well as for the MG systems (e.g., δ, Hmix), etc., 131 features in total. The model trained on the balanced dataset exhibits a 42% improvement in the alloy systems with sparse data and maintains the performance in the alloy systems with high-throughput data. ANN (MLP)[52] Tg, Tx, Tl. R2: 0.776. ANN (MLP)[53] The combined variable of lateral displacement and lateral force. MSE of validation dataset:
9.3158 × 10?2~3.0303 × 10?1,
MSE of testing dataset:
9.9836 × 10?2~2.9091 × 10?1.ANN (CNN)[52] Tg, Tx, Tl. R2: 0.716. SVM[40] 2 atomic weights (aw1, aw2), the mixing enthalpy (ΔH), 2 atomic radii (r1, r2), 2 liquidus temperatures for each element (Tliq1, Tliq2), the fictive liquidus temperature (Tfic), the difference in liquidus temperature (ΔTliq), and 2 contents of each element (c1, c2), etc., 11 features in total. Using the customized evaluation criterion E, the optimum result is 3.96 ( E = PTarget2 / Pall ). RF[44] 145 features, which fall into four distinct categories:
(1) Stoichiometric attributes,
(2) Elemental property statistics,
(3) Electronic structure attributes,
(4) Ionic compound attributes.ACC (for band gap prediction):90%,
ACC (for GFA prediction):80.2%.RF[45] Inputting all features in paper [44] and adding three additional features: Cluster Packing Efficiency Attributes, Nearest Special Clusters and Mean Packing Efficiency. ACC (for GFA prediction):89%,
MAE (for Dmax prediction):0.21 mm,
MAE (for ΔTx prediction):8.8 K.XGBoost[48] Theoretical melting point (Tm), theoretical molar volume (V), mean atom radius ($ \stackrel{-}{r} $), δ, ΔSc, ρ, electronegativity (χ), VEC, and valence electron concentration without FeCoNi (VEC'); 9 features in total. R2: 0.942. Alloy compositions ANN (CNN)[54] Alloy compositions. R2: 0.8137. 表 2 機器學習在非晶合金開發中的應用算法分析
Table 2. Advantages and limitations of several common machine learning algorithms
Methodology Advantages Limitations analysis ANN
(1) Liu et al.[11] used 11 alloy parameters or criteria as ANN inputs to predict GFA. The dataset they used contains 3227 data samples, achieving an accuracy of 77.8%~78.9% for ternary alloy prediction and 83% for multicomponent alloy (n > 3) prediction.
(2) Yao et al.[34] used 131 features as ANN inputs to predict GFA with a dataset containing 5725 data samples and proposed a data resampling method to balance the data sample size of different alloy systems. After data resampling, the model achieved a 42% performance improvement on data-barren alloy systems and maintained almost the same performance on data-sufficient alloy systems.
(3) Lu et al.[52] used Tg, Tx, and Tl as ANN input features to predict Dmax for amorphous alloys. They used a dataset containing 663 data samples and achieved an R2 of 0.776. In addition, they also used CNN for Dmax prediction, but their R2 of 0.717 is lower than that of the ANN.
(4) Yu et al.[53] used lateral force and lateral displacement as inputs to explore the deformation mechanism of plastic deformation of specific amorphous alloys using MLP regressions on the reconstructed lateral force. The MSEs for the validation and test datasets are 9.3158 × 10?2 ~ 3.0303 × 10?1 and 9.9836 × 10?2 ~ 2.9091 × 10?1, respectively.
(5) Zhang et al.[54] put all the elements that can make up the alloys into a 10 × 10 matrix, and used the alloy composition as the CNN model input to predict Dmax , achieving an R2 of 0.8137.(1) A high degree of accuracy of the ANNs.
(2) Better fault tolerance to data noise.
(3) Can fit complex non-linear relationships more easily.(1) A large number of hyperparameters to be tuned.
(2) Longer training time (especially for the deep neural network) compared to other models.
(3) The training process is difficult to observe, and the output is not interpretable, affecting the model credibility.SVM
(1) Sun et al.[40] used 11 features to predict the GFA of binary alloys with a training dataset of 31 alloys and a validation dataset containing 339 amorphous data samples and 1131 unlabeled data representing the likelihood of their use of the full range of binary compositions. The best prediction model in their test (E = 3.96) is when SVM parameters C = 2?6,γ = 22.
(2) Wu et al.[59] used SVM to structurally identify "liquid-like" atoms, which are prone to rearrangement, and "solid-like" atoms, which are not, and successfully identified the "liquid-like" atoms.(1) Performs better on small datasets.
(2) Good generalization capabilities.(1) Not suitable for large datasets.
(2) Lacks data sensitivity.
(3) Performs poorly when there are some noises in the dataset.RF
(1) Ward et al.[44] used 145 features as the input of the RF model. They predicted the band gap width of the alloy in a dataset containing 228,676 data samples and the amorphous state in a dataset containing 5,369 data samples, achieving 90% and 80.2% accuracy, respectively.
(2) Ward et al.[45] used 148 features as the input of the RF model, where 145 features are the same as Ward et al.[44]. They predicted the GFA, Dmax, and ΔTx in three data sets, with an accuracy of 89% for GFA, an MAE of 0.21 mm for Dmax, and an MAE of 8.8 K for ΔTx.(1) Easy to process high-dimensional feature data because the feature subset is randomly selected.
(2) Fast training speed.
(3) The interaction between input parameters can be detected in training. Also, the errors can be balanced for unbalanced data.(1) When regression is performed, it is impossible to make predictions beyond the scope of the training set data.
(2) Some specific noises will cause overfitting.
(3) Hard to converge on small datasets or insufficient input parameters.XGBoost
(1) Li et al.[48] used 9 input features to predict the saturated magnetization (Bs) of Fe-based MGs with a dataset containing 360 data samples. They tested the RF, ANN, and XGBoost methods, where XGBoost demonstrates the strongest correlation, with an R2 of 0.942.(1) Higher performance and accuracy compared to other algorithms.
(2) Very flexible, as it can build weak learners using various models.
(3) Adds the complexity of the tree model to the regular term to further avoid overfitting.(1) Sensitive to outlier samples, which may receive higher weights in the iterations. (2) High space complexity. XGBoost needs to store the feature values and the indexes of the gradient statistics of the samples corresponding to the features. 259luxu-164 -
參考文獻
[1] Klement W, Willens R H, Duwez P. Non-crystalline structure in solidified gold–silicon alloys. Nature, 1960, 187(4740): 869 [2] Koledin T D, Saini J S, Xu D H, et al. Characterization of a new rare-earth-free Cu-based bulk metallic glass. Microsc Microanal, 2022, 28(Suppl 1): 2782 [3] Ohashi Y, Wada T, Kato H. High-entropy design and its influence on glass-forming ability in Zr-Cu-based metallic glass. J Alloys Compd, 2022, 915: 165366 doi: 10.1016/j.jallcom.2022.165366 [4] Zhang X P, Lai L M, Xiao S M, et al. Effect of W on the thermal stability, mechanical properties and corrosion resistance of Fe-based bulk metallic glass. Intermetallics, 2022, 143: 107485 doi: 10.1016/j.intermet.2022.107485 [5] Shan F L, Sun T T, Song W D, et al. A bridge from metallic glasses to medium-entropy alloys in Ti–Cu–Zr–Pd–Co system: Design, microstructure, and deformation-induced-martensitic transformation. J Non Cryst Solids, 2022, 587: 121608 doi: 10.1016/j.jnoncrysol.2022.121608 [6] Samuel A L. Some studies in machine learning using the game of checkers. IBM J Res Dev, 1959, 3(3): 210 doi: 10.1147/rd.33.0210 [7] Ramprasad R, Batra R, Pilania G, et al. Machine learning in materials informatics: Recent applications and prospects. Npj Comput Mater, 2017, 3: 54 doi: 10.1038/s41524-017-0056-5 [8] Correa-Baena J P, Hippalgaonkar K, van Duren J, et al. Accelerating materials development via automation, machine learning, and high-performance computing. Joule, 2018, 2(8): 1410 doi: 10.1016/j.joule.2018.05.009 [9] Ren F, Ward L, Williams T, et al. Accelerated discovery of metallic glasses through iteration of machine learning and high-throughput experiments. Sci Adv, 2018, 4(4): eaaq1566 doi: 10.1126/sciadv.aaq1566 [10] Wen C, Zhang Y, Wang C X, et al. Machine learning assisted design of high entropy alloys with desired property. Acta Mater, 2019, 170: 109 doi: 10.1016/j.actamat.2019.03.010 [11] Liu X D, Li X, He Q F, et al. Machine learning-based glass formation prediction in multicomponent alloys. Acta Mater, 2020, 201: 182 doi: 10.1016/j.actamat.2020.09.081 [12] Hachmann J, Olivares-Amaya R, Atahan-Evrenk S, et al. The Harvard clean energy project: Large-scale computational screening and design of organic photovoltaics on the world community grid. J Phys Chem Lett, 2011, 2(17): 2241 doi: 10.1021/jz200866s [13] Hachmann J, Olivares-Amaya R, Jinich A, et al. Lead candidates for high-performance organic photovoltaics from high-throughput quantum chemistry - the Harvard Clean Energy Project. Energy Environ Sci, 2014, 7(2): 698 doi: 10.1039/C3EE42756K [14] Jain A, Ong S P, Hautier G, et al. Commentary: The materials project: A materials genome approach to accelerating materials innovation. APL Mater, 2013, 1(1): 011002 doi: 10.1063/1.4812323 [15] Saal J E, Kirklin S, Aykol M, et al. Materials design and discovery with high-throughput density functional theory: The open quantum materials database (OQMD). JOM, 2013, 65(11): 1501 doi: 10.1007/s11837-013-0755-4 [16] Kirklin S, Saal J E, Meredig B, et al. The open quantum materials database (OQMD): Assessing the accuracy of DFT formation energies. Npj Comput Mater, 2015, 1: 15010 doi: 10.1038/npjcompumats.2015.10 [17] Curtarolo S, Setyawan W, Wang S D, et al. AFLOWLIB. ORG: A distributed materials properties repository from high-throughput ab initio calculations. Comput Mater Sci, 2012, 58: 227 [18] Allen F H. The Cambridge Structural Database: A quarter of a million crystal structures and rising. Acta Cryst, 2002, B58: 380 [19] Kalidindi S R, De Graef M. Materials data science: Current status and future outlook. Annu Rev Mater Res, 2015, 45: 171 doi: 10.1146/annurev-matsci-070214-020844 [20] Kawazoe Y, Carow-Watamura U, Louzguine D V. Phase Diagrams and Physical Properties of Nonequilibrium Alloys: Subvolume C: Physical Properties of Multi-Component Amorphous Alloys. Berlin: Springer, 2019 [21] Turnbull D. Under what conditions can a glass be formed? Contemp Phys, 1969, 10(5): 473 [22] Inoue A, Zhang T, Masumoto T. Glass-forming ability of alloys. J Non Cryst Solids, 1993, 156-158: 473 doi: 10.1016/0022-3093(93)90003-G [23] Lu Z P, Liu C T. A new glass-forming ability criterion for bulk metallic glasses. Acta Mater, 2002, 50(13): 3501 doi: 10.1016/S1359-6454(02)00166-0 [24] Du X H, Huang J C, Liu C T, et al. New criterion of glass forming ability for bulk metallic glasses. J Appl Phys, 2007, 101(8): 086108 doi: 10.1063/1.2718286 [25] Inoue A. Stabilization of metallic supercooled liquid and bulk amorphous alloys. Acta Mater, 2000, 48(1): 279 doi: 10.1016/S1359-6454(99)00300-6 [26] Miedema A R, de Boer F R, Boom R. Model predictions for the enthalpy of formation of transition metal alloys. Calphad, 1977, 1(4): 341 doi: 10.1016/0364-5916(77)90011-6 [27] Ramakrishna Rao B, Gandhi A S, Vincent S, et al. Prediction of glass forming ability using thermodynamic parameters. Trans Indian Inst Met, 2012, 65(6): 559 doi: 10.1007/s12666-012-0215-9 [28] Mansoori G A, Carnahan N F, Starling K E, et al. Equilibrium thermodynamic properties of the mixture of hard spheres. J Chem Phys, 1971, 54(4): 1523 doi: 10.1063/1.1675048 [29] Fang S S, Xiao X S, Xia L, et al. Relationship between the widths of supercooled liquid regions and bond parameters of Mg-based bulk metallic glasses. J Non Cryst Solids, 2003, 321(1-2): 120 doi: 10.1016/S0022-3093(03)00155-8 [30] Guo S, Liu C T. Phase stability in high entropy alloys: Formation of solid-solution phase or amorphous phase. Prog Nat Sci Mater Int, 2011, 21(6): 433 doi: 10.1016/S1002-0071(12)60080-X [31] Angell C A. Formation of glasses from liquids and biopolymers. Science, 1995, 267(5206): 1924 doi: 10.1126/science.267.5206.1924 [32] Wang L M, Tian Y J, Liu R P. Dependence of glass forming ability on liquid fragility: Thermodynamics versus kinetics. Appl Phys Lett, 2010, 97(18): 181901 doi: 10.1063/1.3506900 [33] Mastropietro D G, Moya J A. Design of Fe-based bulk metallic glasses for maximum amorphous diameter (Dmax) using machine learning models. Comput Mater Sci, 2021, 188: 110230 doi: 10.1016/j.commatsci.2020.110230 [34] Yao Y, Sullivan T IV, Yan F, et al. Balancing data for generalizable machine learning to predict glass-forming ability of ternary alloys. Scr Mater, 2022, 209: 114366 doi: 10.1016/j.scriptamat.2021.114366 [35] Butler K T, Davies D W, Cartwright H, et al. Machine learning for molecular and materials science. Nature, 2018, 559(7715): 547 doi: 10.1038/s41586-018-0337-2 [36] Wu W, Sun Q. Applying machine learning to accelerate new materials development. Sci Sin-Phys Mech Astron, 2018, 48(10): 107001 [37] Park S, Fonseca J H, Marimuthu K P, et al. Determination of material properties of bulk metallic glass using nanoindentation and artificial neural network. Intermetallics, 2022, 144: 107492 doi: 10.1016/j.intermet.2022.107492 [38] Han G, Marimuthu K P, Lee H. Evaluation of thin film material properties using a deep nanoindentation and ANN. Mater Des, 2022, 221: 111000 doi: 10.1016/j.matdes.2022.111000 [39] Reddy G J, Kandavalli M, Saboo T, et al. Prediction of glass forming ability of bulk metallic glasses using machine learning. Integr Mater Manuf Innov, 2021, 10(4): 610 doi: 10.1007/s40192-021-00239-y [40] Sun Y T, Bai H Y, Li M Z, et al. Machine learning approach for prediction and understanding of glass-forming ability. J Phys Chem Lett, 2017, 8(14): 3434 doi: 10.1021/acs.jpclett.7b01046 [41] Li Z, Long Z L, Lei S, et al. Predicting the glass formation of metallic glasses using machine learning approaches. Comput Mater Sci, 2021, 197: 110656 doi: 10.1016/j.commatsci.2021.110656 [42] Zhang Y X, Xing G C, Sha Z D, et al. A two-step fused machine learning approach for the prediction of glass-forming ability of metallic glasses. J Alloys Compd, 2021, 875: 160040 doi: 10.1016/j.jallcom.2021.160040 [43] Fan Z, Ma E, Falk M L. Predicting the location of shear band initiation in a metallic glass. Phys Rev Mater, 2022, 6(6): 065602 doi: 10.1103/PhysRevMaterials.6.065602 [44] Ward L, Agrawal A, Choudhary A, et al. A general-purpose machine learning framework for predicting properties of inorganic materials. Npj Comput Mater, 2016, 2: 16028 doi: 10.1038/npjcompumats.2016.28 [45] Ward L, O'Keeffe S C, Stevick J, et al. A machine learning approach for engineering bulk metallic glass alloys. Acta Mater, 2018, 159: 102 doi: 10.1016/j.actamat.2018.08.002 [46] Schultz L E, Afflerbach B, Francis C, et al. Exploration of characteristic temperature contributions to metallic glass forming ability. Comput Mater Sci, 2021, 196: 110494 doi: 10.1016/j.commatsci.2021.110494 [47] Deng B H, Zhang Y L. Critical feature space for predicting the glass forming ability of metallic alloys revealed by machine learning. Chem Phys, 2020, 538: 110898 doi: 10.1016/j.chemphys.2020.110898 [48] Li X, Shan G C, Shek C H. Machine learning prediction of magnetic properties of Fe-based metallic glasses considering glass forming ability. J Mater Sci Technol, 2022, 103: 113 doi: 10.1016/j.jmst.2021.05.076 [49] Xiong J, Shi S Q, Zhang T Y. Machine learning prediction of glass-forming ability in bulk metallic glasses. Comput Mater Sci, 2021, 192: 110362 doi: 10.1016/j.commatsci.2021.110362 [50] Lu Z C, Chen X, Liu X J, et al. Interpretable machine-learning strategy for soft-magnetic property and thermal stability in Fe-based metallic glasses. Npj Comput Mater, 2020, 6: 187 doi: 10.1038/s41524-020-00460-x [51] McCulloch W S, Pitts W. A logical calculus of the ideas immanent in nervous activity. Bull Math Biophys, 1943, 5: 115 doi: 10.1007/BF02478259 [52] Lu F, Liang Y C, Wang X Y, et al. Prediction of amorphous forming ability based on artificial neural network and convolutional neural network. Comput Mater Sci, 2022, 210: 111464 doi: 10.1016/j.commatsci.2022.111464 [53] Yu L P, Guo X X, Wang G, et al. Extracting governing system for the plastic deformation of metallic glasses using machine learning. Sci China Phys Mech Astron, 2022, 65(6): 264611 doi: 10.1007/s11433-021-1840-9 [54] Zhang T, Long Z L, Peng L, et al. Prediction of glass forming ability of bulk metallic glasses based on convolutional neural network. J Non Cryst Solids, 2022, 595: 121846 doi: 10.1016/j.jnoncrysol.2022.121846 [55] Zeng S M, Zhao Y C, Li G, et al. Atom table convolutional neural networks for an accurate prediction of compounds properties. Npj Comput Mater, 2019, 5: 84 doi: 10.1038/s41524-019-0223-y [56] Dan Y B, Zhao Y, Li X, et al. Generative adversarial networks (GAN) based efficient sampling of chemical composition space for inverse design of inorganic materials. Npj Comput Mater, 2020, 6: 84 doi: 10.1038/s41524-020-00352-0 [57] Cortes C, Vapnik V. Support-vector networks. Mach Learn, 1995, 20(3): 273 [58] Gunn S R. Support vector machines for classification and regression. ISIS Tech Rep, 1998, 14(1): 5 [59] Wu Y C, Wang W H, Guan P F, et al. Identifying packing features of atoms with distinct dynamic behaviors in metallic glass by machine-learning method. Sci China Mater, 2021, 64(7): 1820 doi: 10.1007/s40843-020-1626-3 [60] Cubuk E D, Schoenholz S S, Kaxiras E, et al. Structural properties of defects in glassy liquids. J Phys Chem B, 2016, 120: 6139 doi: 10.1021/acs.jpcb.6b02144 [61] Sussman D M, Schoenholz S S, Cubuk E D, et al. Disconnecting structure and dynamics in glassy thin films. Proc Natl Acad Sci USA, 2017, 114(40): 10601 doi: 10.1073/pnas.1703927114 -