Machine Learning model for the prediction of self-diffusion coefficients in liquids, compressed gases and supercritical fluids

abstract

Machine Learning (ML) models were developed in this work to predict self-diffusion coefficients (D11) of dense fluids using four training algorithms: Gradient Boosting, k-Nearest Neighbors, Decision Tree, and Random Forest. A database of 7931 experimental points from 223 substances, at different pressures and temperatures, was used to train the models. From an initial set of 34 input features (variables/properties), the eight most important ones, ranked by decreasing relevance, were: density, acentric factor, temperature, critical temperature, critical volume, number of NH and/or OH bonds, pressure, and number of rotatable bonds. The best performance was achieved by models using the first 5 and 8 input features (ML5-D11 and ML8-D11) using the Gradient Boosting algorithm, for which the average absolute relative deviations (AARDglobal) were 9.06 % and 7.14 %, for the test set. The performance of the ML5-D11 and ML8-D11 models was compared with four phenomenological models - the predictive Zhu et al. equation, the 2-parameters Dymond-Hildebrand-Batschinski correlation, and the 1-parameter and 4-parameters Lennard-Jones correlations (LJ1 and LJ4) - which showed AARDglobal of 104.05 %, 82.94 %, 16.92 % and 7.97 %, respectively, for the same test set. Despite the good results of LJ4 equation, it is worth noting it embodies 4 substance-specific parameters that must be fitted to experimental data in advance. The new ML5-D11 and ML8-D11 models are purely predictive and can be applied to polar/nonpolar, spherical/ non-spherical, and even hydrogen-bonding molecules in liquids, compressed gases or supercritical fluids. The ML5-D11 and ML8-D11 are provided for use as a Python program.

keywords

PERTURBED-CHAIN SAFT; LENNARD-JONES; HARD-SPHERE; DENSE FLUIDS; REAL FLUIDS; VISCOSITY; EQUATION; SYSTEMS; STATE

subject category

Engineering

authors

Dias, AFF; Portugal, I; Aniceto, JPS; Silva, CM

our authors

acknowledgements

This work was developed within the scope of the project CICECO- Aveiro Institute of Materials, UIDB/50011/2020 (DOI 10.54499/UIDB/50011/2020) , UIDP/50011/2020 (DOI 10.54499/UIDP/50011/2020) & LA/P/0006/2020 (DOI 10.54499/LA/P/0006/2020) , financed by national funds through the FCT/MCTES (PIDDAC) . J.P.S.A. thanks FCT (Fundacao para a Ciencia e a Tecnologia) for funding under the Scientific Employment Stimulus-CEEC Individual 2020 (DOI 10.54499/2020.02534.CEECIND/CP1589/CT0014) .

Share this project:

Related Publications

We use cookies for marketing activities and to offer you a better experience. By clicking “Accept Cookies” you agree with our cookie policy. Read about how we use cookies by clicking "Privacy and Cookie Policy".