TY - JOUR
T1 - Neural network-based correlation and statistical identification of data outliers in H2S-alkanolamine-H2O and CO2-alkanolamine-H2O datasets
AU - Imai, Bruno
AU - Nasir, Qazi
AU - Maulud, Abdulhalim Shah
AU - Nawaz, Muhammad
AU - Nasir, Rizwan
AU - Suleman, Humbul
N1 - Publisher Copyright: © 2022, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.
PY - 2022/10/10
Y1 - 2022/10/10
N2 - Throughout the published literature for phase equilibrium data of CO2-alkanolamine-H2O and H2S-alkanolamine-H2O systems, it is common to find some discrepant data, called data outliers. The presence of these erroneous values induces inaccuracies and prediction errors in the models and simulation studies developed using such experimental datasets. Hence, it is important that the data outliers are identified and later corrected or removed before developing a model or simulation. This study proposes a modified approach to identifying data outliers present in the phase equilibrium data of CO2-alkanolamine-H2O and H2S-alkanolamine-H2O systems using an artificial neural network and data outlier identification methods. Firstly, the suggested approach correlates the experimental phase equilibrium data (2152 data points) of CO2 and H2S-loaded monoethanolamine, diethanolamine, and N-methyldiethanolamine solutions by developing an artificial neural network. Following this, the data outliers are identified by applying a modified IQR method and compared graphically to 2.5 standard deviation method. The identified data outliers can then be truncated or winsorised for developing reliable and accurate models/simulations. The modified IQR method coupled with a neural network (based on the normalised data values) can robustly identify data outliers within a large experimental dataset. The proposed approach is superior to the previous data outlier identification techniques that used 2.5 standard deviations method, as it alleviates the need for a human decision in determining the congruence of experimental values. The results also indicate that the developed method can be reliably extended to other/larger non-linear experimental datasets having similar correlative complexity.
AB - Throughout the published literature for phase equilibrium data of CO2-alkanolamine-H2O and H2S-alkanolamine-H2O systems, it is common to find some discrepant data, called data outliers. The presence of these erroneous values induces inaccuracies and prediction errors in the models and simulation studies developed using such experimental datasets. Hence, it is important that the data outliers are identified and later corrected or removed before developing a model or simulation. This study proposes a modified approach to identifying data outliers present in the phase equilibrium data of CO2-alkanolamine-H2O and H2S-alkanolamine-H2O systems using an artificial neural network and data outlier identification methods. Firstly, the suggested approach correlates the experimental phase equilibrium data (2152 data points) of CO2 and H2S-loaded monoethanolamine, diethanolamine, and N-methyldiethanolamine solutions by developing an artificial neural network. Following this, the data outliers are identified by applying a modified IQR method and compared graphically to 2.5 standard deviation method. The identified data outliers can then be truncated or winsorised for developing reliable and accurate models/simulations. The modified IQR method coupled with a neural network (based on the normalised data values) can robustly identify data outliers within a large experimental dataset. The proposed approach is superior to the previous data outlier identification techniques that used 2.5 standard deviations method, as it alleviates the need for a human decision in determining the congruence of experimental values. The results also indicate that the developed method can be reliably extended to other/larger non-linear experimental datasets having similar correlative complexity.
UR - http://www.scopus.com/inward/record.url?scp=85139545410&partnerID=8YFLogxK
U2 - 10.1007/s00521-022-07904-z
DO - 10.1007/s00521-022-07904-z
M3 - Article
SN - 0941-0643
VL - 35
SP - 3395
EP - 3412
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 4
ER -