TY - JOUR
T1 - Reconciliation of outliers in CO2-alkanolamine-H2O datasets by robust neural network winsorization
AU - Suleman, Humbul
AU - Maulud, Abdulhalim Shah
AU - Man, Zakaria
PY - 2016/2/3
Y1 - 2016/2/3
N2 - It is normal to find at least a few measured values in CO2-alkanolamine-H2O datasets that deviate greatly from the majority of published data, as the data come from different sources. These values, termed as data outliers, are the major source of conflict in modeling, simulation and process development studies. Therefore, removal of data outliers is mandatory. However, available statistical techniques are known to lose information at the boundaries of the system and exhibit substantial deviation from holistic data trend. Hence, an adaptive approach combining artificial neural networks and robust winsorization is presented for identification and reconciliation of data outliers in CO2-alkanolamine-H2O system. The proposed approach flexibly transforms to the nonlinear data distribution and predicts corrected values for data outliers (winsorized values), thus maintaining the information at extremes of the system. The results have been graphically analyzed and show good conformance in treated data, with retention of winsorized values. The proposed method improves the shortcomings of previous statistical approaches and can be potentially extended to other nonlinear experimental datasets in chemical process systems.
AB - It is normal to find at least a few measured values in CO2-alkanolamine-H2O datasets that deviate greatly from the majority of published data, as the data come from different sources. These values, termed as data outliers, are the major source of conflict in modeling, simulation and process development studies. Therefore, removal of data outliers is mandatory. However, available statistical techniques are known to lose information at the boundaries of the system and exhibit substantial deviation from holistic data trend. Hence, an adaptive approach combining artificial neural networks and robust winsorization is presented for identification and reconciliation of data outliers in CO2-alkanolamine-H2O system. The proposed approach flexibly transforms to the nonlinear data distribution and predicts corrected values for data outliers (winsorized values), thus maintaining the information at extremes of the system. The results have been graphically analyzed and show good conformance in treated data, with retention of winsorized values. The proposed method improves the shortcomings of previous statistical approaches and can be potentially extended to other nonlinear experimental datasets in chemical process systems.
UR - http://dx.doi.org/10.1007/s00521-016-2213-z
U2 - 10.1007/s00521-016-2213-z
DO - 10.1007/s00521-016-2213-z
M3 - Article
SN - 0941-0643
VL - 28
SP - 2621
EP - 2632
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 9
ER -