TY - JOUR
T1 - Classification of structured validation data using stateless and stateful features
AU - Schwenk, G.
AU - Pabst, R.
AU - Müller, K. R.
N1 - Funding Information:
The authors gratefully acknowledge the funding from P3 communications GmbH and partial support by the Federal Ministry for Education and Research (BMBF), Germany , the Deutsche Forschungsgesellschaft (DFG) and the Brain Korea 21 Plus Program, South Korea . This work was supported by Institute for Information & Communications Technology Promotion, South Korea and funded by the Korea government (MSIT) (No. 2017-0-00451 , No. 2017-0-01779 ).
Publisher Copyright:
© 2019 The Authors
PY - 2019/4/15
Y1 - 2019/4/15
N2 - To reliably identify problems impacting the service quality and system dependability of mobile communication networks, the monitored data needs to be validated. This paper proposes and evaluates analysis methods, features and learning methods for the automatic validation of such data, with a special focus on failure data of mobile communication data. This data can be analyzed for discriminating failures caused by problems in the infrastructure (valid failures) from those caused by other circumstances like device imperfections (invalid failures), with the purpose of filtering the invalid failures, which effectively increases both dependability and value of the underlying data. To represent the complex structural and temporal properties of the mobile communication data, two complementary feature representations are proposed and compared, followed by a discussion of classification methods which are suitable for these feature spaces and for an interpretation of their results to support manual auditing. Their classification performances on these feature spaces are evaluated and compared to competitive approaches. In the evaluation a classification performances of up to 97% AUC–ROC is achieved. This renders our approach a good alternative to using manual matching rules, which require costly expert-knowledge and are much more time-consuming to define and maintain — while also highlighting the relevance of combining feature spaces of different problem perspectives. Additionally it is shown that using non-proprietary data analysis can enable feature representations nearly as expressive as those created by using proprietary analysis methods, which allows a broader application of the proposed methods, due to the lower processing requirements.
AB - To reliably identify problems impacting the service quality and system dependability of mobile communication networks, the monitored data needs to be validated. This paper proposes and evaluates analysis methods, features and learning methods for the automatic validation of such data, with a special focus on failure data of mobile communication data. This data can be analyzed for discriminating failures caused by problems in the infrastructure (valid failures) from those caused by other circumstances like device imperfections (invalid failures), with the purpose of filtering the invalid failures, which effectively increases both dependability and value of the underlying data. To represent the complex structural and temporal properties of the mobile communication data, two complementary feature representations are proposed and compared, followed by a discussion of classification methods which are suitable for these feature spaces and for an interpretation of their results to support manual auditing. Their classification performances on these feature spaces are evaluated and compared to competitive approaches. In the evaluation a classification performances of up to 97% AUC–ROC is achieved. This renders our approach a good alternative to using manual matching rules, which require costly expert-knowledge and are much more time-consuming to define and maintain — while also highlighting the relevance of combining feature spaces of different problem perspectives. Additionally it is shown that using non-proprietary data analysis can enable feature representations nearly as expressive as those created by using proprietary analysis methods, which allows a broader application of the proposed methods, due to the lower processing requirements.
KW - Feature modeling
KW - Interpretable learning
KW - Mobile communication
KW - Quality of service
KW - Stateless and stateful features
KW - Structured data
KW - Supervised learning
UR - http://www.scopus.com/inward/record.url?scp=85062691719&partnerID=8YFLogxK
U2 - 10.1016/j.comcom.2019.02.007
DO - 10.1016/j.comcom.2019.02.007
M3 - Article
AN - SCOPUS:85062691719
SN - 0140-3664
VL - 138
SP - 54
EP - 66
JO - Computer Communications
JF - Computer Communications
ER -