Gerz, Fabian; Al-Shrouf, Loui; Jelali, Mohieddine:
A Comparative Analysis of Concept Drift Detection Methods with a Systematic and Innovative Approach of Method Selection
In: Structural Health Monitoring 2023 : Designing SHM for Sustainability, Maintainability, and Reliability / Farhangdoust, Saman; Guemes, Alfredo; Chang, Fu-Kuo (Eds.). - 14th International Workshop on Structural Health Monitoring (IWSHM); Stanford University, CA; September 12-14, 2023 - Lancaster, Pennsylvania: DEStech Publications, Inc., 2023, pp. 1571 - 1578
2023Essay (Conference) in Conference proceedingsOA Gold
Faculty of Process Engineering, Energy and Mechanical Systems » Institute of Product Development and Engineering Design
Title:
A Comparative Analysis of Concept Drift Detection Methods with a Systematic and Innovative Approach of Method Selection
Author:
Gerz, FabianTH Köln
DHSB-ID
THK0022839
ORCID
0009-0006-0163-2132ORCID iD
SCOPUS
57943914200
Other
person connected with TH Köln
corresponding author
;
Al-Shrouf, LouiTH Köln
DHSB-ID
THK0002713
SCOPUS
57943674700
Other
person connected with TH Köln
;
Jelali, MohieddineTH Köln
DHSB-ID
THK0001866
ORCID
0000-0002-0347-9913ORCID iD
SCOPUS
6603471393
Other
person connected with TH Köln
Year of publication:
2023
„Publication Channel“:
OA Gold
Language of text:
English
Type of resource:
Text
Practice Partner:
No
Category:
Research
Part of statistic:
Part of statistic

Abstract in English:

One of the mostsignificant challengesin data-driven modeling of complex systems is dealing with concept drift, i.e., the unpredictable changes in the underlying data distribution over time. In this work, nine concept drift detection (CDD) methods are evaluated with respect to different types of concept drift, including abrupt, gradual, incremental, and real concept drift, for both supervised and unsupervised application scenarios. For the supervised case, the methods EDDM, FHDDMSadd, MDDM-E, and EFDT are compared against a classification without change detection using Naïve Bayes as the base classifier. In the unsupervised application scenario, CluStream, ClusTree, DenStream, StreamKM++, and D-Stream are evaluated. The experiments are conducted using the Massive Online Analysis (MOA) evaluation platform, and the performance of each method is measured in terms of classification accuracy, memory consumption, and computation time. This empirical research showsthat classification accuracy can be improved by 20% by implementing a CDD method, highlighting the importance of CDD in SHM data streams. However, there is no single method that proves to be superior in all scenarios, and the choice depends on the characteristics of the considered data stream and application requirements. Selecting the appropriate CDD method from the approximately 340 different methods found in the literature is not a trivial task and can lead to suboptimal selection. To tackle this issue, an innovative approach is proposed to assist researchers and practitioners find the appropriate CCD method for their application.