Gerz, Fabian; Al-Shrouf, Loui; Jelali, Mohieddine:
A Comparative Analysis of Concept Drift Detection Methods with a Systematic and Innovative Approach of Method Selection
In: Structural Health Monitoring 2023 : Designing SHM for Sustainability, Maintainability, and Reliability / Farhangdoust, Saman; Guemes, Alfredo; Chang, Fu-Kuo (Hrsg.). - 14th International Workshop on Structural Health Monitoring (IWSHM); Stanford University, CA; September 12-14, 2023 - Lancaster, Pennsylvania: DEStech Publications, Inc., 2023, S. 1571 - 1578
2023Aufsatz (Konferenz) in TagungsbandOA Gold
Fakultät für Anlagen, Energie- und Maschinensysteme » Institut für Produktentwicklung und Konstruktionstechnik
Titel:
A Comparative Analysis of Concept Drift Detection Methods with a Systematic and Innovative Approach of Method Selection
Autor*in:
Gerz, FabianTH Köln
DHSB-ID
THK0022839
ORCID
0009-0006-0163-2132ORCID iD
SCOPUS
57943914200
Sonstiges
der TH Köln zugeordnete Person
korrespondierende*r Autor*in
;
Al-Shrouf, LouiTH Köln
DHSB-ID
THK0002713
SCOPUS
57943674700
Sonstiges
der TH Köln zugeordnete Person
;
Jelali, MohieddineTH Köln
DHSB-ID
THK0001866
ORCID
0000-0002-0347-9913ORCID iD
SCOPUS
6603471393
Sonstiges
der TH Köln zugeordnete Person
Erscheinungsjahr:
2023
OA-Publikationsweg:
OA Gold
Sprache des Textes:
Englisch
Ressourcentyp:
Text
Praxispartner*in:
Nein
Kategorie:
Forschung
Teil der Statistik:
Teil der Statistik

Abstract in Englisch:

One of the mostsignificant challengesin data-driven modeling of complex systems is dealing with concept drift, i.e., the unpredictable changes in the underlying data distribution over time. In this work, nine concept drift detection (CDD) methods are evaluated with respect to different types of concept drift, including abrupt, gradual, incremental, and real concept drift, for both supervised and unsupervised application scenarios. For the supervised case, the methods EDDM, FHDDMSadd, MDDM-E, and EFDT are compared against a classification without change detection using Naïve Bayes as the base classifier. In the unsupervised application scenario, CluStream, ClusTree, DenStream, StreamKM++, and D-Stream are evaluated. The experiments are conducted using the Massive Online Analysis (MOA) evaluation platform, and the performance of each method is measured in terms of classification accuracy, memory consumption, and computation time. This empirical research showsthat classification accuracy can be improved by 20% by implementing a CDD method, highlighting the importance of CDD in SHM data streams. However, there is no single method that proves to be superior in all scenarios, and the choice depends on the characteristics of the considered data stream and application requirements. Selecting the appropriate CDD method from the approximately 340 different methods found in the literature is not a trivial task and can lead to suboptimal selection. To tackle this issue, an innovative approach is proposed to assist researchers and practitioners find the appropriate CCD method for their application.