Multivariate statistical process control using dynamic ensemble methods

Thumbnail Image

Date

2015

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

One important challenge with some applications such as credit card fraud detection, intrusion detection and network traffic monitoring is that data arrive in streams over time and leads to changes in concepts which are known in data mining as concept drift. Thus, models analyzing such data become obsolete and efficient learning should be able to identify these changes and quickly update the system to them. The objective of this dissertation is to investigate the effectiveness of ensemble methods and Statistical Process Control (SPC) techniques in detecting changes in processes in order to improve the robustness of tracking concept drift and coping with the dynamics of online data stream processes. For reaching this objective, different heuristics were proposed. First, an improved dynamic weighted majority Winnow algorithm based on ensemble methods is proposed. Furthermore, parameters optimization based on genetic algorithm of the proposed method as well as an analysis of its robustness are investigated. Second, in order to handle the problem of concept drift while monitoring nonstationary environment using SPC tools, a time adjusting control chart based on a recursive adaptive formulas of the charting statistics is proposed. Results show that the updating charts cope much better with the nonstationarity of the environment. Also, two new heuristics are proposed based on both ensemble methods and adaptive control charts. The first is an offline learning chart model while the second is an online batch learning algorithm. Results show that quick adaptation of the system and accurate shift point identification are achieved when using both heuristics together. Also, the new adaptive ensemble charts have better performance in learning concept drifts along with a good suitability to nonlinearity and noise issues.

Description

Table of contents

Keywords

Ensemble methods, Statistical process control

Citation