Novel Meta-Features for Automated Machine Learning Model Selection in Anomaly Detection

This repository contains the scripts which evaluate a novel set of meta-features for model selection in anomaly detection tasks based on domain-specific properties.

By using different kinds of metadata, such as the properties of the data, algorithm properties, or correlation previously derived from the data, it is possible to select different models to effectively solve a given anomaly detection task. The meta-learning approach based on a set of meta-features that describes data properties and correlation can enable efficient model selection in AutoML frameworks.

Experiments with 63 datasets from different repositories with varying schemas show that the proposed set of meta-features achieves the accuracy of 87% for model selection, while the achieved accuracy for simple meta-features is 74%, for statistical meta-features 68%, for information theory metafeature 70%, and for a comprehensive set of meta-features by pyMFE 73%.

Results are in /results directory. Evaluated algorithms are in /algorithms directory.

Datasets

Datasets are collected from repositories in Harvard Dataverse and Numenta Anomaly Benchmark and cover a broad range of domains including manufacturing, transportation, healthcare, intrusion detection, and system log analysis.

Dataset repository is available here and here.

Prerequests

Python 3.7
Pip
Numpy

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.ipynb_checkpoints		.ipynb_checkpoints
old		old
pycharm		pycharm
README.md		README.md
decision_support.ipynb		decision_support.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Novel Meta-Features for Automated Machine Learning Model Selection in Anomaly Detection

Datasets

Prerequests

About

Releases

Packages

Languages

kotlarmilos/meta-features-anomaly-detection

Folders and files

Latest commit

History

Repository files navigation

Novel Meta-Features for Automated Machine Learning Model Selection in Anomaly Detection

Datasets

Prerequests

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages