Anomaly Detection using Unsupervised methods

Univariate Outlier detection
Boxplot

Histogram

Distribution based approach

Multimodal distribution

Multivariate Outlier detection

Histogram based outlier score (HBOS)

Neighborhood methods
KNN

Local Outlier Factor (LOF)

Connectivity Outlier Factor (COF)



One-class classification
One-class SVM

Clustering

DBSCAN

Approaches for High-Dimensional Data
In higher dimensions the similarity between two similar people is decreased and increased for irrelevant people - Curse of dimensionality
In high dimensions, distance metrics such as Eculidean distance and neighborhood concept does not make sense
Solutions for Anomaly detection in High-dimensional data
- Dimensions Reduction Techniques
- PCA
- Matrix / Tensor Factorization
- Autoencoder
- Angle-based outlier detection
- Ensemble Approaches
- Isolation Forest
- Feature Bagging
PCA

Matrix Factorization

Tensor Factorization



Autoencoder


Angle based Outlier detection


Ensemble
Isolation Forest


Feature Bagging

Comparison of Various approaches


Factors to consider for Anomaly detection
