Anomaly Detection using Unsupervised methods
Univariate Outlier detection
Boxplot
Histogram
Distribution based approach
Multimodal distribution
Multivariate Outlier detection
Histogram based outlier score (HBOS)
Neighborhood methods
KNN
Local Outlier Factor (LOF)
Connectivity Outlier Factor (COF)
One-class classification
One-class SVM
Clustering
DBSCAN
Approaches for High-Dimensional Data
In higher dimensions the similarity between two similar people is decreased and increased for irrelevant people - Curse of dimensionality
In high dimensions, distance metrics such as Eculidean distance and neighborhood concept does not make sense
Solutions for Anomaly detection in High-dimensional data
- Dimensions Reduction Techniques
- PCA
- Matrix / Tensor Factorization
- Autoencoder
- Angle-based outlier detection
- Ensemble Approaches
- Isolation Forest
- Feature Bagging
PCA
Matrix Factorization
Tensor Factorization
Autoencoder
Angle based Outlier detection
Ensemble
Isolation Forest
Feature Bagging
Comparison of Various approaches