Anomaly detection with SECODA for the R environment. SECODA is a general-purpose unsupervised non-parametric anomaly detection algorithm for datasets containing numerical and/or categorical attributes.
SECODA version 0.5.4 - 20200228
This project includes several types of files:
- SECODA implementations for R (see below)
- Example datasets and code (run the code in "Examples.R")
- Publications, Help/Description files, version information and links to other resources
This project contains several implementations of SECODA:
- The regular SECODA as a code file that can be executed by R.
- The regular SECODA as an R package that can be installed in the R environment.
- The SECODA base R version as a code file, which runs more slowly than the regular implementation (but does not need the data.table package). The results are identical to the regular version. Note, however, that this base R version does not offer all options.
This software should be regarded as a beta version, and as such will undoubtedly still contain bugs.
Help improve this algorithm and software by sending comments to Ralph Foorthuis ([email protected]). If you report a bug in the code, please also provide a small dataset and the statements to reproduce the error.
I would also appreciate it if you send me comments about how SECODA did (or did not) help you in your analysis.
The software in this container is licensed under a GNU AFFERO GENERAL PUBLIC LICENSE VERSION 3.
See the Help file, documents or "Examples.R" for information on using the SECODA() algorithm in R.
See the following references for background information on SECODA and the typology of anomalies:
- Foorthuis, R.M. (2017). SECODA: Segmentation- and Combination-Based Detection of Anomalies. In: Proceedings of the 4th IEEE International Conference on Data Science and Advanced Analytics (DSAA 2017), Tokyo, Japan.
- Foorthuis R.M. (2019). All or In-cloud: How the Identification of Six Types of Anomalies is Affected by the Discretization Method. In: Atzmueller M., Duivesteijn W. (eds). Artificial Intelligence. BNAIC 2018. Springer, Communications in Computer and Information Science, Vol. 1021, pp 25-42. DOI: 10.1007/978-3-030-31978-6_3
- Foorthuis R.M. (2021). On the Nature and Types of Anomalies: A Review of Deviations in Data. International Journal of Data Science and Analytics, DOI: 10.1007/s41060-021-00265-1
- Foorthuis, R.M. (2018). A Typology of Data Anomalies. In: Proceedings of IPMU 2018, the 17th International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems, Cádiz, Spain.
- Foorthuis, R.M. (2017). Anomaly Detection with SECODA. Poster Presentation at the 4th IEEE International Conference on Data Science and Advanced Analytics (DSAA 2017), Tokyo, Japan.
- Foorthuis, R.M. (2017). The SECODA Algorithm for the Detection of Anomalies in Sets with Mixed Data. Presentation on www.foorthuis.nl