Eswar, Srinivas, Ramakrishnan Kannan, Richard Vuduc, and Haesun Park. “ORCA: Outlier detection and Robust Clustering for Attributed graphs.” Journal of Global Optimization (2021), 3:1-23.
doi:10.1007/s10898-021-01024-z
We propose a framework that simultaneously clusters objects and detects anomalies in attributed graph data. Our objective function, together with carefully constructed constraints, promotes the interpretability of both the clustering and anomaly detection components, as well as the scalability of the method. Within this framework, we develop an algorithm called Outlier detection and Robust Clustering for Attributed graphs (ORCA). ORCA is fast, converges under mild conditions, produces high-quality clustering results, and discovers anomalies that map back naturally to the features of the input data. We demonstrate the efficacy and efficiency of ORCA on real-world datasets against multiple state-of-the-art techniques.