Anomaly / outlier detection using isolation forest in Scala

Introduction

This article is describing the journey of implementing the unsupervised learning algorithm for anomaly / outlier detection from scratch.

Why?

  • I could not find a Scala implementation library and implementing the complete algorithm from scratch gave me a good understanding of the algorithm including it’s good and bad parts …
  • I wrapped the implementation in a small project that can read actual data from a topic …