KNN approximation Apache Spark

K-nearest-neighbors is a very well known classification algorithm. It is based on the phrase – “show me who your friends are and I’ll tell you who you are”. Apache Spark MLLib contains several algorithms including linear regression, k-means, etc . But it does not currently include an implementation to KNN. One of the reasons for […]

Read more "KNN approximation Apache Spark"

5 interesting things (15/11/2015)

Counting things in Python¬†– This post spotlights very nicely and simple how Python and Pythonic writing changed over the years. One of the interesting things in this post is the analogy to natural languages. Natural languages also changes and evolve over time – slang, new phrases, out-dates expressions etc. and apparently so is programming languages. […]

Read more "5 interesting things (15/11/2015)"