SO end of year surveys

Recently Stack Overflow published a few posts comparing Stack Overflow usage across different segments and scenarios: "How Do Students Use Stack Overflow?", "What Programming Languages Are Used Most on Weekends?" and "Women in the 2016 Stack Overflow Survey". A few comments regarding those posts: How Do Students Use Stack Overflow? “R and MATLAB are pretty […]


Davies-Bouldin Index

TL;DR: yet another clustering evaluation metric. The Davies-Bouldin index was suggested by David L. Davies and Donald W. Bouldin in “A Cluster Separation Measure” (IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-1 (2): 224–227, doi:10.1109/TPAMI.1979.4766909, full pdf). Just like the Silhouette score, the Calinski-Harabasz index and the Dunn index, the Davies-Bouldin index provides an internal evaluation scheme. I.e. the […]
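To make "internal evaluation" concrete, here is a minimal sketch of the index in plain Python. It needs only the points and their cluster labels, with no ground truth. The function name `davies_bouldin` is hypothetical, and the sketch assumes Euclidean distance with the mean distance to the centroid as the scatter measure, which is one common instance of the more general definition in the paper. Lower values indicate more compact, better separated clusters.

```python
from math import dist  # Euclidean distance, Python 3.8+


def davies_bouldin(points, labels):
    """Davies-Bouldin index (Euclidean variant); lower is better."""
    # Group points by their cluster label.
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)
    if len(clusters) < 2:
        raise ValueError("need at least two clusters")

    # Centroid of each cluster: coordinate-wise mean.
    centroids = {
        label: tuple(sum(coord) / len(pts) for coord in zip(*pts))
        for label, pts in clusters.items()
    }
    # Scatter of each cluster: mean distance of its points to the centroid.
    scatter = {
        label: sum(dist(p, centroids[label]) for p in pts) / len(pts)
        for label, pts in clusters.items()
    }

    # For every cluster take its worst (largest) similarity ratio to any
    # other cluster, then average over all clusters.
    # Assumption: no two centroids coincide (that would divide by zero).
    total = 0.0
    for i in clusters:
        total += max(
            (scatter[i] + scatter[j]) / dist(centroids[i], centroids[j])
            for j in clusters
            if j != i
        )
    return total / len(clusters)


points = [(0, 0), (0, 2), (10, 0), (10, 2)]
good = davies_bouldin(points, [0, 0, 1, 1])  # tight, well separated clusters
bad = davies_bouldin(points, [0, 1, 0, 1])   # each cluster spans both sides
print(good, bad)
```

The second labeling scores far worse because each cluster stretches across the gap while the centroids sit close together. For real data, scikit-learn ships the same Euclidean variant as `sklearn.metrics.davies_bouldin_score(X, labels)`.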


5 interesting things (28/08/2016)

Joel test for Data Science: the original inspiration and Domino's adaptation of it to data science were both interesting reads for me. https://blog.dominodatalab.com/joel-test-data-science/ What I Wish I Knew About Data For Startups: I especially related to the part about documenting and testing event tracking. I was surprised by how little this topic and its best practices are discussed. http://www.jeannicholashould.com/what-I-wish-I-knew-about-data-for-startups.html […]


5 Python NLP packages

NLP is a broad term that covers many types of questions and challenges, such as language detection, part-of-speech tagging, relation extraction, named entity recognition, OCR, speech recognition, sentiment extraction and many more. There are, of course, several Python libraries that try to tackle some of those problems. This post aims to provide a short […]
