Data Science Interview Question #2

Categories: Data Science, Interview Questions, Mathematics,
The second entry in the DS interview question series comes from ryxcommar. Let's put pen to paper (i.e. keyboard to LaTeX) and solve it!
Categories: Data Science, Interview Questions, Mathematics,
The second entry in the DS interview question series comes from ryxcommar. Let's put pen to paper (i.e. keyboard to LaTeX) and solve it!
Categories: Data Science, Interview Questions,
This is a solution to an interview question posed by Quantian on Twitter. It is the first in a series of interview questions I plan to post.
Tags: lda, probability, topic modeling
Categories: Data Science, Mathematics,
The cosine similarity is a useful distance measure for comparing NLP document vectors, but should not be used with probability distributions.
Categories: Coding, Shell Scripting,
I demonstrate how to use gztool and SQLite to provide near random access to large gzip files, and enable querying by fields of interest.
Categories: Data Science,
I discovered that some demo versions of pirated slot games produce artificially high RTPs to inspire overconfidence within potential customers.