## Data Science Interview Question #2

**Categories:**
Data Science, Interview Questions, Mathematics

The second entry in the DS interview question series comes from ryxcommar. Let's put pen to paper (i.e. keyboard to LaTeX) and solve it!

This is a solution to an interview question posed by Quantian on Twitter. It is the first in a series of interview questions I plan to post.

**Tags:**
lda, probability, topic modeling

**Categories:**
Data Science, Mathematics

Cosine similarity is a useful measure for comparing NLP document vectors, but it should not be used with probability distributions.

**Categories:**
Coding, Shell Scripting

I demonstrate how to use gztool and SQLite to provide near-random access to large gzip files and to enable querying by fields of interest.

**Categories:**
Data Science

I discovered that some demo versions of pirated slot games produce artificially high return-to-player (RTP) figures to inspire overconfidence in potential customers.