Originally a software engineer:
Converted to a data scientist:
Over the last year:
* who speak Python
A topic model is a type of statistical model for discovering the abstract "topics" that occur in a collection of documents.
GIGO:
Not really a topic model, but still useful
TF-IDF is the product of:
Intuition: give high weight to words that are topic-specific
LSI/A: latent semantic indexing/analysis
Singular value decomposition of the frequency/tf-idf matrix, followed by dimensionality reduction
Intuition: denoise to extract latent factors/topics
Intuition: becomes clear when you get into Bayesian stats
Word2Vec: representing words as vectors
Not a classic topic model, inspired by deep learning
Claim to fame: