Faster Word Co-Occurrence Calculation In Large Document CorpusFor one of my research papers I had to calculate co-occurrence information between pairs of words for a large number of topics in order to…Dec 26, 2021Dec 26, 2021
A Simple Intuition Behind The Normal Distribution EquationEver since I learned about the Normal Distribution Equation in school it always looked menacing to me. I could not memorize it and I could…Nov 18, 2021Nov 18, 2021
A New Way to Think About the “Bias vs. Variance” tradeoffIn ML, the tradeoff between overfitting and underfitting is often depicted using a U-shape. Traditionally, we want our model to learn…Nov 14, 2021Nov 14, 2021
A Simple Trick to Understand the t-testThe t-test is a statistical test to compare the means of two groups. It is one of the most common techniques to check if two groups come…Nov 4, 20211Nov 4, 20211
Does my sample have to be normally distributed for a t-test?While reading up on t-test’s normality assumption I came across a lot of conflicting information. Most of the resources online suggest that…Oct 26, 20213Oct 26, 20213
Top-5 Essential Python Libraries for Data AnalysisPython is one of the best choices when it comes to analyzing data. In fact, Python is THE language when it comes to programming in general…Oct 16, 2021Oct 16, 2021
Generating Fake Trump Tweets with LSTMThis is a part 2 of my series of articles on generating fake Trump tweets. You can read part 1 here where I used Markov Chains to fake…Apr 16, 2021Apr 16, 2021
Generating fake Trump tweets using Markov ChainsDonald Trump, a former US president, is one of the most controversial presidents in the modern US history. His presidency was remembered…Apr 10, 2021Apr 10, 2021
An Analysis of Red Hot Chili Pepper’s Lyrics Using NLPI heard Red Hot Chili Peppers for the first time about 10 years ago and I instantly became obsessed with them. They were (and still are)…Apr 7, 2021Apr 7, 2021