…just another human
Brief introduction to SparkUI
This is a post on Spark UI and what to look for while Spark jobs are running.
Why should you use logging instead of print statements?
This is a post on how one should get started with logging in Python.
Personal finance 101
This is a post on how to start your journey into investing.
Cluster NSE top 500 companies
This is a blog post on clustering NSE top 500 companies.
Amazon's item-item Collaborative filtering recommendation algorithm [paper summary]
This is a summary of Amazon's paper on item-item collaborative filtering.
Types of data in recommender systems
This tutorial explains the types of data one deals with when building recommender systems.
Handling errors with try-catch in Python
This tutorial is on handling errors in Python with try-except, Python's equivalent of try-catch.
Subsetting a dataframe in Python
This tutorial is on how to filter the rows of a dataframe in Python based on conditions on its columns.
Working with dates in Python
This tutorial is on ways to deal with dates in Python.
git and github for data scientists
This is a tutorial on Git and GitHub for anyone who wants to use them.