Brief introduction to SparkUI

This is a post on Spark UI and what to look for while Spark jobs are running.

August 31, 2020 · 7 min · Manish Barnwal

Why you should use logging instead of print statements?

This is a post on how one should get started with logging in Python.

May 26, 2020 · 7 min · Manish Barnwal

Personal finance 101

This is a post on how one should start his journey into investments.

April 7, 2020 · 6 min · Manish Barnwal

Cluster NSE top 500 companies

This is a blog post on clustering NSE top 500 companies.

October 28, 2019 · 2 min · Manish Barnwal

Amazon's item-item Collaborative filtering recommendation algorithm [paper summary]

This is a paper summary of item-item collaborative filtering paper by Amazon

July 16, 2019 · 5 min · Manish Barnwal

Types of data in recommender systems

This tutorial explains the type of data one deals with when building recommender systems.

September 27, 2018 · 6 min · Manish Barnwal

Handling errors with try-catch in Python

This tutorial is on handling errors with try-catch in Python.

June 18, 2018 · 2 min · Manish Barnwal

Subsetting a dataframe in Python

This tutorial is on how to filter rows of a dataframe based on conditions on columns in dataframe in Python.

June 18, 2018 · 1 min · Manish Barnwal

Working with dates in Python

This tutorial is on ways to deal with dates in Python.

June 18, 2018 · 1 min · Manish Barnwal

git and github for data scientists

This is a tutorial on git and github for anyone who wants to use it.

April 18, 2018 · 10 min · Manish Barnwal