<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Data Science on Manish Barnwal</title>
    <link>https://manishbarnwal.com/categories/data-science/</link>
    <description>Recent content in Data Science on Manish Barnwal</description>
    <generator>Hugo</generator>
    <language>en-us</language>
    <lastBuildDate>Tue, 26 May 2020 00:00:00 +0000</lastBuildDate>
    <atom:link href="https://manishbarnwal.com/categories/data-science/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Why you should use logging instead of print statements?</title>
      <link>https://manishbarnwal.com/posts/logging_in_python/</link>
      <pubDate>Tue, 26 May 2020 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/logging_in_python/</guid>
      <description>This is a post on how one should get started with logging in Python.</description>
    </item>
    <item>
      <title>Cluster NSE top 500 companies</title>
      <link>https://manishbarnwal.com/posts/cluster_nse_companies/</link>
      <pubDate>Mon, 28 Oct 2019 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/cluster_nse_companies/</guid>
      <description>This is a blog post on clustering NSE top 500 companies.</description>
    </item>
    <item>
      <title>Amazon&#39;s item-item Collaborative filtering recommendation algorithm [paper summary]</title>
      <link>https://manishbarnwal.com/posts/item_item_recs_paper_summary/</link>
      <pubDate>Tue, 16 Jul 2019 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/item_item_recs_paper_summary/</guid>
      <description>This is a paper summary of item-item collaborative filtering paper by Amazon</description>
    </item>
    <item>
      <title>Types of data in recommender systems</title>
      <link>https://manishbarnwal.com/posts/types_data_recommender_system/</link>
      <pubDate>Thu, 27 Sep 2018 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/types_data_recommender_system/</guid>
      <description>This tutorial explains the type of data one deals with when building recommender systems.</description>
    </item>
    <item>
      <title>Handling errors with try-catch in Python</title>
      <link>https://manishbarnwal.com/posts/try_catch_in_python/</link>
      <pubDate>Mon, 18 Jun 2018 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/try_catch_in_python/</guid>
      <description>This tutorial is on handling errors with try-catch in Python.</description>
    </item>
    <item>
      <title>Subsetting a dataframe in Python</title>
      <link>https://manishbarnwal.com/posts/subsetting_dataframe_python/</link>
      <pubDate>Mon, 18 Jun 2018 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/subsetting_dataframe_python/</guid>
      <description>This tutorial is on how to filter rows of a dataframe based on conditions on columns in dataframe in Python.</description>
    </item>
    <item>
      <title>Working with dates in Python</title>
      <link>https://manishbarnwal.com/posts/working_with_dates_in_python/</link>
      <pubDate>Mon, 18 Jun 2018 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/working_with_dates_in_python/</guid>
      <description>This tutorial is on ways to deal with dates in Python.</description>
    </item>
    <item>
      <title>git and github for data scientists</title>
      <link>https://manishbarnwal.com/posts/git_and_github_for_data_scientists/</link>
      <pubDate>Wed, 18 Apr 2018 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/git_and_github_for_data_scientists/</guid>
      <description>This is a tutorial on git and github for anyone who wants to use it.</description>
    </item>
    <item>
      <title>Creating a virtual environment in Python</title>
      <link>https://manishbarnwal.com/posts/virtual_environment_in_python/</link>
      <pubDate>Tue, 23 Jan 2018 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/virtual_environment_in_python/</guid>
      <description>This is a tutorial on how to create a virtual environment in Python.</description>
    </item>
    <item>
      <title>Common docker commands</title>
      <link>https://manishbarnwal.com/posts/common_docker_commands/</link>
      <pubDate>Thu, 23 Nov 2017 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/common_docker_commands/</guid>
      <description>This post is a tutorial on the commonly used docker commands.</description>
    </item>
    <item>
      <title>How to choose the probability cut-off in classification problem</title>
      <link>https://manishbarnwal.com/posts/choosing_probability_cut-off_in_classification/</link>
      <pubDate>Thu, 18 May 2017 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/choosing_probability_cut-off_in_classification/</guid>
      <description>This post describes how to choose the probability cut-off in classification problem.</description>
    </item>
    <item>
      <title>Tutorial on dplyr- a package for data manipulation in R</title>
      <link>https://manishbarnwal.com/posts/tutorial_on_dplyr/</link>
      <pubDate>Mon, 15 May 2017 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/tutorial_on_dplyr/</guid>
      <description>This post is a tutorial on dplyr - a package for data manipulation in R.</description>
    </item>
    <item>
      <title>The essence of machine learning is function estimation</title>
      <link>https://manishbarnwal.com/posts/machine_learning_function_estimation/</link>
      <pubDate>Fri, 12 May 2017 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/machine_learning_function_estimation/</guid>
      <description>This post talks explains how the essence of machine learning is function estimation.</description>
    </item>
    <item>
      <title>Time series and forecasting using R</title>
      <link>https://manishbarnwal.com/posts/time_series_and_forecasting_using_r/</link>
      <pubDate>Wed, 03 May 2017 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/time_series_and_forecasting_using_r/</guid>
      <description>This post talks about time series and forecasting.</description>
    </item>
    <item>
      <title>Diving into H2O with R</title>
      <link>https://manishbarnwal.com/posts/h2o_with_r/</link>
      <pubDate>Tue, 28 Mar 2017 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/h2o_with_r/</guid>
      <description>This post talks about building machine learning models in H2O using R.</description>
    </item>
    <item>
      <title>An illustrated introduction to adversarial validation part 2</title>
      <link>https://manishbarnwal.com/posts/introduction_to_adversarial_validation/</link>
      <pubDate>Thu, 16 Feb 2017 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/introduction_to_adversarial_validation/</guid>
      <description>This post talks about adversarial validation- an approach to the problem of differently distributed training and test data.</description>
    </item>
    <item>
      <title>An illustrated introduction to adversarial validation part 1</title>
      <link>https://manishbarnwal.com/posts/introduction_to_adversarial_validation/</link>
      <pubDate>Wed, 15 Feb 2017 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/introduction_to_adversarial_validation/</guid>
      <description>This post talks about adversarial validation- an approach to the problem of differently distributed training and test data.</description>
    </item>
    <item>
      <title>How to use Git and Github</title>
      <link>https://manishbarnwal.com/posts/git_and_github/</link>
      <pubDate>Wed, 15 Feb 2017 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/git_and_github/</guid>
      <description>This post talks about the usage of git and github</description>
    </item>
    <item>
      <title>The curse of bias and variance [draft]</title>
      <link>https://manishbarnwal.com/posts/the_curse_of_bias_and_variance/</link>
      <pubDate>Wed, 08 Feb 2017 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/the_curse_of_bias_and_variance/</guid>
      <description>This post talks about the trade-off between bias and variance.</description>
    </item>
    <item>
      <title>Visualization in ML is under-rated</title>
      <link>https://manishbarnwal.com/posts/visualization_is_under-rated/</link>
      <pubDate>Fri, 27 Jan 2017 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/visualization_is_under-rated/</guid>
      <description>This post talks about visualization -one of the most important aspects of data science that often gets ignored.</description>
    </item>
    <item>
      <title>Random Forest explained intuitively</title>
      <link>https://manishbarnwal.com/posts/random_forest_explained_intuitively/</link>
      <pubDate>Tue, 18 Oct 2016 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/random_forest_explained_intuitively/</guid>
      <description>This post explains random forest intuitively</description>
    </item>
    <item>
      <title>Improve runtime of Random Forest in R</title>
      <link>https://manishbarnwal.com/posts/improve_runtime_random_forest_r/</link>
      <pubDate>Thu, 13 Oct 2016 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/improve_runtime_random_forest_r/</guid>
      <description>This post talks about a trick to improve the runtime of random forest package in R for large datasets</description>
    </item>
    <item>
      <title>How to install a package of a particular version in R</title>
      <link>https://manishbarnwal.com/posts/install_a_package_particular_version_in_r/</link>
      <pubDate>Wed, 05 Oct 2016 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/install_a_package_particular_version_in_r/</guid>
      <description>This post explains how to install a package of a particular version in R</description>
    </item>
    <item>
      <title>Shell commands come in handy for a data scientist</title>
      <link>https://manishbarnwal.com/posts/shell_commands_for_data_scientist/</link>
      <pubDate>Fri, 30 Sep 2016 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/shell_commands_for_data_scientist/</guid>
      <description>This post talks about shell commands that come in handy for data-science people</description>
    </item>
    <item>
      <title>ROC and AUC - The three lettered acronyms</title>
      <link>https://manishbarnwal.com/posts/three_letter_acronym_roc_and_auc/</link>
      <pubDate>Mon, 26 Sep 2016 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/three_letter_acronym_roc_and_auc/</guid>
      <description>This post talks about roc curve and confusion matrix.</description>
    </item>
    <item>
      <title>Vim/Vi editor shortcuts</title>
      <link>https://manishbarnwal.com/posts/vim_shortcuts/</link>
      <pubDate>Thu, 22 Sep 2016 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/vim_shortcuts/</guid>
      <description>This post talks shortcuts in vim editor</description>
    </item>
    <item>
      <title>When R package is not available across the cluster</title>
      <link>https://manishbarnwal.com/posts/r_packages_not_in_cluster/</link>
      <pubDate>Tue, 02 Aug 2016 00:00:00 +0000</pubDate>
      <guid>https://manishbarnwal.com/posts/r_packages_not_in_cluster/</guid>
      <description>This post talks about a hack which is useful if a particular R package is unavailable across the cluster</description>
    </item>
  </channel>
</rss>
