Mark SubraData Analysts — Jr. Data Scientists or Different Role Alltogether?In my last post I went over the differences and similarities between data engineers and data scientists. In this post I’ll go over the…2 min read·Dec 6, 2020----
Mark SubraData Engineering vs Data ScienceThe terms data science and data engineering get thrown around a log, but what is the difference? What are the similarities? Both have to…2 min read·Nov 21, 2020----
Mark SubrainAnalytics VidhyaHands-On Machine Learning — Simple Linear RegresPerhaps one of the best books on python and data science is Géron’s Hands-On Machine Learning with Scikit-Learn, Keras & TensorFlow. This…2 min read·Nov 7, 2020----
Mark SubraPython vs R for Data SciencePython and R both have their strengths and weaknesses when it comes to data science. One language isn’t necessarily better than the other…3 min read·Oct 31, 2020----
Mark SubrainAnalytics VidhyaPredicting Diabetes with Neural NetworksDiabetes is a chronic medical condition which is estimated to affect 415 million people in the world. 5 million deaths a year can be…4 min read·Oct 24, 2020----
Mark SubrainDataDrivenInvestorData Analytics for Football: Why the Vikings lost to the SeahawksFootball is a wonderfully complicated game. It is akin to two armies lining up and moving up and down the battlefield. Coaches routinely…6 min read·Oct 16, 2020----
Mark SubrainAnalytics VidhyaImage Classification: Cats and Dogs — Pre-trained Neural Network vs ConstructedChapter 4 of Neural Network Projects with Python goes through a guided project for classifying cats and dogs from a dataset provided by…4 min read·Oct 10, 2020----
Mark SubrainDataDrivenInvestorNeural Network Projects with PythonI recently came across a great resource, Neural Network Projects with Python, by James Loy. I am fascinated with neural networks and their…2 min read·Oct 3, 2020----
Mark SubraGaussian Mixture ModelsClustering is an important technique for unsupervised learning algorithms. It refers to grouping similar data points by their attributes…2 min read·Sep 26, 2020----
Mark SubraUnsupervised Learning — k-Means ClusteringPerhaps one of the most widely used methods of unsupervised learning is k-Means clustering. In simple terms, the algorithm partitions the…2 min read·Sep 19, 2020----