Mark SubraData Analysts — Jr. Data Scientists or Different Role Alltogether?In my last post I went over the differences and similarities between data engineers and data scientists. In this post I’ll go over the…Dec 6, 2020Dec 6, 2020
Mark SubraData Engineering vs Data ScienceThe terms data science and data engineering get thrown around a log, but what is the difference? What are the similarities? Both have to…Nov 21, 2020Nov 21, 2020
Mark SubrainAnalytics VidhyaHands-On Machine Learning — Simple Linear RegresPerhaps one of the best books on python and data science is Géron’s Hands-On Machine Learning with Scikit-Learn, Keras & TensorFlow. This…Nov 7, 2020Nov 7, 2020
Mark SubraPython vs R for Data SciencePython and R both have their strengths and weaknesses when it comes to data science. One language isn’t necessarily better than the other…Oct 31, 2020Oct 31, 2020
Mark SubrainAnalytics VidhyaPredicting Diabetes with Neural NetworksDiabetes is a chronic medical condition which is estimated to affect 415 million people in the world. 5 million deaths a year can be…Oct 24, 2020Oct 24, 2020
Mark SubrainDataDrivenInvestorData Analytics for Football: Why the Vikings lost to the SeahawksFootball is a wonderfully complicated game. It is akin to two armies lining up and moving up and down the battlefield. Coaches routinely…Oct 16, 2020Oct 16, 2020
Mark SubrainAnalytics VidhyaImage Classification: Cats and Dogs — Pre-trained Neural Network vs ConstructedChapter 4 of Neural Network Projects with Python goes through a guided project for classifying cats and dogs from a dataset provided by…Oct 10, 2020Oct 10, 2020
Mark SubrainDataDrivenInvestorNeural Network Projects with PythonI recently came across a great resource, Neural Network Projects with Python, by James Loy. I am fascinated with neural networks and their…Oct 3, 2020Oct 3, 2020
Mark SubraGaussian Mixture ModelsClustering is an important technique for unsupervised learning algorithms. It refers to grouping similar data points by their attributes…Sep 26, 2020Sep 26, 2020
Mark SubraUnsupervised Learning — k-Means ClusteringPerhaps one of the most widely used methods of unsupervised learning is k-Means clustering. In simple terms, the algorithm partitions the…Sep 19, 2020Sep 19, 2020