Web scraping using Python and Beautiful Soup
-
We walk you through how to scrape information from a website using Python and Beautiful Soup and export it to a csv. [read more]
Reading and writing to Google Spreadsheets using Python
We show you how to read and write to a Google Spreadsheet using Python. You can follow along and finish with a Google sheet where you can import/export data into/out of a Pandas dataframe. [read more]
Using Reddit API to find the most popular domains
-
We show you how to use Python to scrape the top domains (urls) posted on Reddit. We use this at Find-me to help identify content creators to add to our database. [read more]
Sending email using Python and Gmail
-
We walk you through how to send email using Python. You'll learn how to build an html template and send email programmatically through your gmail account. [read more]
Correlating stock returns using Python
-
We walk you through how to pull stock returns via the IEX api, correlate against each other, and visualize in a heatmap. [read more]
Which McDonald's menu items get you the most calories/dollar?
-
We stack rank McDonald's items on a dollar per calorie and dollar per protein calorie basis. [read more]
US Counties with the highest annual savings potential
-
See which U.S. counties allow you to save the most money each year. We compared median income to minimum necessary cost of living to arrive at a list of counties which allow residents to maximize savings. [read more]
11 websites to find free, interesting datasets
-
If you're new to the data space, looking to practice a new skill, or just trying to build a more robust data science/analyst portfolio, a perfect way of solidifying your skills is to do some mini-projects using publicly available datasets. Here we outline a few places you can find publicly available data for your next project. [read more]
A curated list of 70 data science interview questions
-
Interviewing for a role as a data scientist or analyst? You've come to the right place! Here we've curated a list of data science interview questions from multiple sources to help make your preparation easier. [read more]
The web of board members that link American corporations, mapped
-
78% of the top 50 companies in the S&P 500 are directly connected through one or more board members, explore the connections through an interactive data visualization. [read more]
Why you're stuck with an open office floor plan
-
Offices have gotten more crowded 'open' in recent years, with the average square feet per person shrinking ~33% from 225 in 2010 to just ~150 in 2017. This can generally be attributed to companies adopting 'open offices'... [read more]
Intro to machine learning with sci-kit learn
-
-
We will walk through an example that involves training a model to tell what kind of wine will be "good" or "bad" based on a training set of wine chemical characteristics. You'll learn how to train a model and evaluate its efficacy using sci-kit learn. [read more]
SQL Joins
-
-
Overview of the different types of SQL joins and how to use them. Very practical guide for beginner-left analyst looking to get more familiar with SQL. [read more]
SQL 101 - SELECT, FROM, WHERE Clauses
-
-
Introduction to SQL that will take you through the minimum requirements of building a SQL query to pull data, using SELECT, FROM, and WHERE clauses. [read more]
Which news publications have the most positive headlines? (simple sentiment analysis with Python using Textblob)
-
-
Walking through an example of running sentiment analysis on news headlines in Python, using pre-trained classifier from Textblob. Complete walk through from loading data in, feeding through model, and analyzing output. [read more]
The never-ending variety of breakfast cereal
-
-
Over half of all cereals have been on the market for 5 years or less. Check out a few stats we pulled as we go way too deep into an analysis on breakfast cereal. [read more]
45 data analyst interview questions
-
Interviewing for a role as a data analyst? You've come to the right place! Here we've curated a list of data analyst interview questions from multiple sources to help make your preparation easier. [read more]
Intro to forecasting with Prophet in python
-
Prophet is a forecasting tool developed by Facebook to quickly forecast time series data, available in R and Python. In this post I'll walk you through a quick example of how to forecast U.S. candy sales using Prophet and Python. [read more]
On average, foregoing college in favor of entering the workforce + investing tuition costs nets higher returns
-
We've all heard how expensive college is getting, along with plenty of criticism surrounding its value in a changing job market. Of course, there are many benefits beyond the monetary ones that should be considered when exploring college options, but for the purpose of this post... [read more]
Data Science Case Study Example 1: An eCommerce Company Evaluation
-
You’re a Data Scientist / Business Analyst working for a new eCommerce company called A&B Co. (similar to Amazon) and you’ve been asked to prepare a presentation for the Vice President of Sales and the Vice President of Operations that summarizes sales and operations thus far... [read more]
Introduction to Linear Regression using Python
-
This post will walk through a practice problem which analyzes NBA player's season stats. We've also provided the practice problem's... [read more]
Simulating future stock prices using Monte Carlo methods in Python
-
Here we'll step through simulating future stock prices using Monte carlo. Everything from pulling the data to assessing the simulation's output.[read more]
Calculating Value at Risk (VaR) of a stock portfolio using Python
-
Walkthrough of what Value at Risk (VaR) is, how it's calculated, and a full example of calculating the metric for a stock portfolio (from pulling the data to calculating to checking the distributions).[read more]