Datacracy

Posts

May 5, 2022

Mountain Bike Categorization Analysis

For this post, I worked with Mike Czerwinski to determine whether the specifications of mountain bikes (MTB) are enough to differentiate between the different types of mountain bike categories.
March 10, 2021

The Federalist Papers | NLP Analysis (Part 1)

When Alexander Hamilton, John Jay, and James Madison came together in support of the ratification of the Constitution, they created what has become one of the most celebrated series of political texts in history. The Federalist Papers, a series of 85 essays, helped push New Yorkers towards ratification and laid out some of the strongest arguments in favor of a strong Federal Government. In part one of our analysis, we perform an Exploratory Data Analysis (EDA) using modern Natural Language Processing (NLP) techniques to better understand these essays.
January 26, 2021

Polling Places | Exploratory Data Analysis

In a democracy, the polling booth is much more than a location where an unwitting citizen fills in a bubble on a piece of paper in a makeshift booth. It is a symbol of the promise of democracy, a connector between the citizen and the government. It is the shrine that, if protected and respected, powers a democracy. This article analyzes that symbol of American democracy by taking an analytical look at a dataset of U.
December 20, 2020

Reading Multiple CSVs into Merged R Dataframe

The purpose of this script is to load all of the various .csv files containing polling place data into R and clean them. The data, which is available for download here, is structured as follows: each state (32 in total) has its own folder, and within each state folder there is a variable number of CSV files, one for each year that polling place data is available.
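The folder layout described above (one folder per state, one CSV per year) can be read into a single data frame along the following lines. This is a minimal base R sketch, not the script from the post: the function name `read_polling_places`, the added `state` and `source_file` columns, and the choice to read every column as character are all assumptions made for illustration.

```r
# Hypothetical helper (not from the original post): read every CSV found
# under `root`, where each sub-folder is assumed to be one state and each
# file one year, and stack the results into a single data frame.
read_polling_places <- function(root) {
  files <- list.files(root, pattern = "\\.csv$", recursive = TRUE,
                      full.names = TRUE, ignore.case = TRUE)

  frames <- lapply(files, function(path) {
    # Read everything as character so that files whose column types
    # differ from year to year can still be combined.
    df <- read.csv(path, stringsAsFactors = FALSE, colClasses = "character")
    # Assumed bookkeeping columns: the parent folder name stands in for
    # the state, and the file name records where each row came from.
    # These overwrite any existing columns with the same names.
    df$state <- rep(basename(dirname(path)), nrow(df))
    df$source_file <- rep(basename(path), nrow(df))
    df
  })

  # Files may not share the same columns, so pad each frame with NA
  # for any column it lacks before row-binding.
  all_cols <- unique(unlist(lapply(frames, names)))
  frames <- lapply(frames, function(df) {
    for (col in setdiff(all_cols, names(df))) {
      df[[col]] <- rep(NA_character_, nrow(df))
    }
    df[all_cols]
  })

  # Returns NULL when no CSV files are found.
  do.call(rbind, frames)
}
```

Calling `read_polling_places("path/to/data")`, where the path is a placeholder for wherever the downloaded folders live, would return one row per row across all files. Reading every column as character defers type conversion to a later cleaning step, which seems the safer default when column formats vary between states and years.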
September 12, 2020

Women in Politics

It all started in Montana. In 1916, Jeannette Rankin, a peace activist and a strong advocate for women’s suffrage, broke down centuries-long barriers by becoming the first woman elected to Congress. Since then, 366 women have served in the U.S. Congress, and thousands more in various elected and executive offices at the state level. In this post, we will analyze the Women in Politics dataset published by the Eagleton Institute of Politics’ Center for American Women and Politics.
© Datacracy 2024