mini-assignment

NYPD Vehicle Collisions

  1. Get the data (from here!)
  2. Aggregate injuries / involvement of vehicles, pedestrians, cyclists in some way
  3. Plot your findings by some other feature (vehicle type, borough, cross streets, etc.)
  4. Tell us something about date or time of day and traffic crashes.
  5. Discuss potential research questions that you could begin to answer with this data — have intersection reconstructions or :traffic_light: (e.g. West End Ave; West 96th St/Broadwayreduced collisions or their severity? Do bike lanes make roads safer for cyclists? Are :bicyclist: a menace to :walking:?

Extra challenge 1

Get an api key and write a script that makes a query (you can hard code the query, the point is to do away with file snapshots).

Extra challenge 2

Parse the (long, lat) string into numeric longitude and latitude columns.

Extra challenge 3

Using either the api, a filter, or another snapshot slice, compare to another month, quarter, or year.

  • link to malecki's mini presentation messing with maps with this data here.

Disease Cases in the US over time

Use this from WSJ as a starting point.

The data comes from the CDC, via Project Tycho.

  1. Get the data for one or more diseases
  2. Read the data into R or python
  3. Aggregate to years
  4. Aggregate to regions
  5. Make plots by year and/or week and discuss.

extra challenge 1

Parse the ‘week’ information into an actual date with Week resolution.

Does the seasonality vary across states or regions?

extra challenge 2

Plot something from this dataset on a map. Whether a snapshot, some summary, or some comparison, or all of them.

Published: February 12 2015

  • category: