mini-assignment
NYPD Vehicle Collisions
- Get the data (from here!)
- Aggregate injuries / involvement of vehicles, pedestrians, cyclists in some way
- Plot your findings by some other feature (vehicle type, borough, cross streets, etc.)
- Tell us something about date or time of day and traffic crashes.
- Discuss potential research questions that you could begin to answer with this data: have intersection reconstructions or :traffic_light: (e.g. West End Ave; West 96th St/Broadway) reduced collisions or their severity? Do bike lanes make roads safer for cyclists? Are :bicyclist: a menace to :walking:?
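As one possible starting point for the aggregation and time-of-day steps above, here is a minimal sketch using only the standard library. The column names (`BOROUGH`, `TIME`, `NUMBER OF CYCLIST INJURED`) and the sample rows are assumptions made up for illustration; check them against the header of the file you actually download.

```python
import csv
import io
from collections import Counter

# Tiny made-up sample standing in for the downloaded CSV. The column names
# are assumptions and should be checked against the real header row.
SAMPLE = """BOROUGH,TIME,NUMBER OF PEDESTRIANS INJURED,NUMBER OF CYCLIST INJURED
MANHATTAN,8:15,1,0
BROOKLYN,17:40,0,2
MANHATTAN,17:05,0,1
,23:30,1,0
"""

def summarize(csv_file):
    """Return (cyclist injuries per borough, crash counts per hour of day)."""
    cyclists_by_borough = Counter()
    crashes_by_hour = Counter()
    for row in csv.DictReader(csv_file):
        borough = row["BOROUGH"] or "UNKNOWN"  # treat a blank borough as UNKNOWN
        cyclists_by_borough[borough] += int(row["NUMBER OF CYCLIST INJURED"] or 0)
        hour = int(row["TIME"].split(":")[0])  # "17:40" -> 17
        crashes_by_hour[hour] += 1
    return cyclists_by_borough, crashes_by_hour

by_borough, by_hour = summarize(io.StringIO(SAMPLE))
print(by_borough.most_common())
print(sorted(by_hour.items()))
```

To run it on real data, pass `open("your_snapshot.csv", newline="")` instead of the `io.StringIO` sample (the file name is hypothetical). The two counters can then be fed to whatever plotting library you prefer.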
Extra challenge 1
Get an API key and write a script that makes a query (you can hard-code the query; the point is to do away with file snapshots).
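A hard-coded query might look roughly like the sketch below. It assumes the data is served through a Socrata-style endpoint that accepts `$where` and `$limit` parameters. The endpoint URL, the dataset identifier in it, and the `borough` field name are all assumptions to verify on the dataset's API page.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed endpoint; confirm the URL and dataset id on the data portal.
ENDPOINT = "https://data.cityofnewyork.us/resource/h9gi-nx95.json"

def build_url(where, limit=1000, app_token=None):
    """Build a query URL from a SoQL-style filter and a row limit."""
    params = {"$where": where, "$limit": limit}
    if app_token:
        params["$$app_token"] = app_token  # your own key goes here
    return ENDPOINT + "?" + urlencode(params)

def fetch(url):
    """Download the query result and decode it from JSON."""
    with urlopen(url) as response:
        return json.load(response)

# The field name "borough" is an assumption about the API's column naming.
url = build_url("borough = 'MANHATTAN'", limit=5)
print(url)
# rows = fetch(url)  # uncomment to actually hit the network
```

Keeping URL construction separate from the download makes it easy to inspect the query before sending it.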
Extra challenge 2
Parse the (long, lat) string into numeric longitude and latitude columns.
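One way to approach the parsing is a small regular expression, sketched below. The helper name `parse_pair` is hypothetical, and the sample string is made up. The function returns the two numbers in the order they appear, so check which one is the longitude in your data: New York longitudes are near -74 and latitudes near 40.7.

```python
import re

# Matches "(a, b)" where a and b are optionally signed decimal numbers.
_PAIR = re.compile(r"\(\s*(-?\d+(?:\.\d+)?)\s*,\s*(-?\d+(?:\.\d+)?)\s*\)")

def parse_pair(text):
    """Return the two numbers in a '(a, b)' string as floats, or None."""
    match = _PAIR.fullmatch(text.strip()) if text else None
    if match is None:
        return None  # blank or malformed location
    return float(match.group(1)), float(match.group(2))

first, second = parse_pair("(-73.97, 40.78)")
print(first, second)
```

Returning `None` for blank or malformed values lets you decide later whether to drop those rows or keep them with missing coordinates.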
Extra challenge 3
Using the API, a filter, or another snapshot slice, compare your results with those for another month, quarter, or year.
- link to malecki's mini presentation messing with maps with this data here.
Disease Cases in the US over time
Use this from WSJ as a starting point.
The data comes from the CDC, via Project Tycho.
- Get the data for one or more diseases
- Read the data into R or Python
- Aggregate to years
- Aggregate to regions
- Make plots by year and/or week and discuss.
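The two aggregation steps above could be sketched as follows. This assumes a long-format file with one row per state and week, a `YYYYWW` week code in a column called `epi_week`, and a `-` placeholder for missing counts. The column names, the sample rows, and the `REGION` lookup are all made up for illustration.

```python
import csv
import io
from collections import defaultdict

# Made-up sample rows; the real file's layout may differ.
SAMPLE = """epi_week,state,cases
192801,NY,10
192802,NY,12
192801,CA,3
192901,NY,7
192901,CA,-
"""

# Hypothetical state-to-region lookup; extend it to cover every state.
REGION = {"NY": "Northeast", "CA": "West"}

def yearly_totals(csv_file):
    """Sum weekly case counts into (year, region) totals."""
    totals = defaultdict(int)
    for row in csv.DictReader(csv_file):
        try:
            cases = int(row["cases"])
        except ValueError:
            continue  # skip rows whose count is missing or non-numeric
        year = int(row["epi_week"][:4])  # first four digits are the year
        totals[(year, REGION[row["state"]])] += cases
    return dict(totals)

totals = yearly_totals(io.StringIO(SAMPLE))
print(totals)
```

Skipping missing counts treats them as zero in the totals, which can understate cases in years with sparse reporting. That is worth mentioning when you discuss the plots.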
Extra challenge 1
Parse the ‘week’ information into an actual date with week resolution.
Does the seasonality vary across states or regions?
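A rough way to turn a week code into a date is sketched below. It assumes the code is a `YYYYWW` string and swaps in ISO week numbering (weeks start on Monday) as an approximation. CDC epidemiological weeks start on Sunday, so the resulting dates may be shifted slightly. The helper name `week_start` is hypothetical, and `date.fromisocalendar` requires Python 3.8 or later.

```python
from datetime import date

def week_start(epi_week):
    """Map a 'YYYYWW' code to the Monday of that ISO week.

    ISO weeks are used here as an approximation of epidemiological weeks.
    Raises ValueError if the year has no such ISO week (e.g. week 53).
    """
    text = str(epi_week)
    year, week = int(text[:4]), int(text[4:])
    return date.fromisocalendar(year, week, 1)  # 1 = Monday

print(week_start("202001"))
```

With real dates in hand, grouping by `week_start(...).month` per state or region gives a quick first look at seasonality.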
Extra challenge 2
Plot something from this dataset on a map: a snapshot, a summary, a comparison, or all of them.