This is a repository for my final class project for the course STA/OPR9750 Software Tools for Data Analysis (Fall 2021 semester) at the Zicklin School of Business. This was an individual class project and involved using R Markdown to create a PDF report.
The goal of this analysis was to study the film permit dataset published on NYC Open Data and to assess how much of a burden film permits place on NYPD resources. Three datasets were used:
- Film permit data provided by the Mayor’s Office of Media and Entertainment
- NYPD precinct map data
- NYPD complaint data
Permits are required for productions that use large equipment in public locations or production vehicles that require parking privileges. Additionally, any production that needs exclusive use of NYC-owned property, or uses prop weapons, prop vehicles, actors in police uniform, or stunts is required to file a permit. Most uses of a permit require the assistance of the NYPD Movie/TV Unit, which provides its services free of charge to media productions. Services range from posting No Parking Orders to lending NYPD presence for safety and crowd-control purposes. Therefore, any NYPD precinct that serves an area for which a large number of film permits are filed has to dedicate resources to handling the privileges granted to production crews.
The analysis relied on the following R packages:
- sf: used to encode spatial vector data
- tidyverse: a set of packages used in everyday data analysis. Tidyverse packages used in this project include:
  - readr: to read in data
  - dplyr: to manipulate and transform data, such as through filtering, mutating, arranging or grouping
  - ggplot2: to create graphics to visualize data
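
As a rough illustration of how these packages can fit together, the sketch below counts permits per precinct and attaches the counts to precinct polygons. It is not the code from the report: the toy data, the column names (`event_id`, `precinct`, `n_permits`), and the helper `make_square()` are all assumptions made up for this example, and the square "precincts" stand in for the real NYPD precinct map.

```r
library(sf)
library(readr)
library(dplyr)
library(ggplot2)

# Toy permit records (hypothetical columns), written to a temporary CSV
# so that readr has something to read back in.
permit_path <- tempfile(fileext = ".csv")
write_csv(
  tibble(event_id = 1:6, precinct = c(1, 1, 2, 2, 2, 3)),
  permit_path
)
permits <- read_csv(permit_path)

# dplyr: count the number of permits filed in each precinct.
permit_counts <- permits %>%
  group_by(precinct) %>%
  summarise(n_permits = n())

# sf: build three unit squares as stand-ins for precinct boundaries.
# make_square() is a hypothetical helper, not part of sf.
make_square <- function(x0) {
  st_polygon(list(rbind(
    c(x0, 0), c(x0 + 1, 0), c(x0 + 1, 1), c(x0, 1), c(x0, 0)
  )))
}
precincts <- st_sf(
  precinct = c(1, 2, 3),
  geometry = st_sfc(make_square(0), make_square(1), make_square(2))
)

# Join the counts onto the polygons; the result is still an sf object.
precinct_map <- left_join(precincts, permit_counts, by = "precinct")

# ggplot2: shade each precinct by its permit count.
permit_map_plot <- ggplot(precinct_map) +
  geom_sf(aes(fill = n_permits)) +
  labs(fill = "Film permits")
```

Printing `permit_map_plot` renders the map. With real data, the temporary CSV would be replaced by the downloaded permit file and the squares by the precinct boundaries read with `st_read()`.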
The main files in this repository are:
- R Markdown Code: the code used to generate the PDF report
- R Markdown Report: the PDF report generated from R Markdown, including comments, code chunks and written analysis
The other files in this repository are supporting files imported into the R Markdown document.