Case Studies in Big Data: Joshua Cook
Joshua Cook
Chapter 1
Programming
Fundamentals
Chapter 2
APIs
Chapter 3
MongoDB
Chapter 4
Aggregation
Process Users
Intro to ETL
The DAG
• $project
• $match
• $sample
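As a minimal sketch of how the three stages above chain together, the pipeline below is written as a plain Python list of stage documents, the form pymongo's `aggregate` method accepts. The field names `user_id` and `text`, the helper name `build_sample_pipeline`, and the default sample size are assumptions for illustration only; they do not come from the outline.

```python
def build_sample_pipeline(sample_size=100):
    """Return a three-stage aggregation pipeline as a list of stage documents.

    Field names ("user_id", "text") are hypothetical placeholders.
    """
    return [
        # $match: keep only documents that contain a "text" field
        {"$match": {"text": {"$exists": True}}},
        # $project: drop _id and pass through only the two fields of interest
        {"$project": {"_id": 0, "user_id": 1, "text": 1}},
        # $sample: draw a random subset of the remaining documents
        {"$sample": {"size": sample_size}},
    ]


pipeline = build_sample_pipeline(50)

# With a live pymongo collection (not created here), the pipeline would be
# executed as: results = list(collection.aggregate(pipeline))
```

Each stage consumes the output of the stage before it, so placing `$match` first reduces the number of documents that the later stages have to handle.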
3. seven DAGs
• count by day
• plot points with folium
• parse text topics
• counts by day of week and hour
• unique users
• check to see if all users have been processed
• find duplicate users
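Two of the items above, counting by day and finding duplicate users, might be sketched as the following pipelines. This is an illustration under stated assumptions, not the chapter's own implementation: the field names `created_at` (assumed to be stored as a BSON date) and `user_id`, and both helper names, are hypothetical.

```python
def build_count_by_day_pipeline(date_field="created_at"):
    """Count documents per calendar day (assumes date_field holds a BSON date)."""
    return [
        # Group on the date formatted as YYYY-MM-DD and count documents per group
        {
            "$group": {
                "_id": {
                    "$dateToString": {"format": "%Y-%m-%d", "date": f"${date_field}"}
                },
                "count": {"$sum": 1},
            }
        },
        # Sort chronologically by the formatted date string
        {"$sort": {"_id": 1}},
    ]


def build_duplicate_users_pipeline(user_field="user_id"):
    """Find values of user_field that occur in more than one document."""
    return [
        # Group by the user identifier and count occurrences
        {"$group": {"_id": f"${user_field}", "count": {"$sum": 1}}},
        # Keep only identifiers that appear more than once
        {"$match": {"count": {"$gt": 1}}},
        # List the most frequently duplicated identifiers first
        {"$sort": {"count": -1}},
    ]


by_day = build_count_by_day_pipeline()
duplicates = build_duplicate_users_pipeline()

# With a live pymongo collection (not created here), either pipeline would be
# executed as: list(collection.aggregate(by_day))
```

Both pipelines rely on `$group` with a `$sum` accumulator; they differ only in the grouping key and in what happens to the counts afterwards.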
Chapter 5
Framework
Data Preparation
Implementation
Refinement
Model Evaluation
Model Justification
Presentation of Results