Application

Application#

Click here for a Jupyter Notebook template for this chapter’s application problems.

All the materials to reproduce your work (notebook and data files, API call, etc) should be included in the assignment Google Drive folder.

Question 1#

Q1A: Identify an aspect of your final project data (or another civic dataset) that you want to analyze. Answer to this question briefly describes the data you’re working with and what aspects of it you want to analyze.

Q1B: Develop an outline for your data analysis workflow. This could be a list with bullet points, a narrative, or a visual diagram (or a combination of these elements). Answer to this question includes notes on a desired final data structure and preliminary outputs.

  • NOTE: No code is required as part of this answer.

Q1C: Develop a Python program that calculates at least 3 summary or descriptive statistics for your dataset. At least one of these programs should include a .groupby() or filtering operation. Answer to this question includes code + comments.

Q1D: What challenges did you encounter? How did you approach solving them?

Question 2#

Q2A: Using your Q1 dataset, another aspect of your final project data, or another civic dataset, identify one of the more advanced modeling workflows you want to explore. Answer to this question briefly describes what workflow you’re wanting to explore (regression, classification, etc) and how it’s a good fit for your dataset.

Q2B: Using the resources linked in the modeling section of this chapter as a starting point, develop an outline for this data analysis workflow. This could be a list with bullet points, a narrative, or a visual diagram (or a combination of these elements). Answer to this question includes notes on a desired final data structure and preliminary outputs.

  • NOTE: No code is required as part of this answer.

Q2C: Develop a Python program that implements your Q2B outline. Answer to this question includes code + comments.

  • NOTE: You may not get to a full working program. That’s fine! Get as far as you can. Pseudocode can be helpful.

Q2D: What challenges did you encounter? How did you approach solving them?

Question 3#

Q3A: Build on your Q1 & Q2 work to develop an analysis outline or workflow for your final project data. Start by briefly describing the data you’re working with and what aspects of it you want to analyze. Answer to this question briefly describes what workflows you’re wanting to explore and how they’re a good fit for your dataset

A couple places to start:

The “Choosing a Chart Type” page (from an upcoming textbook chapter) can also be helpful.

If you’re wanting to explore more advanced analysis/modeling workflows, including text or geospatial data:

Q3B: Develop an outline for this data analysis workflow. This could be a list with bullet points, a narrative, or a visual diagram (or a combination of these elements). Answer to this question includes notes on a desired final data structure and preliminary outputs.

Q3C: Develop a Python program that implements your Q3B outline. Answer to this question includes code + comments.

  • NOTE: You may not get to a full working programs for everything. That’s fine! Get as far as you can. Pseudocode can be helpful.

Q3D: What challenges did you encounter? How did you approach solving them?