Due: Tuesday, July 27, 2021 at 6 pm CDT
Throughout this semester, you have grown into an amazing Data Scientist! You are analyzing datasets in Python, performing advanced statistical tests, and finding the answers to complex questions using data. You have seen dozens of datasets we have provided. For the final project, we want you to teach us something about your Chicago community - we want to learn about something you are passionate about!
For this final project in Discovering Data Science, you will use Data Science to explore something you are passionate about or interested in learning more about in your Chicago community. At the end, you will write a small paper telling us about what you found and teaching us something! We only have a few minimal requirements:
To complete this project, there is no starter code – you are building it from scratch! However, we do want to check out your work, so make sure you keep your code, dataset, and other files together in one place so you can share or submit your project when you turn it in during Week 5.
Our hope is that you will use a dataset you are passionate about that relates to your Chicago community. It can be anything – a dataset from another class (e.g., any data you have worked with in Excel), a dataset you found online, or a dataset you gather yourself. Some ideas include:
The dataset must relate to Chicago. A great place to find lots of Chicago data is the Chicago Data Portal. You should be able to find plenty of datasets there, but if you want to look in other places for Chicago-related datasets, you can search through these other free resources that contain millions of datasets:
The major deliverables for this project are a small paper or report on what you found and your code. We want to learn something from you about your interest/passion in the Chicago community, so tell us a story about what you discovered!
The only requirements are:
You need to prepare a short 2-5 minute presentation (e.g., Google Slides, Prezi, or any other presentation tool you want to use) to share your exciting discovery with your class! You don’t have to explain any of the Python or Data Science to us, but you should not assume we know anything about your specific interest/passion.
Presentations will take place during the last two days of classes (i.e., Wednesday and Thursday, July 28-29). Presentation times will be announced later.
When you are ready, there are three things you will submit.
You can submit or share your deliverables using any method you prefer (e.g., Google Drive link, GitHub, or other methods), but you must make a submission in Gradescope indicating the method, with your files or link if applicable. If the submission fails, email your files or link to Jonas at wreger2@illinois.edu.
We can’t wait to read your project and see your presentation! :)
Once you have submitted your project and received feedback, you can upload your project description and link to your LinkedIn profile! Also, make sure the link is viewable by anyone so your future employers can see it! :)
Modified from Wade Fagen-Ulmschneider & Karle Flanagan’s STAT 107 - Fall 2019/Spring 2020 guides with permission.