Extra Credit Assignment


Quick Details

Assignment Due: Thursday, Dec 5th @ 1159 UTC

Peer-grading Due: Wednesday, Dec 11th @ 1159 UTC

Two deadlines = Two steps: You must submit a project AND complete peer grading in order to receive credit. If you miss either one of those things you will not get credit.

Point Value: Up to 3% added on to your final grade

Grading: Peer-graded (brief rubric below)

Where/how to submit work: In the Extra credit assignment. You can find it linked towards the bottom of Canvas/EdX titled "Extra Credit". Complete your work there and hit "submit".

No Plagiarism: This will result in a 0 on the extra credit, and a report to GT OSI


Sections

Your Task

General Requirements

Technical Requirements

Grading

FAQ


Your Task

Assignment: Pick a topic that interests you. Find some data related to that topic and come up with some questions you might answer using that data. Create a Jupyter notebook that processes the data and implements an analysis to address your questions. We expect your notebook will need to clean the data, analyze it using tools you’ve learned in this class, and produce some outputs that will help you show what you’ve learned from the data. Remember that your outputs may disprove your original theory or be inconclusive, a situation that is very common in analytics; this result is okay if you demonstrate a solid effort to answer your questions.

Requirements

Below are the specific requirements for your project. Outside of what's required below, the project is open-ended and other design considerations are up to you.

General Requirements

  • Project Structure:
    • Jupyter Notebook: All of your work should be in here. This includes all of the code, comments on the code, markdown blocks with a write-up explaining what you're doing. All of your charts/final produts should also be in here
    • Data files: All of the data you need
  • Notebook content:
    • Your analysis should look similar to what the exams and notebooks in this course look like. Your classmates will be reading and running your notebook from the top down, so ensure there is an explanation for each block of code that explains waht you're doing. It doesn't need to be long, but it does need to explain what's happening as your classmates will have limited time to review it.
  • Types of analysis:
    • Your analysis must make a solid and thoughtful attempt at answering the question you set out to answer. You're not required to do a regression analysis or anything specific, but your notebook should do more than perform a basic data exploration.
  • Presentation/Visualization:
    • You need to include a few visuals in the notebook, not including simply displaying dataframes. THese charts should support what you're saying and make your analysis easier to understand.

Technical Requirements

These requirements are strict and you may receive a 0 if your project does not follow them.

  • You must upload the Jupyter notebook and data files into the Vocareum assignment.
  • You must then submit your work. Simply hit "submit" at the top right. You have unlimited submissions up until the deadline
  • Your work must be all self-contained in the Vocareum assignment. It cannot rely on any external connections (like to download data)
  • The data files should be no larger than 20MB
  • Running your notebook from top to bottom should not take more than 2 minutes.
  • Your code must not generate any errors. Any errors = 0 points
    • Your code may have warnings (red boxes), but you should document that so your peers understand the warning was expected
  • You can only import packages, you cannot install packages. import <package>
    • Even if you manually install a package, your peers won't be able to run the package. Therefore installing is not allowed
  • Your code must be setup to read the data in the current directory, or in a folder next to the current file
    • Ex: pd.read_csv('test_data_1.csv') is all you need. Do not use os.getcwd() or anything to generate a dynamic path. It won't work for grading. Simply read in the file directly like above

Grading

The section below applies to you as a person submitting a prjoect, then to you as a peer grader.

  • Your assignment will be peer-graded by 3 peers (a few will have only 2)
  • How peers should assign a grade:
    • Whole number only (round-up)
    • Score = 3/3 | Analysis is complete and well-documented. It includes helpful explanations and visuals.
    • Score = 2/3 | Analysis is thorough, but some parts are lacking depth.
    • Score = 1/3 | There was some work submitted but the analysis lacks depth, or there are other issues
    • Score = 0/3 | Notebook is missing, data files are missing, or there are errors. The analysis
  • The grade should be a 0 if the requirements were not met. Otherwise when assigning a score from 1-3, it is up to the grader's judgement.
  • As part of grading, you must provide a couple of comments. Providing meaningful feedback will help both parties learn, and is a core part of conducting a real-world analysis. You can provide some suggestions, list some questions you had, or offer suggestions for future reserach. When we review grades at the end, you may lose extra credit points if you did not provide comments to your classmates.
  • Note: Grades are not final until all of the peer reviews are completed for your submission. It will show a score as soon as one peer review is done, but that's not your final grade. Please do not make a post and ask us about this.

FAQ’s

Can you provide some example projects? The past and current midterms closely resemble the style we’re looking for, but that level of complexity is not necessary (but is doable!). In particular, look at how each one starts out with some data, goes through some cleaning and analysis, and then outputs something, whether it’s a specific output or just well-documented insights.

How long should I spend on this? We can’t answer that for you, but it’s reasonable to expect this to take 10-15 hours, depending on how comfortable you are with the material we’ve covered in the class so far. We’re not asking you to do anything new, just use things you’ve learned in the class.

How do I upload the dataset or files to Vocareum? Open up the assignment from Canvas/edX like a normal notebook. When you're in the folder view (click the Jupyter logo at the top left if you're in a notebook), you can click the "upload" button at the top right to upload files.

Can the deadline be extended for this? Unfortunately, no. We need to leave enough time for the teaching staff to calculate and input grades before the Georgia Tech grade submissions deadline.

I had a great idea and a good plan, but once I did the analysis the results weren’t clear. That’s ok! As long as you explain all of that in your notebook, you will still be eligible for credit. A well-documented project and breakdown of your findings (or lack thereof) is still useful. Academic research papers and real business projects often end with this result, it’s just part of the process.

The dataset I want to use can’t be publicly shared. Please choose a different project or dataset to use. Your peers need to be able to run your code against the data to see the results. You can investigate using a smaller piece of the dataset, or redacting some of it, but we recommend you don’t spend much time on that step. You’d be better served by finding a new dataset and focusing your time on analyzing it and building your project around that.

Can I create the notebook locally and then upload it? Yes, but your code must be executable within the Vocareum/Jupyter environment. The file will need to correctly link to the data file once uploaded, and you’ll need to make sure that any package differences don’t affect your work. We won’t be able to help you debug this, so you will need to make sure this causes 0 issues before the deadline.

Will there be partial credit? Grading will be done by your peers and may result in any number score in the range [0,3]. Everyone should be able to get either 2-3 points with enough effort.

I made a mistake in the in my submission, can I resubmit it? If it’s before the assignment deadline, then yes you can submit as many times as you want. When the deadline hits, your peer reviewers will be given only your latest submission.

Can I collaborate with another student? Yes, but each student will need to submit their own work. You can share a dataset and work together to brainstorm methodologies, but your analysis must be your own. Please also name the student you worked with in the top block of your notebook.

How do I access and submit my peer reviews? The peer review process is accessed through the same link in your LMS (Canvas or EdX). The day after the assignment closes (usually by noon ET the following day), you simply click on that link again and it will take you to an interface to view your peer submissions and submit grades + feedback.

My notebook returns a warning, is that ok? Errors are instant 0’s, but warnings are allowed. However, you should ensure that the warning you’re getting is not related to something you’re doing incorrectly. Sometimes they’re notifications of deprecated functionality, sometimes they help highlight a way you incorrectly referenced data.

Can my code import the data from a website instead of storing it in Vocareum? No, the data must be uploaded into the Vocareum environment for peer grading.

One of my peers gave me a 0/3 and I think I deserve higher, can you regrade it? No, there will be no re-grades.

The peer feedback window has an option for “recommend for future exam use”, what is this for? That’s a flag we ask you to enter [0,1] for where 1 indicates you think that assignment would be good for the instructors to review and use as inspiration for a future exam.

Can the TA’s take a look at my project to confirm it was submitted correctly? As long as you followed the instructions above, you should be good to go. If you want the TA's to look then please make a private piazza post, but depending on timing we may not be able to reply quickly (if you ask two days before the deadline, we may not get a chance to look until after the deadline hits)


The question I have wasn’t answered in this document.

Here’s how to get an answer: * If it’s a general question: ask as a follow-up in the Extra Credit Piazza post * If you want to brainstorm with others and get suggestions: create a public piazza post or ask in Slack. You can work with others to refine your ideas or get advice, just make sure the work in your notebook is your own. * If it’s specific to your project: create a private post but please use your judgement—we can’t guarantee that we’ll respond in a timely fashion, and while we would like to help everyone out with tips on their projects, we simply don’t have the ability due to end of semester activities. We would prefer that you continue making progress on the project while waiting for a reply fr

Updated: 2024-12-10