DSCI 510 Final Project Homework 3
DSCI 510 Final Project
The final project for this course is divided into three sections: Homework 3, Homework 4 and Homework 5 (including presentation). Homework 4 & 5 will be explained later. This document states the guidelines for Homework 3.
NOTE: Homework 3 is ungraded, but everyone must submit it. You will get feedback on your projects, which you can later incorporate into Homework 4 & 5.
Homework 3 (Ungraded):
Due Date: Monday, April 3, 2023
1. Objectives:
a. Find three (3) data sets on the web that are of interest to you.
• One must require “scraping” (i.e., not available via external API)
• One must be available via external public API. (You should be able to access it without a ton of trouble)
• The third can be an API, scraped, or a database
NOTE: Sites must require automation to extract the data; if you can just cut-and-paste the data, the site is not appropriate.
b. Describe in a paragraph what analysis or presentation can be expected from the combined data.
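To make the difference between the source types concrete, here is a minimal Python sketch of the three steps the objectives imply: pulling rows out of scraped HTML, reading records from an API's JSON response, and combining the two on a shared key. Everything specific in it is an assumption for illustration only: the sample HTML and JSON strings, the field names (`city`, `parks`, `population`), and the helper names are hypothetical stand-ins, and a real project would first download the content (for example with `urllib.request` or `requests`) and would likely need site-specific parsing.

```python
import json
from html.parser import HTMLParser

# Hypothetical stand-ins for downloaded content (not real data).
SAMPLE_HTML = """
<table>
  <tr><th>City</th><th>Parks</th></tr>
  <tr><td>Springfield</td><td>12</td></tr>
  <tr><td>Shelbyville</td><td>7</td></tr>
</table>
"""
SAMPLE_JSON = (
    '{"results": ['
    '{"city": "Springfield", "population": 30000}, '
    '{"city": "Shelbyville", "population": 21000}]}'
)


class TableCellParser(HTMLParser):
    """Collect the text of every <td> cell, grouped by table row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._in_cell = False

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])
        elif tag == "td":
            if not self.rows:  # tolerate a <td> that appears outside any <tr>
                self.rows.append([])
            self._in_cell = True
            self.rows[-1].append("")

    def handle_endtag(self, tag):
        if tag == "td":
            self._in_cell = False

    def handle_data(self, data):
        if self._in_cell:
            self.rows[-1][-1] += data.strip()


def scrape_rows(html):
    """Scraping: extract table rows from raw HTML (header-only rows are dropped)."""
    parser = TableCellParser()
    parser.feed(html)
    return [row for row in parser.rows if row]


def parse_api_records(payload):
    """API: the response is already structured, so decoding the JSON is enough."""
    return json.loads(payload)["results"]


def combine(scraped_rows, api_records):
    """Join both sources on the city name, keeping only cities present in both."""
    population = {rec["city"]: rec["population"] for rec in api_records}
    return [
        {"city": city, "parks": int(parks), "population": population[city]}
        for city, parks in scraped_rows
        if city in population
    ]


if __name__ == "__main__":
    rows = scrape_rows(SAMPLE_HTML)
    records = parse_api_records(SAMPLE_JSON)
    for entry in combine(rows, records):
        print(entry)
```

The sketch uses only the standard library so that it runs without extra installs; a library such as BeautifulSoup is a common alternative for the scraping step. The join in `combine` is the kind of link between datasets that the paragraph for objective b should describe.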
2. Result expectations:
a. A Word document including the proper links to the data.
b. A description of how the student plans to use the datasets and the type of analysis/questions the student would like to answer based on their data.
Sample Projects:
1) https://drive.google.com/drive/folders/1ePko_27OWfcKy4WoapAHtv1GA2MMT8bJ?usp=share_link
2) https://drive.google.com/drive/folders/158q0zDPXgu-zsgs5YssE4-NtR_pnCyIx?usp=sharing
You can use any source of data on the web. Here are some sites that list available datasets/APIs if you need inspiration:
https://datasetsearch.research.google.com/
https://github.com/awesomedata/awesome-public-datasets
2023-04-14