Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

DSCI 510 Final Project

Final  project for this  course  is  divided  into three  sections:  Homework  3,  Homework  4  and Homework 5 (including presentation). Homework 4 & 5 will be explained letter. This document states the guidelines for Homework 3.

NOTE: Homework 3 is an ungraded homework but mandatory for everyone to submit. You will get feedback on your projects, which you can later incorporate into Homework 4 & 5.

Homework 3 (Ungraded):

Due Date: Monday, April 3, 2023

1. Objectives:

a.    Find three (3) data sets on the web that are of interest to you.

•    One must require scraping” (i.e., not available via external API)

•   One must be available via external public API. (You should be able to access it without a ton of trouble)

•   The third can be an API, scraped, or a database

NOTE: Sites must be such that they require automation to extract data; If you can just cut-and-paste the data, it’s not appropriate.

b.   Describe in a paragraph what analysis or presentation can be expected from the combined data.

2. Result expectations:

a.   A word document including the proper links to the data.

b.    Description of how the student plan to use the datasets and the type of

analysis/questions student would like to answer based on their data. Sample Projects:

1) https://drive.google.com/drive/folders/1ePko_27OWfcKy4WoapAHtv1GA2MMT8bJ?us p=share_link

2) https://drive.google.com/drive/folders/158q0zDPXgu-zsgs5YssE4- NtR_pnCyIx?usp=sharing

You can use any source of data on the web. Here are some sites that list available datasets/APIs if you need inspiration:

https://datasetsearch.research.google.com/

https://github.com/awesomedata/awesome-public-datasets