ESE1008 Data Visualisation and Analytics
All Engineering Diplomas
Data Visualisation and Analytics
(ESE1008)
Project Title: Wildfires
Data source: https://www.kaggle.com/datasets/rtatman/188-million-us-wildfires
Background:
This dataset contains a spatial database of wildfires that occurred in the United States from 1992 to 2015. It is the third update of a publication originally generated to support the national Fire Program Analysis (FPA) system. The wildfire records were acquired from the reporting systems of federal, state, and local fire organizations. To be included in this data publication, a record required the following core data elements: discovery date, final fire size, and a point location at least as precise as a Public Land Survey System (PLSS) section (1-square-mile grid). The resulting product, referred to as the Fire Program Analysis fire-occurrence database (FPA FOD), represents a total of 140 million acres burned during the 24-year period.
Objectives:
The objectives of this project are:
1. To pre-process the data so that it is clean and ready for the next stage of analysis.
2. To perform exploratory data analysis to gain insights into the data. Possible operations include gathering summary statistics and performing any necessary data transformations.
3. To generate data visualisations that reveal interesting patterns and correlations among the variables. A possible visualisation is a plot of the frequency of fire occurrence over the years. You may build an interactive dashboard to better visualise the impact of fire.
4. To fit the data to a logistic regression model that predicts the fire size class (FIRE_SIZE_CLASS), and to discuss the accuracy of the model.
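One possible way to approach Objectives 1, 2 and 4 is sketched below, assuming pandas and scikit-learn are available. The choice of predictor columns, the 75/25 train/test split, and the helper names `clean`, `fires_per_year` and `fit_and_score` are illustrative assumptions, not requirements of this brief. FIRE_SIZE is deliberately left out of the predictors, because FIRE_SIZE_CLASS is derived from it and including it would make the prediction trivial.

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Assumed feature selection; other columns from Table 1 could be used instead.
NUMERIC = ["FIRE_YEAR", "LATITUDE", "LONGITUDE"]
CATEGORICAL = ["STAT_CAUSE_DESCR", "STATE"]
TARGET = "FIRE_SIZE_CLASS"


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Objective 1: drop duplicate rows and rows missing any modelling column."""
    cols = NUMERIC + CATEGORICAL + [TARGET]
    return df.drop_duplicates().dropna(subset=cols).reset_index(drop=True)


def fires_per_year(df: pd.DataFrame) -> pd.Series:
    """Objectives 2-3: number of fire records per FIRE_YEAR, indexed by year."""
    return df.groupby("FIRE_YEAR").size()


def fit_and_score(df: pd.DataFrame, seed: int = 0) -> float:
    """Objective 4: fit a logistic regression and return held-out accuracy."""
    X = df[NUMERIC + CATEGORICAL]
    y = df[TARGET]
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, random_state=seed
    )
    # Scale numeric columns and one-hot encode categorical ones.
    prep = ColumnTransformer([
        ("num", StandardScaler(), NUMERIC),
        ("cat", OneHotEncoder(handle_unknown="ignore"), CATEGORICAL),
    ])
    model = make_pipeline(prep, LogisticRegression(max_iter=1000))
    model.fit(X_train, y_train)
    return accuracy_score(y_test, model.predict(X_test))
```

With an assigned file, this could be used as `df = clean(pd.read_csv("WildFires_Data1.csv"))`, followed by `fires_per_year(df).plot(kind="bar")` for a yearly frequency chart (this requires matplotlib) and `fit_and_score(df)` for the accuracy figure to discuss. Because small fires are far more common than large ones, overall accuracy alone may look better than the model really is, so a per-class breakdown is worth reporting alongside it.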
Variable | Description
SOURCE_SYSTEM_TYPE | Type of source database or system that the record was drawn from (federal, nonfederal, or interagency).
NWCG_REPORTING_AGENCY | Active National Wildfire Coordinating Group (NWCG) Unit Identifier for the agency preparing the fire report (BIA = Bureau of Indian Affairs, BLM = Bureau of Land Management, BOR = Bureau of Reclamation, DOD = Department of Defense, DOE = Department of Energy, FS = Forest Service, FWS = Fish and Wildlife Service, IA = Interagency Organization, NPS = National Park Service, ST/C&L = State, County, or Local Organization, TRIBE = Tribal Organization).
NWCG_REPORTING_UNIT_ID | Active NWCG Unit Identifier for the unit preparing the fire report.
FIRE_YEAR | Calendar year in which the fire was discovered or confirmed to exist.
STAT_CAUSE_DESCR | Cause of the fire.
FIRE_SIZE_CLASS | Classification of the fire size.
OWNER_DESCR | Name of primary owner or entity responsible for managing the land at the point of origin of the fire at the time of the incident.
STATE | Two-letter alphabetic code for the state in which the fire burned (or originated), based on the nominal designation in the fire report.
DISCOVERY_DATE2 | Date on which the fire was discovered or confirmed to exist.
FIRE_SIZE | Estimate of acres within the final perimeter of the fire.
LATITUDE | Latitude (NAD83) for point location of the fire (decimal degrees).
LONGITUDE | Longitude (NAD83) for point location of the fire (decimal degrees).
Table 1
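Table 1 describes FIRE_SIZE_CLASS only as a classification of the fire size. In the published FPA FOD data the classes are defined by acreage: A is greater than 0 and up to 0.25 acres, B is 0.26 to 9.9, C is 10.0 to 99.9, D is 100 to 299, E is 300 to 999, F is 1000 to 4999, and G is 5000 acres or more. The sketch below maps a FIRE_SIZE value to its class under those definitions. The function name `fire_size_class` is hypothetical, and the treatment of values that fall between the published ranges (for example 9.95 acres) is an assumption.

```python
# Upper limits in acres (exclusive) for classes B to F; larger fires are class G.
# Assumption: values between the published ranges fall into the lower class.
_UPPER_LIMITS = [(10, "B"), (100, "C"), (300, "D"), (1000, "E"), (5000, "F")]


def fire_size_class(acres: float) -> str:
    """Return the fire size class letter (A to G) for a FIRE_SIZE in acres."""
    if acres <= 0:
        raise ValueError("fire size must be greater than 0 acres")
    if acres <= 0.25:
        return "A"
    for upper, label in _UPPER_LIMITS:
        if acres < upper:
            return label
    return "G"
```

A mapping like this may be useful during pre-processing to check that the FIRE_SIZE and FIRE_SIZE_CLASS columns of an assigned file agree with each other.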
Data Assigned:
No. | Data File | Assigned (Tick)
1 | WildFires_Data1.csv |
2 | WildFires_Data2.csv |
3 | WildFires_Data3.csv |
4 | WildFires_Data4.csv |
5 | WildFires_Data5.csv |
2023-07-22