Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

All Engineering Diplomas

Data Visualisation and Analytics

(ESE1008)

Project Title: Wildfires

Data source:https://www.kaggle.com/datasets/rtatman/188-million-us-wildfires

Background:

This dataset contains a spatial database of wildfires that occurred in the United States from 1992 to 2015. It is the third update of a publication originally generated to support the national Fire Program Analysis (FPA) system. The wildfire records were acquired from the reporting systems of federal, state, and local fire organizations. The following core data elements were required for records to be included in this data publication: discovery date, final fire size, and a point location at least as precise as Public Land Survey System (PLSS) section (1-square mile grid). The resulting product, referred to as the Fire Program Analysis fire-occurrence database (FPA FOD), representing a total of 140 million acres burned during the 24-year period.

Objectives:

The objectives of this project are:

1.  To pre-process the data to ensure data is cleaned and ready for next stage of analysis.

2.  To perform exploratory data analysis to gain insights into the data. Possible operations are gathering of statistical data and perform necessary data transformations.

3.  To   generate   data   visualisation   for   interesting   discovery   of  variables’   patterns, correlations etc. Possible operations are plot of frequency of occurrence on fire over the years. You may build an interactive dashboard for better visulaization on the impact of fire.

4.  To fit the data into a Logistic regression model to predict the Fire size class. Discuss the accuracy of the model.

Variable Description

Variable

Description

SOURCE_SYSTEM_TYPE

Type of source database or system that the

record was drawn from (federal, nonfederal, or interagency)

NWCG_REPORTING_AGENCY

Active National Wildlife Coordinating Group (NWCG) Unit Identifier for the agency

preparing the fire report

BIA = Bureau of Indian Affairs,

BLM = Bureau of Land Management,

BOR = Bureau of Reclamation,

DOD = Department of Defense,

DOE = Department of Energy,

FS = Forest Service,

FWS = Fish and Wildlife Service,

IA = Interagency Organization,

NPS = National Park Service,

ST/C&L = State, County, or Local

Organization,

TRIBE = Tribal Organization

NWCG_REPORTING_UNIT_ID

Active NWCG Unit Identifier for the unit preparing the fire report.

FIRE_YEAR

Calendar year in which the fire was discovered or confirmed to exist.

STAT_CAUSE_DESCR

Cause of the fire.

FIRE_SIZE_CLASS

Classification of the fire size

OWNER_DESCR

Name of primary owner or entity responsible  for managing the land at the point of origin of the fire at the time of the incident.

STATE

Two-letter alphabetic code for the state in

which the fire burned (or originated), based on the nominal designation in the fire report.

DISCOVERY_DATE2

Date on which the fire was discovered or confirmed to exist.

FIRE_SIZE

Estimate of acres within the final perimeter of the fire.

LATITUDE

Latitude (NAD83) for point location of the fire (decimal degrees).

LONGITUDE

Longitude (NAD83) for point location of the fire (decimal degrees).

Table 1

Data Assigned:

S/N

Data File

Assigned (Tick)

1

WildFires_Data1.csv

 

2

WildFires_Data2.csv

 

3

WildFires_Data3.csv

 

4

WildFires_Data4.csv

 

5

WildFires_Data5.csv