关键词 > FIT5196
FIT5196 - Data wrangling
发布时间:2023-11-02
Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit
FIT5196 - Data wrangling
Overview
This unit introduces tools and techniques for data wrangling. It will cover the problems that prevent raw data from being effectively used in analysis and the data cleansing and pre-processing tasks that prepare it for analytics. These include, for example, the handling of bad and missing data, data integration and initial feature selection. It will also introduce text mining and web analytics. Python and the Pandas environment will be used for implementation.
Offerings |
S1-01-CLAYTON-FLEXIBLE Location: Clayton Teaching period: First semester Attendance mode: A combination of on-campus and online teaching (FLEXIBLE) |
S2-01-CLAYTON-FLEXIBLE Location: Clayton Teaching period: Second semester Attendance mode: A combination of on-campus and online teaching (FLEXIBLE) |
S2-01-MALAYSIA-ON-CAMPUS Location: Malaysia Teaching period: Second semester Attendance mode: Teaching activities are on-campus (ON-CAMPUS) |
Requisites Prerequisite
|
||
|
|
Rules
Enrolment Rule
Prerequisites:
For C6007 students who commenced in 2020: None
Contacts |
Chief Examiner(s) Dr Jackie Rong
Email: [email protected] Offering(s):
|
Learning outcomes |
On successful completion of this unit, you should be able to: |
1. Parse data in the required format; |
2. Assess the quality of data for problem identification; |
3. Resolve data quality issues ready for the data analysis process; |
4. Integrate data sources for data enrichment; |
5. Document the wrangling process for professional reporting; |
6. Write program scripts for data wrangling processes. |
Teaching approach
Active learning
Assessment Assessment 1 Value %: 35 |
Assessment 2 Value %: 35 |
Assessment 3 Value %: 30 |
Scheduled teaching activities Applied sessions Total hours: 24 hours Offerings:
|
Lectures Total hours: 24 hours Offerings:
|
Seminars Total hours: 24 hours Offerings:
|
Workload requirements
Workload
Minimum total expected workload to achieve the learning outcomes for this unit is 144 hours per semester typically comprising a mixture of scheduled online and face to face learning activities and independent study. Independent study may include associated reading and preparation for scheduled teaching activities.
Learning resources |
Technology resources
The Programming environments: Python 3 has been added. Python 3 -
https://www.python.org/downloads/ and Jupyter Notebook - https://jupyter.org/
https://www.anaconda.com/distribution/