DATA2001: Data Science, Big Data and Data Variety
Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit
DATA2001: Data Science, Big Data and Data Variety
This course focuses on methods and techniques to efficiently explore and analyse large data collections. Where are hot spots of pedestrian accidents across a city? What are the most popular travel locations according to user postings on a travel website? The ability to combine and analyse data from various sources and from databases is essential for informed decision making in both research and industry. Students will learn how to ingest, combine and summarise data from a variety of data models which are typically encountered in data science projects, such as relational, semi-structured, time series, geospatial, image, text. As well as reinforcing their programming skills through experience with relevant Python libraries, this course will also introduce students to the concept of declarative data processing with SQL, and to analyse data in relational databases. Students will be given data sets from, eg. , social media, transport, health and social sciences, and be taught basic explorative data analysis and mining techniques in the context of small use cases. The course will further give students an understanding of the challenges involved with analysing large data volumes, such as the idea to partition and distribute data and computation among multiple computers for processing of 'Big Data'.
Details
Academic unit |
Computer Science |
|||
Unit code |
DATA2001 |
|
||
Unit name |
Data Science, Big Data and Data Variety |
|
|
|
Session, year ? |
Semester 1, 2022 |
|
|
|
Location |
Camperdown/Darlington, Sydney |
Credit points |
6 |
Enrolment rules
Prohibitions ? |
DATA2901 |
Prerequisites ? |
DATA1002 OR DATA1902 OR INFO1110 OR INFO1910 OR INFO1903 OR INFO1103 or ENGG1810 |
Corequisites ? |
None |
Available to study abroad and exchange students |
Yes |
Teaching staff and contact details
Coordinator |
Uwe Roehm, [email protected] |
Lecturer(s) |
Uwe Roehm , [email protected] |
Type |
Description |
Weight |
Due |
Length |
Final exam (Take- home short release) D |
Final Examination Final exam; online, short- release and timed |
55% |
Formal exam period |
2 hours |
Outcomes assessed: LO1 LO7 LO6 LO5 LO4 |
||||
Online task |
Weekly Homework weekly online quiz in Canvas |
10% |
Multiple weeks |
ca. 20 min each week |
Outcomes assessed: LO2 LO4 LO5 LO6 LO7 |
Type |
Description |
Weight |
Due |
Length |
|
Small test |
SQL Test SQL online test; mid- semester. |
15% |
Please select a valid week from the list below Due date: 06 Apr 2022 Closing date: 06 Apr 2022 |
1 hour |
|
Outcomes assessed: LO1 LO4 LO3 |
|||||
Assignment |
Practical Assignment Practical data integration and data analysis assignment. |
20% |
Week 12 |
4 weeks |
|
Outcomes assessed: LO1 LO4 LO3 LO2 |
|||||
group assignment ? |
D = Type D final exam
|
Homework: Short weekly homework quizzes in Canvas. They are designed to help you
review your learning of each week’s topic.
SQL tutorials and SQL Test: Students work through weekly online tutorials introducing
increasingly sophisticated usage of SQL. The SQL tutorials provide simple feedback and allow multiple attempts, and example solutions are available after the submission deadline has passed. Solutions are provided for each week, and the topics are assessed in a mid-semester SQL test.
Practical Assignment: Students work in teams on a larger data integration and data
analysis task, where some supplied datasets have to be combined with additional data researched by students. The final submission consists of the source code artifacts developed by the teams, plus a short report of their findings, and a group demo during the labs of Week 12.
Final Examination: Understanding of all of this unit’s material is reviewed. Detailed information for each assessment can be found on Canvas.
Assessment criteria
The University awards common result grades, set out in the Coursework Policy 2014 (Schedule 1).
As a general guide, a high distinction indicates work of an exceptional standard, a distinction a
very high standard, a credit a good standard, and a pass an acceptable standard.
2022-02-26