Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

Academic Year 2020/21

BNM861

Data Mining and Web Analytics

Coursework Title:

Application of Data Mining and Web Analytics to Real Data

Task Details/Description:

This is an individual assignment focuses on using data mining and text mining algorithms to develop a model to solve a typical business problem in a real case study. This is an individual assignment and has two parts.

Part 1 (60%)

1. Select a company or industry sector of interest to you. Your first task is to try to gain as much insight as possible into the business goals and explain how data mining / text mining can be used to analysis data within the selected company/industry. Explain how data mining could help this case. (10%)

2. Find a source of numerical data for that topic, download a relevant dataset, clean the data and prepare a data quality report. (20%)

3. Using data sources mentioned in the class for unstructured data, collect related data to the selected topic, use web analytics to summaries your findings. (15%).

4. Use data mining methods presented in the class, visualize the data and prepare a report on descriptive analysis. (15%) 

Part 2 (40%)

5. Identify a specific problem within this industry that can be addressed using the data mining / text mining methods. Use appropriate data mining methods to solve the problem, analyse the results from both structured and unstructured data sources and decide whether they are useful for making a business decision and if so what decision or type of decision would you propose. You will need to use various data mining / text mining methods in the IBM Modeler software. (40%)

Note:

1. When you decide on your industry/topic, contact the instructors to have this agreed and to get relevant advice before you start analysing the data. This data set should be different with the one used in your coursework during the class.

2. Coursework submitted must include code and screen shots of the software used as an appendix. You should also upload the stream you developed in IBM-SPSS Modeler and the dataset to system. The description of the problem, method and analysis should be no more than 3000 words long excluding tables and figures, etc.

Module Learning Outcomes Assessed:

1. Know the fundamentals of business intelligence and its application to business decision making

2. Demonstrate an understanding of the data and resources available on the web of relevance to business intelligence and be able to access such structured and unstructured data

3. Practice with leading data mining methods and their applications to real-world problems

4. Apply the practical experience and the theoretical insight needed to reveal patterns and valuable information hidden in large data sets;