ECOM7126: Machine Learning for Business and E-Commerce (2025-26)
MSc in E-Commerce and Internet Computing
Assignment 1 – Medical Insurance Claims
AXE Medical is a medium-sized insurance company in Hong Kong that wants to use machine learning to predict health insurance claims from claim data it has collected over the past few years.
The dataset consists of 1,300 past successful claim cases in .csv format. The data dictionary of the dataset is as follows:
| Field | Description |
|---|---|
| Client # | client number (anonymized) |
| age | age of the insured |
| sex | gender of the insured |
| home | home district of the insured: Hong Kong / Kowloon / South NT / North NT |
| bmi | body mass index (weight in kg / (height in m)²) |
| children | number of dependent children |
| smoker | yes / no |
| drinking | no (non-drinker) / occasional / frequent |
| past claims | amount in HKD of total past insurance claims |
Note: NT = New Territories
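The data dictionary mixes numeric fields (age, bmi, children, past claims) with categorical ones (sex, home, smoker, drinking), and a linear model needs the categorical fields turned into numbers. The following is a minimal sketch of one way to do that with pandas, not a prescribed method. It assumes the CSV column names match the dictionary exactly. The four rows in `SAMPLE_CSV` are invented for illustration and do not come from the real dataset, and `load_and_encode` is a hypothetical helper name.

```python
import io

import pandas as pd

# Invented rows that follow the data dictionary; NOT taken from the real dataset.
SAMPLE_CSV = """Client #,age,sex,home,bmi,children,smoker,drinking,past claims
1,34,female,Kowloon,22.5,1,no,occasional,12000
2,51,male,Hong Kong,29.1,2,yes,frequent,48000
3,27,male,South NT,24.3,0,no,no,6000
4,45,female,North NT,31.0,3,yes,no,39000
"""


def load_and_encode(csv_source):
    """Read the claims table and turn categorical fields into numeric columns."""
    df = pd.read_csv(csv_source)

    # The client number is an anonymized identifier, so it carries no signal.
    df = df.drop(columns=["Client #"])

    # Binary field: map yes/no to 1/0.
    df["smoker"] = df["smoker"].map({"yes": 1, "no": 0})

    # Multi-level fields: one-hot encode, dropping one level per field so the
    # dummy columns are not perfectly collinear with the intercept.
    df = pd.get_dummies(
        df, columns=["sex", "home", "drinking"], drop_first=True, dtype=int
    )
    return df


encoded = load_and_encode(io.StringIO(SAMPLE_CSV))
print(encoded.dtypes)
```

With the real file, `csv_source` would be the path to the provided .csv. Calling `encoded.describe()` and `encoded.isna().sum()` at that point is a quick way to spot outliers and missing values before deciding which features to keep.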
Deliverable:
1. A report in PDF format.
2. The Colab notebook in .ipynb format that you used to produce your results, containing your Python code with comments and notes explaining what you are doing and why.
You should include the following in your report:
1. An account of what you have done in investigating and transforming your data and why.
2. Use the dataset provided to train a Linear Regression model and build a prediction engine. (Hint: You may not need to include all features in the dataset. Justify your decision.) DO NOT use other models for this assignment; you are asked to use Linear Regression only.
3. Any observations from your project that you would like to report to the management of the company.
4. If you have used any AI tools to complete this assignment, you are required to clearly state how and what tools you have used.
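Item 2 above asks for a Linear Regression model and a prediction engine. The sketch below shows the general shape of such a pipeline: fit by ordinary least squares, predict, and score on held-out rows. It rests on stated assumptions and is not a model answer. The data are synthetic, the coefficients used to generate them are made up, and the choice of age, bmi and smoker as features is arbitrary. The helper names `fit_linear_regression`, `predict` and `r_squared` are hypothetical. The fit uses NumPy's least-squares solver, and scikit-learn's `LinearRegression` should give the same coefficients on the same data.

```python
import numpy as np


def fit_linear_regression(X, y):
    """Ordinary least squares fit; returns (intercept, coefficients)."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    # Prepend a column of ones so the first fitted weight is the intercept.
    design = np.column_stack([np.ones(len(X)), X])
    weights, *_ = np.linalg.lstsq(design, y, rcond=None)
    return weights[0], weights[1:]


def predict(intercept, coefs, X):
    """Prediction engine: apply the fitted linear equation to new rows."""
    return intercept + np.asarray(X, dtype=float) @ coefs


def r_squared(y_true, y_pred):
    """Coefficient of determination on a set of rows."""
    y_true = np.asarray(y_true, dtype=float)
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return 1.0 - ss_res / ss_tot


# Synthetic stand-in for the claims data (assumption: NOT the real dataset).
rng = np.random.default_rng(0)
n = 200
age = rng.integers(18, 65, size=n)
bmi = rng.normal(25, 4, size=n)
smoker = rng.integers(0, 2, size=n)
X = np.column_stack([age, bmi, smoker])
# Made-up generating equation plus noise, used only so there is something to fit.
y = 1000 + 250 * age + 300 * bmi + 20000 * smoker + rng.normal(0, 500, size=n)

# Hold out the last 25% of rows to check how the model generalizes.
split = int(0.75 * n)
intercept, coefs = fit_linear_regression(X[:split], y[:split])
test_r2 = r_squared(y[split:], predict(intercept, coefs, X[split:]))
print("intercept:", intercept, "coefficients:", coefs, "test R^2:", test_r2)
```

Comparing the held-out score across different feature subsets is one way to support the justification the hint asks for when some features are left out.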
2026-03-02