Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

ECE 608: Quantitative Methods in Biomedical Engineering

Assignment #3

OVERVIEW

The primary aim of this assignment is to give you some practice on various concepts covered during the            lectures. Its secondary aim is to familiarize you with R programming. There is only one part to this assignment:

1.   Four statistical analysis questions (5 marks each).

This assignment is graded out of 20 marks. It contributes 10% to your final course grade.

SUBMISSIONS INSTRUCTIONS

Submit your completed response document through the LEARN portal. There is a Dropbox folder for this    assignment, under Submit ➔ Dropbox ➔ Problem Set #3. The deadline for submission is  10pm on July 25, 2022 (Monday).

You are required to submit one R notebook file (.Rmd) which contains the codes and your responses. Please    make sure your code will be able to run without any modifications; dependent libraries should be loaded at the beginning of the R notebook. You should also comment your code to clearly explain each of your steps in case your code does not compile properly.

Statistical Analysis

(20 marks in total, 5 marks each)

1.   Richard has developed a new non-invasive measurement tool for brain blood flow (BBF). The research hypothesis that he has is that the new measurement tool performs the same as the current standard. To test this research hypothesis, he designs a study that measures the BBF using MRI, transcranial Doppler, and the new tool. The BBF derived from MRI serves as a reference, whereas transcranial Doppler is the current standard. The collected data are included in Q1.csv where BBF is the reference value for each subject, MCA is the flow rate measured using transcranial Doppler and CCA is the flow rate measured using the new tool. Richard decides to conduct a correlation analysis to statistically test the hypothesis. The null hypothesis is that the correlation coefficient between BBF and CCA is the same as the one between BBF and MCA. With α = 0.05

(i)   Show that flow rate measured using the new tool (CCA) is statistically correlated with the reference flow rate (BBF)

(ii)  Show that flow rate measured using transcranial Doppler (MCA) correlates with the reference flow

rate (BBF) as well

(iii) By comparing the correlation coefficients derived in (i) and (ii), show that the new tool performs the same as the current standard (i.e. RCCA-BBF = RMCA-BBF are the same).

2.  A company is developing a new tool that can non-invasively quantify arterial elasticity in terms of stiffness. To calibrate this new device, the company researchers measure the arterial stiffness from 20 cadavers, and their corresponding arterial elasticity was measured using the tensile test. The raw data can be found in Q2.csv. Simple linear regression is to be performed to calibrate the device and the null hypothesis for the regression is that the regression coefficient between arterial stiffness from the new device (non-destructive) and elasticity from the destructive test is zero. With α = 0.05

(i)   Perform simple linear regression and show that the relationship between the arterial stiffness           (nondestructive as the dependent variable) and elasticity (destructive as the independent variable) is linear

(ii)  Check the following model assumptions: 1. Linearity; 2. Equal residual spreads; and 3. Normally

distributed residuals

(iii) Derive the confidence and prediction interval for an artery that has an elasticity of 30.

3.   Jason wants to know the relationship between the amount of body fat, triceps skinfold thickness, thigh circumference, and midarm circumference. To achieve this objective, he randomly recruits a group of 20 subjects and measures their body fat content and other parameters as mentioned. The file Q3.csv contains the raw data for all 20 subjects. Try to construct a model that best describes the relationship between body fat content and measured parameters. The null hypothesis is that there is no relationship between body fat (fat), triceps skinfold thickness (thickness), thigh circumference (thigh), and midarm circumference (midarm). With α = 0.05

(i)   Regress a multiple regression model withfat as the dependent variable and the rest as independent variables (i.e., full model); provide a summary table on statistics

(ii)  Regress a reduced model with only triceps skinfold thickness and thigh circumference as

independent variables; is this reduced model statistically different from the full model? (iii) Check all the model assumptions for the reduced model in (ii)

4.   To meet the increasing demand for hay, an agricultural company wants to improve its hay yield. The company first needs to better understand the relationship between hay yield and other soil parameters. The company researchers have collected various soil parameters and hay yields and the parameters are summarized in the table below:

Parameter

Description

Parameter

Description

Hay Yield

Alfalfa-bromegrass in tons per acre per year

Organic Matter

The percent of plant and animal residue in the soil in various stages of                decomposition

Low Slope

The low range of incline of the surface of a soil

Organic Matter Range

The range of percent plant and animal residue in the soil in various stages of decomposition.

High Slope

The high range of incline of the surface of a soil

Ph

A measure of surface horizon alkalinity of a soil

acidity or

Flood Freq

The temporary covering of soil with water from overflowing streams and runoff from adjacent slopes

Kw factor

Indicator of the susceptibility of a soil to sheet and rill erosion by water

 

Water

The average capacity of soils to hold water available for use by most plants

 

t factor

An estimate of the maximum average   annual rate of soil erosion by water that can occur without affecting crop            productivity over a sustained period

Surface Density

Average bulk density of surface

Clay

The average % content of mineral soil particles <0.002 mm in diameter

Subsoil density

Average bulk density of subsoil: the weight of soil (oven dry) per unit volume

Sand

The average percentage of total sand in the surface horizon

Depth

Depth midpoint (0-60) to a discontinuity in texture or composition

Topsoil

Thickness of topsoil (the upper part of the soil) which is the most favorable   material for plant growth.

Permeability

The quality of the soil that enables water to move through the profile

Topsoil range

Range of topsoil thickness

Drainage

The frequency and duration of periods of saturation or partial saturation

 

 

The raw data are stored in Q4.csv. Follow the instructions below to construct a regression model to explain hay yield with the variables above. With α = 0.1

(i)   Perform a step-down regression to construct a regression model that uses the fewest IVs to explain the hay yield

(ii)  Repeat the step-wise regression but with a step-up approach; compare the AIC values from both step-up and step-down models and check if these two models are statistically different

(iii) Standardize the step-up model and determine which variable causes most change in hay yield.