闪电代写 -代写CS作业_CS代写_Finance代写_Economic代写_Statistics代写_代码代做_IT代写_加急帮助

Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

Homework 1

DSO 530: Applied Modern Statistical Learning Methods

Spring 2023

Deadline. Thursday, Feb 9th, 4 pm, Los Angeles time.

Submission instruction. To submit your homework, please go to the Assessments folder on Blackboard and ﬁnd HW1. The submitted document should be in pdf format. Submission of other formats will incur a 0.5 point (the total is 4 for this HW) penalty. Before you submit, please double check if your document is readable. If you did not generate PDF documents before, please read the companion guide. You can submit your homework multiple times before the deadline, and the TA will grade the last submission. Note that the TA will grade the HW on 0.5 incremental deduction.

Late submission policy. All late submission should be sent to dso530. .cominstructor@gmail. Late submission will incur 1 point penalty within 24 hours after the deadline, and 2 points penalty between 24

and 48 hours. No submission after 48 hours of the deadline will be accepted . As we have a large class this year, I wish that everybody respects this policy.

Additional instructions for HW1. When the path to an answer involves coding, please show the Python

codes and proper output. Answering questions clearly and concisely is better than writing page-long convoluted responses. Discussion among students is strongly encouraged. But everyone should write up their own solution. If a student copies others’ homework or lets others to copy the homework, they

will receive 0 for this homework and face further penalty in the ﬁnal grade .

1. (i) Did you go over Lectures 1, 2a, 2b, Python tutorials 1 and 2, and Lecture 2Ex (and your notes). This homework, like every other homework, only covers a small part of our course materials. (ii) How many hours outside class time did you spend on DSO 530 each week on average so far? (an honest answer will help the instructor adjust the pace if necessary).

2. For a regression equation log y = 1 + 70 log x, how does changes in y associate with changes in x? (You don’t have to type latex math formulas; as long as your derivation is readable by the grader, it is ﬁne; you might need Python or a calculator to help get the answer).

3. Recall that in Lecture 2b, we regress medv on ptratio and rm using the Housing data. Repeat this regression, but standardize (as deﬁned in Tutorial 2) both features before running the regression. Compared with what you see in the Lecture, do you get a diﬀerent R2 , or do you get the same one?

4. With the Housing data, use the per capita crime rate (crim) as the response, and the other variables (including the medv variable) as the predictors. Report the R2 and adj-R2 . For which predictors can we reject the null hypothesis H〇 : βj = 0 (at the 5% level)? Then, regress crim on the signiﬁcant predictors ONLY; how do you interpret the slopes in the regression?

5. Randomly split the Housing data into two parts with 30% as test data. Use random_state = 2 in this split. Because this is a regression problem, you don’t want to use stratify = y part of the code from our Python tutorial. Regress medv on river and rm using the training data. Compute R2 on both the training data and the test data (i.e., in-sample R2 and out-of-sample R2 ). (Hint: if you use the sklearn package, computing the out-of-sample R2 just needs you to replace the arguments of the .score( . , .) that computes the in-sample R2 .)

6. (optional, not for grading) The out-of-sample R2 can be negative even if one uses the least squares method to ﬁt the linear model. Can you design a simulation setting where this phenomenon indeed occurs?

2023-02-23

Java

物理(Physical)

LINUX

C++

Python

Processing

sas

ios

maths

maple

C语言