关键词 > statistics代写
Homework #3.
发布时间:2023-10-26
Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit
Homework #3.
Due online (NYU Brightspace – Assignments) on October 26 at 9:30AM. Late homework will NOT be accepted. No exceptions.
Instructions:
(1) Must be submitted as ONE SINGLE DOCUMENT
Acceptable formats: Word (.doc, .docx), PDF (.pdf), image (.jpg, .png) (2) Show your questions in order.
1. (20 points) Each of the following scatterplots shows a cluster of points and one “stray” point. For each, answer these questions: 1) Does it have high leverage, a large residual, or both? 2) Do you think that point is an influential point? 3) If that point were removed,
would the correlation become stronger or weaker? 4) If that point were removed, would the slope of the regression line increase or decrease?
2. (10 points) According to Pew Research, the contact rate (probability of contacting a selected household) was 90% in 1997 and 62% in 2012. However, the cooperation rate (probability of someone at the contacted household agreeing to be interviewed) was 43% in 1997 and dropped to 14% in 2012.
(a) What is the probability (in 2012) of obtaining an interview with the next household on the sample list? (To obtain an interview, an interviewer must both contact the household and then get agreement for the interview.)
(b) Was it more likely to obtain an interview from a randomly selected household in 1997 or in 2012?
3. (10 points) The American Red Cross says that about 45% of the U.S. population has Type O blood, 40% Type A, 11% Type B, and the rest Type AB.
(a) Someone volunteers to give blood. What is the probability that this donor 1) has Type AB blood?
2) has Type A or Type B?
3) is not Type O?
(b) Among four potential donors, what is the probability that
1) all are Type O?
2) no one is Type AB?
3) they are not all Type A?
4) at least one person is Type B?
4. (20 points) 56% of all American workers have a workplace retirement plan, 68% have health insurance, and 49% have both benefits. We select a worker at random.
(a) What’s the probability he has neither employer-sponsored health insurance nor a retirement plan?
(b) What’s the probability he has health insurance if he has a retirement plan?
(c) Are having health insurance and a retirement plan independent events? Explain. (d) Are having these two benefits disjoint events? Explain.
5. (15 points) An airline offers discounted “advance-purchase” fares to customers who buy tickets more than 30 days before travel and charges “regular” fares for tickets
purchased during those last 30 days. The company has noticed that 60% of its customers take advantage of the advance-purchase fares. The “no-show” rate among people who paid regular fares is 30%,but only 5% of customers with advance-purchase tickets are no-shows.
(a) What percent of all ticket holders are no-shows?
(b) What’s the probability that a customer who didn’t show had an advance-purchase ticket?
(c) Is being a no-show independent of the type of ticket a passenger holds? Explain.
6. (25 points) (Graphs must be printed or scanned from Excel)
To develop a way to estimate the weight of an alligator, the wildlife researchers measured the lengths and weights of several captured alligators. Then they used those data (on next page) to develop a model enabling them to estimate an alligator’s weight from its length – something they can guess from a safe distance! Officials hope to use this model to
identify alligators that should be relocated because they have grown so large as to pose a threat to humans.
Weight (pounds) |
Length (inches) |
83 |
86 |
70 |
88 |
61 |
72 |
54 |
74 |
44 |
61 |
106 |
90 |
84 |
89 |
39 |
68 |
42 |
76 |
197 |
114 |
102 |
90 |
57 |
78 |
130 |
94 |
51 |
74 |
640 |
147 |
28 |
58 |
80 |
86 |
110 |
94 |
33 |
63 |
90 |
86 |
36 |
69 |
38 |
72 |
366 |
128 |
84 |
85 |
80 |
82 |
(a) Use Excel, create a linear model to estimate an alligator’s weight from its length. Display the scatterplot together with the linear regression equation and R2.
(b) Use Excel, create the residual plot for the above regression.
(c) Discuss the appropriateness of the linear model.
(d) Use Excel, create a more appropriate model using power of 0 in the “ladder of
powers”. Display the scatterplot with the linear regression equation and R2, and create the residual plot for this new model. Justify why this new model is better.
(e) Using this new model, predict the weight of an alligator that is 100 inches long.