闪电代写 -代写CS作业_CS代写_Finance代写_Economic代写_Statistics代写_代码代做_IT代写_加急帮助

Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

STATS 606: GRADIENT DESCENT PROBLEM SET

This problem set is due at noon ET on Feb 17, 2023. Please upload a PDF ﬁle of your solutions and a ZIP ﬁle containing (properly commented) code to Canvas that reproduces any computer output in the solutions. You are encouraged to collaborate on problem sets with your classmates, but the ﬁnal write-up (including any code) must be your own.

1. Implementing gradient descent. Developing good implementation of optimization algo- rithms is not easy. Your task this week is to develop a good implementation of gradient descent. As you shall see, the key to a good implementation of gradient descent (or any line search methods) is a good line search.

(a) Implement gradient descent with a backtracking line search. Use your implementation to minimize

(1.1) f (s1 . s2 ) 全 (1 > s1 )2 + 100(s2 > s1(2))2 n

starting from the initial point (2. 2). Count cost function evaluations, and plot the convergence behavior (e .g. plot suboptimality gap or norm of gradient vs cost function evaluations) of your implementation.

Counting cost function evaluations: Implement the cost function in a separate subroutine that accepts a vector s e R2 and returns a tuple (f (s). ,f (s)). For example, in Python, you could implement the cost function as

def costF(x):

fx = # evaluate cost function value

Dfx = # evaluate cost function gradient

return fx, Dfx

Each time you call this subroutine counts as one cost function evaluation. To minimize cost function evaluations, you can cache previous cost function evaluations.

(b) Implement gradient descent with Barzilai–Borwein step sizes:

st+1 ← st > ηt,f (st).

ft(T)zt

ηt ←

where ft 全 st > st_1 and zt 全 ,f (st) > ,f (st_1 ). Show that the Barzilai-Borwein step size is more aggressive than the theoretically justiﬁed step size; i. e . ηt - . Use your implementation to minimize (1.1) starting from the initial point (2. 2). Count cost function evaluations and plot the convergence behavior.

(c) Replace the backtracking line search with your own, more sophisticated line search. Use gradient descent with your line search to minimize (1.1) starting from (>1n2. 1). Describe your line search (like the algorithm/methods section of a research paper), and plot the convergence behavior of gradient descent with your line search. How close to the minimum can you get with 100 cost function evaluations? Report |,f (s)|2 , where s is the iterate from the algorithm after 100 function evaluations.

We encourage you to work in groups to develop a sophisticated line search. As an incentive, you’ll receive「log10 ( |f (x(1))|2 )| extra credit points on this part. If you work in a group, you must list your group members and include a statement on the contributions of each group member in your problem set submission.

Hint: One idea to improve the backtracking line search is to use polynomial interpolation: ﬁt a polynomial to previous cost function values and derivatives and use the minimum of the ﬁtted polynomial as a trial step. To develop a line search from this idea, you must properly safeguard the trial step.

2023-02-19

Java

物理(Physical)

LINUX

C++

Python

Processing

sas

ios

maths

maple

C语言