

CS/ECE/ME 532 Sec. 004

Matrix Methods in Machine Learning

[Unit 2] Supervised Learning and Solving Systems of Linear Equations – Overview

Learning Objectives

At the end of this module, students will be able to:

Design classifiers using training data and labels

Write systems of linear equations in matrix-vector form

Apply the definition of linear independence

Apply the concept of rank

Determine when a system of linear equations has no solution, a unique solution, or a nonunique solution

Determine whether a function is a valid norm

Find the least-squares solution to a system of linear equations

Compute gradients of linear and quadratic functions of a vector

Apply the definition of positive definite and positive semidefinite matrices

Apply the orthogonality principle to solve a least-squares problem

Project vectors onto a subspace spanned by a set of linearly independent vectors

Design classifiers using orthonormal bases

Find orthonormal bases using the Gram-Schmidt procedure

Solve the Tikhonov-regularized least-squares problem

Apply cross-validation to assess classifier performance

Significance of Unit

Supervised learning is the problem of training a machine learning algorithm on data for which we know the right answers. We present a large amount of labeled data to the algorithm so it can learn the patterns in that data; for example, when it looks at a picture, it is told the identity of the person shown. Once it has learned those patterns, we can give it new data that it has not yet seen, and it can make a decision, a prediction, or a classification. Supervised learning assumes you have data where you know the truth.
When you train a classifier or a model using supervised learning, you end up with a system of linear equations that needs to be solved. Much of the linear algebra we'll look at involves solving systems of linear equations, which will introduce ideas such as rank, subspaces, linear independence, bases, positive definite matrices, and so on. When you have a system of linear equations, there are three possibilities: (1) there is no solution, (2) there is a single, exact solution, or (3) there are infinitely many solutions. The cases we're interested in are 1 and 3. The middle case, where there's a single solution, rarely happens.
Often we're asking our algorithm to do something very difficult for which there may be no exact solution, so we try to find an approximate solution that is good under some criterion; the one we'll use is minimizing the squared error. The other case, where there are infinitely many solutions, also occurs often, and it is a problem because some of those solutions perform very poorly when we generalize to new data. We want to pick, out of the many possible solutions, a good one, so we will introduce criteria that bias us in that direction, using a technique called regularization.
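As a minimal sketch of the three cases for a system Ax = b, the following Python snippet compares the rank of the coefficient matrix A with the rank of the augmented matrix [A | b] and with the number of unknowns. The helper names `rank` and `classify_system` and the small example systems are illustrative assumptions, not part of the course materials. Exact fractions are used so that no floating-point tolerance is needed.

```python
from fractions import Fraction


def rank(rows):
    """Rank of a matrix (given as a list of rows) via exact Gaussian elimination."""
    m = [[Fraction(x) for x in row] for row in rows]
    n_cols = len(m[0]) if m else 0
    r = 0  # number of pivots found so far
    for c in range(n_cols):
        # Look for a row at or below position r with a nonzero entry in column c.
        pivot = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue  # no pivot in this column
        m[r], m[pivot] = m[pivot], m[r]
        # Zero out column c in every row below the pivot row.
        for i in range(r + 1, len(m)):
            factor = m[i][c] / m[r][c]
            m[i] = [a - factor * p for a, p in zip(m[i], m[r])]
        r += 1
    return r


def classify_system(A, b):
    """Report whether A x = b has no, exactly one, or infinitely many solutions."""
    augmented = [list(row) + [bi] for row, bi in zip(A, b)]
    rank_A, rank_Ab = rank(A), rank(augmented)
    n_unknowns = len(A[0])
    if rank_A < rank_Ab:
        return "no solution"  # b is not in the span of the columns of A
    if rank_A == n_unknowns:
        return "unique solution"  # columns of A are linearly independent
    return "infinitely many solutions"


# Hypothetical example systems (illustrative only).
tall_A = [[1, 0], [0, 1], [1, 1]]
print(classify_system(tall_A, [1, 1, 3]))  # no solution
print(classify_system(tall_A, [1, 1, 2]))  # unique solution
print(classify_system([[1, 1]], [2]))      # infinitely many solutions
```

The first system has more equations than unknowns and an inconsistent right-hand side, which is the situation where an approximate (least-squares) solution is sought; the last has fewer equations than unknowns, which is where a criterion such as regularization is needed to pick one solution.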
One of the things we're going to do is visualize the geometry of these equations and concepts. Linear algebra is highly geometric, and there is a lot of powerful insight you can get by looking at the geometry, so do your best at drawing these pictures. Finally, we'll look at a method for evaluating performance called cross-validation, in which you use some of your data to train the algorithm and the rest to test its performance, which gives a good estimate of how our results will generalize to new data.
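The train-then-test idea can be sketched in Python with NumPy as follows. This is only an illustration under stated assumptions: the data are synthetic, the labels are taken to be +1 or -1, the classifier is the sign of a Tikhonov-regularized least-squares fit, and the names `ridge_fit` and `cross_val_error`, the number of folds, and the regularization weight `lam` are all hypothetical choices rather than anything prescribed by the course.

```python
import numpy as np


def ridge_fit(A, y, lam):
    """Minimize ||A w - y||^2 + lam * ||w||^2 by solving (A^T A + lam I) w = A^T y."""
    n_features = A.shape[1]
    return np.linalg.solve(A.T @ A + lam * np.eye(n_features), A.T @ y)


def cross_val_error(A, y, lam, k=5):
    """Average misclassification rate over k held-out folds (labels in {-1, +1})."""
    indices = np.arange(len(y))
    folds = np.array_split(indices, k)  # contiguous folds; rows are assumed unordered
    errors = []
    for held_out in folds:
        train = np.setdiff1d(indices, held_out)
        w = ridge_fit(A[train], y[train], lam)   # train on k-1 folds
        predictions = np.sign(A[held_out] @ w)   # test on the remaining fold
        errors.append(np.mean(predictions != y[held_out]))
    return float(np.mean(errors))


# Synthetic data for illustration only: labels come from a noisy linear rule.
rng = np.random.default_rng(0)
A = rng.normal(size=(100, 3))
w_true = np.array([1.0, -2.0, 0.5])
y = np.sign(A @ w_true + 0.1 * rng.normal(size=100))

error = cross_val_error(A, y, lam=1.0)
print(f"estimated misclassification rate: {error:.2f}")
```

Because every sample is held out exactly once, the averaged error is computed only on data the fitted weights never saw, which is what makes it a reasonable estimate of performance on new data.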

Key Topics

1. Write classifier learning problem as a system of linear equations in matrix-vector form

2. Exact solutions of systems of linear equations

2.1. Linear independence of vectors

2.2. Rank of a matrix

2.2.1. Outer product representation for low-rank matrices

2.3. Conditions for no solution, a unique solution, or non-unique solutions

3. Approximate solutions of linear equations

3.1. Norms

3.2. Least squares formulation

3.3. Positive definite and positive semidefinite matrices

3.4. Gradients of linear and quadratic functions of a vector