闪电代写 -代写CS作业_CS代写_Finance代写_Economic代写_Statistics代写_代码代做_IT代写_加急帮助

Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

SCC462 – Distributed Artiﬁcial Intelligence

Instructions

❼ This assignment consists of essay questions and coding questions.

❼ Please submit your replies on the Quiz available on Moodle. You can only submit

one time.

❼ The Quiz closes on 14/03/2022, 4pm, in order to allow for delayed submissions.

However, submissions made after 11/03/2022, 4pm, will have a 10% penalty in the grade. Your attempt will be automatically submitted on 14/03/2022, 4pm, if you do not click to ﬁnish the attempt on Moodle.

❼ Coding questions must have many comments. Please use the Precheck button

to check if you made enough comments in your code. Solutions without enough comments will not be accepted. Note that the system only recognises the # character for comments.

❼ Coding questions will be checked against test cases, and will be pre-graded based

on the weight assigned to each test that successfully passes. The Moodle Quiz shows some tests as an example (usually one test), and the Precheck button will check against the example test with no penalties. There are (many) more tests that are not visible to you now.

❼ There is no “Check” button. Once you submit, your code will be checked against

all test cases. The ﬁnal grade is given by me, not by the system.

❼ There are no test cases where the input has wrong type. However, the input could

be such that a calculation is impossible due to theoretical reasons.

❼ This is an INDIVIDUAL assignment. You must study the course contents, and

write your replies and your code based on your understanding.

– Solutions copied from another student will not be accepted, even if they have minor modiﬁcations.

– Solutions found on-line will not be accepted, even if you include a reference.

– You are allowed to check references only for:

✯ Fundamental Python information. For example, how to handle lists of

lists, how to randomly sample a number between 0 and 1 using the stan- dard library, etc.

✯ Theoretical information. For example, what is the formal deﬁnition of a Nash Equilibrium, what is the pseudo-code of AdaBoost (not Python code), etc.

– Hence, Python code obtained on-line that, for example, implements Ad- aBoost, calculates Nash Equilibria, etc, will not be accepted.

1. Write a rejection review of Marcolino, Passos et al (2016). That is, you must summarise the paper, explain why it is not yet suitable for publication, and give suggestions to the authors for further improvement. (10%)

2. According to Schapire (1990), each recursive application of the Boosting algorithm reduces the error of a classiﬁer from x to 3x2 − 2x3 (see Figure 1 in the paper). Based on this, write a Python function howManyIterations(x, epsilon), which returns how many recursive iterations of the Boosting algorithm would be needed to reduce the error of a base classiﬁer from x to a value lower than epsilon. In case the number of iterations cannot be calculated, then the function must return False. (10%)

3. Which variation is proposed by Schapire (1990) in order to make the Boosting algorithm run faster than its original version? Why is it faster? (5%)

4. Assume you are given a class WeakClassifier for a Weak Classiﬁer in a binary classiﬁcation problem, with the following methods:

- train(dataset, labels, p): trains a WeakClassifier object, using the dataset and the ground truth labels labels. The training set will be sampled using a probability distribution p over items. We assume that dataset is a list of lists, where each list corresponds to one item to be classiﬁed. Each item’s label is either 0 or 1.

- classify(item): returns a label predicted by the WeakClassifier object. For the purposes of this exercise, we will use the index of the item in the dataset as item, not the list with the features of the item.

Using that class, implement in Python the AdaBoost algorithm, following Fre- und and Schapire (1996). You must write your own code, solutions from on-line resources are not going to be accepted.

Write a class AdaBoost, with the following methods:

- train(dataset, labels, D, T), which trains the AdaBoost system with T classiﬁers, given a dataset in dataset and the corresponding ground truth labels in labels. D is the starting distribution over the examples, as in the AdaBoost paper.

Attention: diﬀerently from the original algorithm, you must re-train a Weak Clas- siﬁer if the error obtained after training would be such that the probability of correctly classiﬁed items would increase.

- classify(item), which classiﬁes an item using the trained AdaBoost system. Again, we will use the index of the item in the dataset as item, not the list with the features of the item.

Additionally, for the purposes of this exercise, we will use a fake WeakClassifier , see the ﬁle WeakClassiﬁer.py attached. The code for the Weak Classiﬁer will be automatically added to the beginning of your code. (15%)

5. In Freund and Schapire (1996), how does the on-line allocation problem and pro- posed solution (ﬁrst part of the paper) relate to the development of the AdaBoost algorithm (second part of the paper)? (10%)

6. Open BI has an ensemble system composed of n neural networks. They use one- hot encoding in a classiﬁcation problem, with l possible labels. That is, at training time, assuming for example 3 labels, the label 0 would be represented as [1, 0, 0], label 1 as [0, 1, 0], and label 2 as [0, 0, 1], where each element is the desired output of the corresponding output neuron. At inference time, however, each neuron returns a value between 0 and 1. These neural networks have a softmax layer in the end, so their output can be seen as a probability distribution function.

You were asked to interpret the output of the neurons as a ranking, and write a function to aggregate the rankings using the Borda voting rule. Additionally, all ties must be broken randomly. When deciding the winner of a tie, please use the function choice from the random module.

Hence, you must implement bordaAggregation(opinions), where:

- opinions is a list of lists. Each inner list corresponds to a neural network, and each element of that list corresponds to the output of an output neuron. For example, opinions = [[0.4, 0.5, 0.1], [0.2, 0.3, 0.5]] represents a system with two neural networks, where the ﬁrst one assigns value 0.4 to label 0, 0.5 to label 1, and 0.1 to label 2. Similarly, the second neural network assigns value 0.2 to label 0, 0.3 to label 1, and 0.5 to label 2. Note that in general we can have any number of neural networks and labels/output neurons.

- The function returns ranking, where each element corresponds to the ﬁnal rank- ing position. For instance, ranking = [2, 0, 1] would mean that label 2 is in the top position of the ﬁnal aggregated ranking, label 0 in the middle position, and label 1 in the last position. (10%)

7. Giggle wants to apply Stacked Generalisation to improve its predictions about cus-

tomer behaviour. Currently they are using a Logistic Regression class (LogisticRegression), which has the following interface:

- LogisticRegression(nFeatures, alpha = 0.15, threshold = 0.5, nEpochs = 200): Class constructor. The ﬁrst input is the number of features in the dataset, and alpha is the learning rate. If alpha is not passed, it uses 0.15. The hyper-parameter threshold is used to deﬁne whether the output of the logistic regression will be interpreted as a label 0 or 1. That is, outputs lower than or equal to threshold will be deﬁned as 0 when classifying items. nEpochs deﬁnes the number of epochs that will be used during training.

- train(dataset, labels): Trains the Logistic Regression classiﬁer, using the dataset dataset and the corresponding labels labels.

- classify(item): Returns a predicted label for the item item, using the trained Logistic Regression classiﬁer.

This classiﬁer is already given to you, see the ﬁle LogisticRegression.py attached. It will be automatically added to the beginning of your code.

You were hired to implement the class StackedGeneralisation for Giggle. Although Stacked Generalisation can work with a diverse set of classiﬁers, we will use only Logistic Regression as our base classiﬁers and as our aggregator, for their initial tests.

We will use blocks of size 1. That is, the training set will be divided in n blocks of 1 item each. In order to match with their test cases, please generate the training set of the aggregator following the original ordering of the training set, and train the classiﬁers in order. That is, when training the base classiﬁers, train ﬁrst the ﬁrst classiﬁer, followed by training the second classiﬁer, then the third classiﬁer and so on. Similarly, when creating the dataset for the aggregator, please generate ﬁrst the ﬁrst row, then the second row, etc.

Hence, your task is to implement the StackedGeneralisation class with the following methods:

- StackedGeneralisation(classifiers, aggregator): Class constructor. Receives a list classifiers of classiﬁer objects, in order. That is, the ﬁrst element corresponds to the ﬁrst classiﬁer, the second to the second classiﬁer and so on. Additionally, receives a classiﬁer object aggregator , which will learn the aggregation rule.

- train(dataset, labels): Trains the Stacked Generalisation system, given a dataset in dataset, and the corresponding labels in labels. The dataset will be represented as a list of lists, where each inner list is an item, and each element of the inner lists are features of the item.

- classify(item): Classify the item in item using the trained StackedGeneralisa- tion system. It returns a label (0 or 1). (15%)

8. Consider the team/ensemble success prediction system described in Marcolino, Lakshminarayanan et al. (2016). Let’s assume that we have an ensemble of 3 classiﬁers (c0 , c1 , c2 ), working across batches of 4 items, in a problem with 3 possible labels (0, 1 or 2). The ﬁnal decision of the ensemble is given by the plurality voting rule.

For each batch of items, we store the vote of each agent for each item, the ﬁ- nal classiﬁcation and the corresponding ground truth label, in a list of lists. Using this data, and the Logistic Regression class provided, create a classiﬁer to predict the success of the ensemble. You must implement two functions: predictionFeatureVector(batch) and successPrediction(batches).

predictionFeatureVector(batch) is an auxiliary function that returns the feature vector of the prediction methodology, given the stored information for one batch of items. In detail:

- batch is a list of lists, where each list corresponds to one item within the batch. For each item, we ﬁrst list the votes of each classiﬁer, then the deci- sion taken by the team, and ﬁnally the ground truth label. For example, batch = [[0, 1, 0, 0, 1], [1, 1, 0, 1, 1], [0, 1, 2, 0, 0], [0, 0, 0, 0, 0]] shows a case where the votes for each item were, respectively: [0, 1, 0], [1, 1, 0], [0, 1, 2], [0, 0, 0]. Additionally, the de- cision for each item was: 0, 1, 0, 0, respectively. Finally, the ground truth labels were 1, 1, 0, 0.

- The function returns a list featureVector, where each element corresponds to a subset of the classiﬁers. We will use the following ordering across possible subsets: {c0 }, {c1 }, {c2 }, {c0 , c1 }, {c0 , c2 }, {c1 , c2 }. Hence, featureVector = [1/4, 0, 0, 1/4, 1/2, 0] would mean that in this batch classiﬁer {c0 } was responsible for 1/4 of the ﬁnal team decisions, {c0 , c1 } for 1/4 of the decisions, and {c0 , c2 } for 1/2 of the decisions.

Concerning successPrediction(batches):

- batches is a 3D Python list (list of lists of lists), containing a list of batches, as de- scribed above. For example, batches = [[[0, 1, 0, 0, 1], [1, 1, 0, 1, 1], [0, 1, 2, 0, 0], [0, 0, 0, 0, 0]], [[0, 0, 0, 0, 0], [1, 1, 2, 1, 2], [0, 1, 1, 1, 1], [2, 2, 1, 2, 2]]] contains information about two batches of items. The ﬁrst batch is [[0, 1, 0, 0, 1], [1, 1, 0, 1, 1], [0, 1, 2, 0, 0], [0, 0, 0, 0, 0]] as described above, and the second batch is [[0, 0, 0, 0, 0], [1, 1, 2, 1, 2], [0, 1, 1, 1, 1], [2, 2, 1, 2, 2]].

- The function returns an object of type Logistic Regression, with the trained classiﬁer. Use default learning rate, threshold for classiﬁcation, and number of epochs when training in the Logistic Regression.

The code for the Logistic Regression classiﬁer will be automatically added to the beginning of your code. (15%)

9. Consider we have a game between two agents A and B , with two possible actions, as follows:

A/B	0	1
0	r0 ﹐ 0 , r0(′) ﹐ 0	r0 ﹐ 1 , r0(′) ﹐ 1
1	r1 ﹐ 0 , r 1(′) ﹐ 0	r1 ﹐ 1 , r 1(′) ﹐ 1

Reward r乞﹐j is the reward the Row player (agent A) receives when it takes action a乞 and Column player (agent B) takes action aj. Similarly, r乞(′)﹐j corresponds to the reward for the column player (agent B) when A takes a乞 and B takes aj. We will represent that table in Python as a list of lists, where each inner list corresponds to a row, as follows: [[(r0 ﹐ 0 , r0(′) ﹐ 0 ), (r0 ﹐ 1 , r0(′) ﹐ 1 )], [(r1 ﹐ 0 , r1(′) ﹐ 0 ), (r1 ﹐ 1 , r1(′) ﹐ 1 )]].

(a) Write in Python a function nashEquilibria(rewards), which returns all the

pure strategy Nash Equilibria in the game. rewards is a list of lists with the rewards for each player, as deﬁned above. The function returns a list of tuples, where each tuple is a pair of actions for each player if that pair is a Nash Equilibria of the game. For instance, [(0, 0), (1, 1)] means that there are two Nash Equilibria in the game: when A takes action a0 and B takes action a0 , and when A takes action a1 and B takes action a1 . If the game has no pure strategy Nash Equilibria, then it returns an empty list: []. (5%)

(b) Write in Python a function mixedEquilibria(rewards), which returns a mixed

strategy Nash Equilibria in the game deﬁned by the rewards in rewards. The function returns a tuple (x, y) where x is the probability of the A player play- ing a0 and y is the probability of the B player playing a0. If there is no mixed strategy Equilibria, then it returns an empty tuple: (). (5%)