闪电代写 -代写CS作业_CS代写_Finance代写_Economic代写_Statistics代写_代码代做_IT代写_加急帮助

Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

Assignment 4: A Multi-Threaded HTTP Server

CSE 130: Principles of Computer Systems Design

Due: March 15, 2024 at 11:59 PM

Goals This assignment provides you with experience managing concurrency through synchronization. Nearly all modern systems are faced with managing concurrency to increase their hardware utilization. So, this project is a great way to have some real-world experience! We encourage you to consider the design of your server before you start building the code—design is extra important when dealing with multiple threads.

Overview This project will combine Assignments 2 and 3 to build a multi-threaded HTTP server. Namely, your server will add a thread-safe queue and some reader-writer locks (Asgn 3) to an HTTP server (Asgn 2) so that the server can serve multiple clients simultaneously. Think: what performance metric does this form of multi-threading improve, latency or throughput?.

While your server should process multiple clients simultaneously, it must ensure that its responses conform to a coherent and atomic linearization of the client requests. In effect, this means that an outside observer could not differentiate the behavior of your server from a server that uses only a single thread. Your server must create an audit log that identifies the linearization of your server (See below for more details).

Submission As always, the code that you submit must be in your repository on git.ucsc.edu. In particular, your assignment must build your HTTP server, called httpserver, when we execute the command make from the asgn4 directory. Submit a 40-character commit ID hash on Canvas to identify the commit that you want us to grade. We will grade the last hash that you submit and will use the timestamp of your last upload to determine grace days. For example, if you post a commit hash 36 hours after the deadline, we will subtract 2 grace days from your total. When you submit, you should only submit the commit ID you want us to grade.

Assignment Details

Your server must implement the functionality explained below subject to the limitations below.

Functionality

Your new httpserver should take two command line arguments, port—the port to listen on (this is the same as Assignment 2)—and threads—the number of worker threads to use. The port argument is required and the threads argument is optional (defaulting to 4). We recommend that you use getopt to parse the command-line arguments.

./httpserver [-t threads]

Like in Assignment 2, your server will create, listen, and accept connections on a socket that is listening on a port. However, it will also use threads worker threads to support up to threads multiple clients at the same time (as in Assignment 2, each client will send only one request in each connection) and will output an audit log to stderr.

Audit Log

Your server should produce an audit log that indicates the order in which it processed requests; your server should output the audit log to stderr. When your server processes each request, it should add an entry to the log with the following format:

,,,\n

The comma-separated single line of text starts with the type of operation that was performed, i.e., PUT or GET. Then, the line includes the URI (With the same limitations as in Assignment 2, i.e., the format /%63[A-Za-z.-]s). The line should then include the status code that your server intended to produce in the response to the request (for example, if your server intended to send a 200 Status-Code, but the client closed before reading the entire message, the audit log entry should still contain a 200 Status-Code). Finally, the line should include the value of an optional HTTP header, called RequestID, that a client will specify (or the keyword 0 if the RequestID header was not found in the request associated with the line).

Log entries should be Atomic (i.e., there should not be any partial, overwritten, or otherwise corrupted lines in your log). The log should be durable: your server should ensure that the log entries for each request processed by your server are resident in the log. If you are doing something extra ambitious (e.g., adding your own buffering for your log), then here are the durability semantics: your server must ensure that log entries are resident in the log after we send your server a SIGTERM signal (you can add a signal handler to add code that runs when a signal arrives to your server). Otherwise, we encourage you to use fprintf to produce audit-log entries; it provides the durability and atomicity requirements that you need.

Ordering Requirements

Your server should produce responses that are coherent and atomic with respect to the ordering specified in your audit log. That is, if an entry for a request R1 , is earlier than an entry for a request, R2 , in the log, then the server’s response to R2 must be as though the processing for R1 occurred in its entirety before any of the processing for R2. We call this ordering a linearization because it creates a single linear ordering of all client requests. We say that this ordering is a total order because it provides an ordering for all elements (i.e., for any two unique requests, R1 or R2 , your audit log will identify that either R1 happens-before R2 or that R2 happens-before R1 ).

Your server’s linearization must conform to the order of requests that it received in the following way: if two requests, R1 and R2 , arrive such that the start time of R2 is after the end time of R1 , then your audit log should indicate the line for R2 after the log line for R1. However, if R1 and R2 overlap (i.e., the start time of R1 is before the start time of R2 and the start time of R2 is before the end time of R1 ), then your server’s audit log could produce audit log entries in either order (i.e., R1 could be before R2 or R2 could be before R1 ).

Efficiency

Your server should be as efficient as possible while still providing the ordering requirement. In other words, your server should only block request processing if either (1) there are already thread active clients or (2) blocking request processing is necessary to ensure the coherency/atomicity of your server’s linearization of requests. This means that your server must concurrently process requests when it is safe to do so. It cannot simply spawn threads on demand, but only perform work in a single thread.

Multi-threading

Your server should use a thread-pool design, a popular concurrent programming construct for implementing servers. A thread pool has two types of threads, worker threads, which process requests, and a dispatcher thread, which listens for connections and dispatches them to the workers. It would probably make sense to dispatch requests by having the dispatcher push them into a threadsafe queue (Assignment 3, see Resources below) and having worker threads pop elements from that queue.

Worker Threads Your server must create exactly threads worker threads (Note: this means that the server must have exactly threads + 1 threads). A worker thread should be idle (i.e., sleeping by waiting on a lock, conditional variable, or semaphore) if there are no requests to be processed. Each worker thread should perform HTTP processing for a request (Assignment 1, see Resources below). You will need to carefully implement synchronization to ensure that your server properly maintains a state that is shared across worker threads and to ensure coherency and atomicity.

Dispatcher Thread Your server should use a single dispatcher thread. A typical design uses the main thread (i.e., the function call main), as the dispatcher. The dispatcher should wait for connections from a client. Once a connection is initiated, the dispatcher should alert one of the worker threads and listen for a new client. If there are no idle worker threads, then the dispatcher should wait until a worker thread finishes its current task. Your server will have to implement correct synchronization to ensure that the dis- patcher thread and worker threads correctly “hand-off” client requests without dropping any client requests, corrupting any data, or crashing the server.

Additional Functionality

In addition to supporting the methods listed above, your httpserver should not have any memory leaks. If you are feeling ambitious: the best way to ensure this is to write a signal handler that frees all memory. Otherwise, don’t worry, you can get all of your points as long as your server never uses more than 10 MBs of memory.

You are not allowed to use the function, flock in this assignment.

Limitations

You must write httpserver using the ‘C‘ programming language. Your program cannot use functions, like system or execve, that allow you to execute external programs. If your submission does not meet these minimum requirements, then the maximum score that you can get is 5%.

Examples

We include a number of examples to illustrate the intended functionality of httpserver.

Example 1

This example focuses on the audit log format. Suppose that the server receives the following requests, one after the other, and that a.txt exists initially, but b.txt does not:

GET /a.txt HTTP/1.1\r\nRequest-Id: 1\r\n\r\n

GET /b.txt HTTP/1.1\r\nRequest-Id: 2\r\n\r\n

PUT /b.txt HTTP/1.1\r\nRequest-Id: 3\r\nContent-Length: 3 \r\n\r\nbye

GET /b.txt HTTP/1.1\r\n\r\n

the server should produce the audit log:

GET,/a.txt,200,1

GET,/b.txt,404,2

PUT,/b.txt,201,3

GET,/b.txt,200,0

Example 2

This example focuses on times when your server should operate totally concurrently—i.e., when it should allow process multiple requests at the same time.

Suppose that we start your server with two threads in a directory containing the files a.txt, b.txt, and c.txt. Your server should create two worker threads and one dispatcher thread. The worker threads should wait for a message from the dispatcher, while the dispatcher thread should wait for a connection on its listen socket. Then, suppose that the following three requests arrive at your server concurrently (i.e., at the same time):

GET /a.txt HTTP/1.1\r\nRequest-Id: 1\r\n\r\n

GET /b.txt HTTP/1.1\r\nRequest-Id: 2\r\n\r\n

GET /c.txt HTTP/1.1\r\nRequest-Id: 3\r\n\r\n

The dispatcher thread should wake up one of the worker threads to handle one of the requests and wake up another worker thread to handle a second of the requests. The dispatcher should track that it has seen, but not processed, the third request 2 . Request processing should synchronize as little as possible—in this case, since the requests do not access the same URI, the processing should not need to synchronize at all! As soon as either worker finishes processing their request, that worker should begin processing the third request. The other thread should wait to be alerted by the dispatcher that there are more requests. Because the requests are operated concurrently, your server can produce any of the following six audit logs:

GET,/a.txt,200,1 GET,/a.txt,200,1 GET,/b.txt,200,2

GET,/b.txt,200,2 GET,/c.txt,200,3 GET,/a.txt,200,1

GET,/c.txt,200,3 GET,/b.txt,200,2 GET,/c.txt,200,3

GET,/b.txt,200,2 GET,/c.txt,200,3 GET,/c.txt,200,3

GET,/c.txt,200,3 GET,/a.txt,200,1 GET,/b.txt,200,2

GET,/a.txt,200,1 GET,/b.txt,200,2 GET,/a.txt,200,1

After all of this processing, the server should be in the steady state of having (1) the workers wait- ing for alerts from the dispatcher and (2) the dispatcher thread waiting for a connection (i.e., calling listener accept() on the listen socket) .

Example 3

This example identifies scenarios when your server’s request processing should synchronize.

Suppose that we start your server with two threads in a directory that does not contain the file, a.txt. Your server should create two worker threads and one dispatcher thread. The worker threads should wait for a message from the dispatcher, while the dispatcher thread should wait for a connection on its listen socket. Suppose that the following two requests arrive at your server concurrently:

GET /a.txt HTTP/1.1\r\nRequest-Id: 1\r\n\r\n

PUT /a.txt HTTP/1.1\r\nRequest-Id: 2\r\nContent-Length: 3 \r\n\r\nbye

Your server will need to produce an audit log and responses to requests that are consistent with each other. Namely, there are only two possible combinations of audit log and response:

Option 1.

Audit Log

GET,/goodbye.txt,404,1

PUT,/goodbye.txt,201,2

Request Response

1 HTTP/1.1 404 Not Found\r\nContent-Length: 10 \r\n\r\nNot Found\n

2 HTTP/1.1 201 Created\r\nContent-Length: 8 \r\n\r\nCreated\n

Option 2.

Audit Log

PUT,/goodbye.txt,201,2

GET,/goodbye.txt,200,1

Request Response

2 HTTP/1.1 201 Created\r\nContent-Length: 8 \r\n\r\nCreated\n

1 HTTP/1.1 200 OK\r\nContent-Length: 3 \r\n\r\nOK\n

After all of this processing, the server should be back in the steady state of having (1) the workers waiting for requests to arrive and (2) the dispatcher thread waiting for a connection (i.e., calling accept() on the listen socket).

Rubric

We will use the following rubric for this assignment:

CSE 130: Principles of Computer Systems Design Assignment 4