CSCI 2100: Course Project (Fall 2022)

Due: 12 noon, 20 December 2022

(absolute deadline as we need to assign grades)

Goals of Course Project

1. To carry out the design of a solution for solving an application problem using some of the data structures learned in CSCI 2100 and to analyze the resulting performance of your solution.

2. Programming is one of the essential parts of the project in implementing your solution. However, a good design and an evaluation of your solution are equally important and will be graded accordingly.

3. Your submission is graded according to the following parts:

❼ Tests passed (half are simple, half are complex) (35% total score),

❼ Report and analysis of results (35% total score),

❼ Program and functions implemented with proper documentation and comments (30% total score)

– Implementing a hash table with insertion and deletion (10%)

– Implementing the k-selection and range-selection functions (10%)

– Quality of code (10%)

4. Your submission (through Blackboard) should have the following parts:

❼ A short (1-2 page) report containing the following:

– The design of your program and the tradeoffs made (for instance, why you chose one solution over another). It should include all the parameter combinations studied in your implementation. Examples of tradeoffs include the size of the hash table and the parameters of the hash function.

– A summary of the amortized performance in processing the queries, including the space/time complexity of your algorithm and the total and amortized times in generating your results on the test data for each type of query. You should discuss the effects of your tradeoffs on solution times.

– A summary of the amortized performance relative to the case of a hash table of one entry.

– A discussion, with justifications, of the best choice of the hash table size and the hash function parameters to support this application. Analyze the results you have obtained and consider other approaches to improve them.

❼ A properly documented program that can be tested by the teaching assistants in running new test queries beyond those available to you. Your program should compile correctly with no errors and be able to run using the data supplied by the teaching assistants.

❼ A VeriGuide report on your submission.
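One way to collect the total and amortized times asked for in the report is sketched below. It is only an illustration: the names `measure_amortized`, `compare_table_sizes`, and `make_table` are hypothetical and are not part of the files supplied with the project, and the sketch assumes your table exposes an `insert(key)` method.

```python
import time


def measure_amortized(run_query, queries):
    """Return (total_seconds, seconds_per_query) for a batch of queries."""
    start = time.perf_counter()
    for q in queries:
        run_query(q)
    total = time.perf_counter() - start
    # Amortized time: the total cost spread evenly over the whole batch.
    per_query = total / len(queries) if queries else 0.0
    return total, per_query


def compare_table_sizes(make_table, keys, sizes):
    """Time the same insertions for each candidate table size.

    make_table(size) is a hypothetical factory returning an object with an
    insert(key) method; including size 1 gives the one-entry baseline.
    """
    results = {}
    for size in sizes:
        table = make_table(size)
        results[size] = measure_amortized(table.insert, keys)
    return results
```

Calling `compare_table_sizes(make_table, keys, [1, 1024, 65536])` would give one (total, amortized) pair per table size, which can be tabulated directly in the report; the sizes shown are placeholders, not recommended values.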

Project Specifications

1. You are given 100K data records in a text file, each line containing two fields of a fixed length. The records are not given in any particular order, and the keys were generated randomly using a uniform distribution over the key space.

2. The first field in each record contains a search key with 256 bits. The range of the search key (U = 2^256 − 1) is much larger than the number of records (n ≈ 100000) in the text file and those added by the queries.

3. Create an empty hash table and a database to store the records. Read the text file one line at a time, inserting each record into the database and the corresponding pointer and other information into the hash table. Each database record is identified by its starting address (or index).

4. You are given a sequence of queries on the database (in files test_hash_table.py, test_hash_table.java, test_hash_table.cpp) of the following types:

❼ Inserting new records into the database or deleting existing records from the database, one at a time, each using the given search key;

❼ Finding the pointer to the record with the k-th smallest value of the search key (and a set of pointers to multiple records if there are duplicates), where k can range from 1 to the number of database records;

❼ Returning the pointers of all database records whose key values are in a given range of the search key; examples of ranges are the 1000 records with the smallest keys, the records whose keys are the 200 largest, and the records whose keys are in a given range of key values (say 10^5 to 5 × 10^5).

5. We will provide you with the following program files, which can be downloaded from Blackboard (as a zip file):

❼ Record.py, Record.cpp, Record.java: for reading data from the input file and classes to store the data records,

❼ example.py, example.cpp, example.java: the HashTable class, annotated with what you will implement,

❼ test_hash_table.py, test_hash_table.cpp, test_hash_table.java: tests that you run to test your program.

6. Your design should not be hard-coded for a given database.

❼ Your solution should be able to adapt to the dynamic growth and shrinking of the database.

❼ Your solution should be based on hashing only; all the queries should be run using the hash tables maintained; you should not maintain any sorted list of the keys. However, auxiliary information derived from the hash table can be maintained.

❼ If you use a linear search, a binary search, or any sorted list in your solution, you will receive a very low score.
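As an illustration of how the queries above might be served from a hash table alone, the following is a minimal sketch of one possible design, not the intended or reference solution. It assumes an order-preserving hash, h(key) = floor(key × m / 2^256), so that bucket order follows key order and the uniform keys spread evenly over the m buckets. The class name `OrderPreservingHashTable`, its method names, and its default parameters are hypothetical and do not come from the supplied example files. Keys are assumed to be Python integers in the range 0 to 2^256 − 1, and `index` stands for the record's position in the database.

```python
import heapq

KEY_BITS = 256  # key width stated in the specification


class OrderPreservingHashTable:
    """Chained hash table whose hash function is monotone in the key."""

    def __init__(self, num_buckets=8, max_load=4):
        self.min_m = num_buckets          # never shrink below the initial size
        self.m = num_buckets
        self.max_load = max_load
        self.buckets = [[] for _ in range(self.m)]  # unsorted (key, index) chains
        self.size = 0

    def _hash(self, key):
        # floor(key * m / 2^256): a larger key never lands in an earlier bucket.
        return (key * self.m) >> KEY_BITS

    def _resize(self, new_m):
        entries = [entry for chain in self.buckets for entry in chain]
        self.m = new_m
        self.buckets = [[] for _ in range(new_m)]
        for key, index in entries:
            self.buckets[self._hash(key)].append((key, index))

    def insert(self, key, index):
        self.buckets[self._hash(key)].append((key, index))
        self.size += 1
        if self.size > self.max_load * self.m:
            self._resize(2 * self.m)      # grow with the database

    def delete(self, key):
        """Remove one record with this key; return its index, or None."""
        chain = self.buckets[self._hash(key)]
        for i, (k, index) in enumerate(chain):
            if k == key:
                chain.pop(i)
                self.size -= 1
                if self.m > self.min_m and self.size < self.m:
                    self._resize(self.m // 2)  # shrink when sparse
                return index
        return None

    def kth_smallest(self, k):
        """Return indices of the record(s) holding the k-th smallest key."""
        if not 1 <= k <= self.size:
            raise IndexError("k out of range")
        remaining = k
        for chain in self.buckets:
            if remaining <= len(chain):
                # Select within this one chain only; no sorted structure is kept.
                keys = (key for key, _ in chain)
                target = heapq.nsmallest(remaining, keys)[-1]
                return [index for key, index in chain if key == target]
            remaining -= len(chain)

    def range_query(self, low, high):
        """Return indices of all records with low <= key <= high."""
        first, last = self._hash(low), self._hash(high)
        return [index
                for chain in self.buckets[first:last + 1]
                for key, index in chain
                if low <= key <= high]
```

In this sketch `kth_smallest` walks the bucket sizes from the first bucket onward, so its cost grows with the number of buckets; a tree of per-bucket counts, kept as auxiliary information derived from the hash table, is one possible way to reduce that. The table size and load threshold are exactly the kind of parameters whose effect the report is asked to study.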