Stats 21 Final Exam Description
Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit
Stats 21 Final Exam Description
Finals week – Wed Dec 7, 2022 3 – 6PM
Exam is open note / open book. You may use any notes I have created for this class. You may use any of
your *own* homework submissions.
Official Python documentation sites:
- The Python Language Reference — Python 3.10.2 documentation
o Example Google search that does not break rules:
▪ insert item into a list site:docs.python.org
- NumPy Reference — NumPy v1.22 Manual
o Example Google search that does not break rules:
▪ matrix of ones site:numpy.org
- API reference — pandas 1.4.1 documentation (pydata.org)
o Example Google search that does not break rules:
▪ date index site:pandas.pydata.org
- Matplotlib: Python plotting — Matplotlib 3.4.2 documentation
o Example Google search that does not break rules:
▪ change symbol in scatterplot site:matplotlib.org
You can look up existing questions that already exist on stackexchange.
You cannot post questions related to the exam on sites like stackexchange, chegg, etc. No collaboration allowed.
Do not share the exam file with anyone. Do not post or distribute the exam file on any site, including github, bitbucket, or coursehero.
Exams are synchronous and proctored via Zoom (webcam on, mics muted).
I will provide a Jupyter Notebook .ipynb file with the questions. I’ll also upload any necessary data.
Final exam topics
No material from week 9 or week 10 will be on the Final exam. No SQL. No statsmodels. No scikit learn. Two parts:
- Data manipulation, summarizing, visualization with Pandas and Matplotlib
- Python programming: functions and classes
Data manipulation, summarizing, visualization with Pandas and Matplotlib
Recommended: review lectures covering Numpy, Pandas, matplotlib, seaborn,
I will provide a csv file. You will read it in with Pandas.
I will ask you do several tasks with the data, which could include any of the following:
- Change the index
- Create a date-based index
- Reindex the data and fill-in missing values or remove missing values
- Subset the data based on variable criteria
- Group the variable based on one variable and calculate the mean or some other summary statistic for each group
- Create a new column using a calculation based on other variables
- Calculate the correlation between variables
- Create a line plot for numeric variables, possibly multiple numeric variables, with legend
- Create a stacked or side-by-side barplot for categorical variables
- Ask you to use a function that was not explicitly taught in class. I will provide the name of the function and you should be able to find the appropriate help page, read about the function, and be able to use it.
- Ask you to explain or elaborate findings
Python Programming: functions and classes
Recommended: review lectures covering object-oriented programming, lectures on strings, tuples, lists, dictionaries
I will ask you to write a few functions that will take in arguments, perform a task, and output something. They can cover a range of topics, but I especially recommend familiarizing yourself with methods and attributes associated with strings, tuples, lists, and dictionaries.
I will give you instructions to create a class definition that incorporates a few of Python’s double-under methods like: __init__, __str__, __lt__, __add__, __len__, __getitem__, etc.
2022-12-06