---
title: "Technical Workshop Series - Final Project"
author: "Your Name Here"
date: "Due: Wednesday, April 28, 2021"
output: html_notebook
---

# Introduction

For the final project, we are going to apply what we learned about RStudio, data wrangling, and graphing. As I have said throughout the lessons, please try to use Google as much as possible when working through code. If you cannot find a solution online, my email is erg5331@psu.edu. I will also hold office hours on Sunday, April 25 from 6:00-7:00 PM EST. If you cannot attend, please email me with your questions and put the assignment and question number in the subject line. We can either schedule a 1-on-1 Zoom "office hours" session or work through the problem over email.

With that said, here is the final project - good luck! (Reminder: You can submit the final project only once.)

# Load Packages

For this assignment, we will need the packages for graphing (hint: there are 2), chaining, wrangling, renumbering, and organizing graphs (hint: there are 2).

```{r}
# clear R environment
rm(list = ls())

# Use this code chunk to load the 7 packages needed for this assignment.

```

# Import Data

To answer the questions below, we need to import the data from the Excel portion of the assignment. Save the table from Excel as a .csv file and import that file into R using the code chunk below. Then, perform some wrangling to change the variable names and reformat the "Tuition Cost After Aid" variable as a number without the dollar sign.

```{r}
# Use this code chunk to load the data frames needed for the rest of the final project.

```

## Data-Specific Questions

Now that we have our tuition data, we are ready to answer some questions and create some graphs! In the first section, we will check some of our answers from the Excel portion (step 6) of the final project. In the last section, we will create some graphs from the data.

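
As a reminder of the cleaning step described under Import Data, here is a minimal base R sketch of turning a currency string into a number. The data frame `toy`, its column names, and its values are all made up for illustration; your imported table will have different names and contents, and you are free to use the wrangling packages from the workshops instead.

```r
# Toy data standing in for the imported table. The column names and values
# are hypothetical and will not match your Excel export.
toy <- data.frame(
  Campus = c("University Park", "Altoona", "Beaver"),
  `Tuition Cost After Aid` = c("$12,500.00", "$8,250.50", "$9,000.00"),
  check.names = FALSE,
  stringsAsFactors = FALSE
)

# Rename the columns to short names without spaces.
names(toy) <- c("campus", "tuition_after_aid")

# Remove the dollar sign and thousands separators, then convert to numeric.
toy$tuition_after_aid <- as.numeric(gsub("[$,]", "", toy$tuition_after_aid))

# Inspect the result to confirm the column is now numeric.
str(toy)
```

Checking the column type with `str()` before moving on is a cheap way to catch values that failed to convert, since those show up as `NA`.
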
Note that some or all of the tasks will require data frame manipulation before you can answer the question with a table or graph. Carefully work through the code and check that every part is answered before submitting the assignment. Remember, you can only submit this assignment once, so you must ensure that your code runs properly. If you have any questions about your code, come to the office hours specified in the introduction of this project or email me at erg5331@psu.edu.

### Part 1 - Verifying Excel Answers

A.) (2 pts.) What is the total tuition for University Park, Harrisburg, Altoona, and Beaver students?

```{r}

```

B.) (2 pts.) Among sophomores, what is the standard deviation of GPA?

```{r}

```

C.) (2 pts.) For male juniors and seniors, what is the maximum tuition?

```{r}

```

D.) (2 pts.) Among University Park students in the IST, mineral sciences, and communications programs, what is the minimum GPA?

```{r}

```

E.) (2 pts.) In the business program, how many freshmen are there?

```{r}

```

### Part 2 - Creating Graphs with **ggplot** and **plotly**

F.) (10 pts.) Create 4 graphs of side-by-side boxplots (**ggplot**) for tuition cost after aid, where each graph is an academic class and each boxplot within a graph shows tuition for one gender. Color the boxplots by gender, keeping the color theme consistent across all 4 graphs. Ensure the graphs are labeled properly (each title should be the name of the class, and the axis titles should be cleaned up). Then, join the 4 graphs in a 2x2 grid with an appropriate title.

```{r}

```

G.) (10 pts.) Build 4 bar charts (**ggplot**) that show the total number of students, where each graph is an academic class and each program is a bar. Fill the bars using the following scheme:

- Freshmen: Light Blue
- Sophomores: Dark Gray
- Juniors: Light Gray
- Seniors: Dark Blue

Again, join the 4 graphs in a 2x2 grid with an appropriate title.

```{r}

```

H.) (5 pts.)
Construct a bubble chart (**plotly**) with GPA on the x-axis and tuition after aid on the y-axis. Then, add the bubbles, coloring them according to the program and sizing them appropriately. Ensure the graph has a title and that the legend is visible.

```{r}

```

# Conclusion

I appreciate you taking the time to come to the workshops and complete each of the assignments and the final project. After your final project submission, the technical workshop team will review your assignment and project submissions and give you a final grade. If your grade is at least 79.5%, you will pass the series and earn a special designation in the resume books.

While you await the results, I wish you good luck on final exams, any upcoming actuarial exams, and summer internships! If you do not have a summer internship, don't panic - just do something this summer to show recruiters that you weren't just playing video games for 3 months. I look forward to continuing our time together in the club in the 2021-2022 school year!