In this assignment we will working with the Ohio COVID-19 Dashboard data: [login to view URL] (Links to an external site.)
and also the Population data available here: [login to view URL] (Links to an external site.)
Write a program that calculates and compares each Ohio County's COVID-19 rate (per 1000) among adults (age 20+) in the delta surge of fall 2021 compared to the current (and hopefully ending!) omicron wave. For this assignment we will consider the delta surge to be cases from July 1, 2021 to November 30, 2021. The current omicron wave began approximately December 1, 2021 and is still happening.
Specifics
Read in the Ohio COVID-19 data and the population data directly from the provided web addresses (like we did in class). You will work with the data so that you create a final dataset that contains the following 3 variables (you can have more but the grade will depend on these three)
The county's name
A character/factor variable (called "Surge" in provided code below) indicating if it is the delta surge or the current omicron wave
Cases per 1000 residents
Note: there are multiple ways to get from the original data sources to this wrangled version of the data. You will need to do some problem solving -- work it out on paper before starting!!!
Once you have the data in the proper/wrangled form do the following:
Write code that determines which 3 counties had the highest and lowest COVID-19 rates for adults age 20+ in the delta surge, and which 3 counties have the highest and lowest rates with the current omicron wave.
What do you notice when comparing the case rates between delta and omicron?
Also, include a code chunk in your program with the below code. Modify it to work with your your data but otherwise do not change it.
ggplot(ohio_data, aes(x=Surge, y=Cases_per_1000, group=County)) +
geom_line(color="gray50") +
stat_summary(fun="mean", group=1, geom="line", color="navy", size=2) +
coord_cartesian(xlim=c(0.9, 2.1), expand=FALSE) +
theme_minimal() +
labs(x="Surge", y="Cases per 1000 residents",
title="Changes of COVID-19 effects on Ohio Counties",
caption="Source: Ohio DOH and US Census Bureau")
This code should generate an aesthetically pleasing plot that shows the difference in COVID-19 rates from the delta surge to the present omicron wave for each county in Ohio, along with an average change. What does this plot suggest?
What to submit
Submit a RMarkdown document (Rmd file). I will knit this document and grade the outcome.
Make the document is reasonably well formatted such that code is clearly differentiated from output and written remarks. Make sure to include written remarks providing some light analysis. Note: code alone does not constitute a correct assignment.
Some hints/pointers
Recall all the dplyr verbs we have used: group_by(), summarize(), filter(), mutate() -- all are likely needed
As a first step, filter the COVID data to only include cases since (and including) July 1, 2021 -- all values before that date are not needed
To create the surge variable, consider one of two methods:
Read about using the ifelse() or case_when() function to create the "Surge" variable
Filter the data into two datasets (one for delta and one for omicron), create a surge variable in each, then stack them together
Make sure to adjust appropriately for population -- note we are only considering adults 20+ in this assignment
Hi there,
I have read your project description and i'm confident i can do this project for you perfectly.I still have a few questions. please leave a message on my chat so we can discuss the budget and deadline of the project.
Thanks.
..
.
Hi
I have checked your project description. I have excellent programming and
development skills. Contact me with all the details and requirements for
your project for further discussion. I will provide you with dedicated support
and follow-up.
Regards
Farah Khan