Commit fcedd52

Update tidy-data.Rmd (#1557)
* Update tidy-data.Rmd: it was not easy to try out the two data sets (tb and weather). With this change, anyone can run the code to get the two data sets. It also uses a tidyverse package (readr) to read in the data sets.
* Show how to follow along at home

Co-authored-by: Davis Vaughan <[email protected]>
1 parent b39be95 commit fcedd52

1 file changed

+4
-0
lines changed

vignettes/tidy-data.Rmd

@@ -198,6 +198,8 @@ billboard3 %>% arrange(date, rank)
 After pivoting columns, the key column is sometimes a combination of multiple underlying variable names. This happens in the `tb` (tuberculosis) dataset, shown below. This dataset comes from the World Health Organisation, and records the counts of confirmed tuberculosis cases by `country`, `year`, and demographic group. The demographic groups are broken down by `sex` (m, f) and `age` (0-14, 15-25, 25-34, 35-44, 45-54, 55-64, unknown).

 ```{r}
+# To run this on your own:
+# tb <- readr::read_csv("https://raw.githubusercontent.com/tidyverse/tidyr/main/vignettes/tb.csv")
 tb <- as_tibble(read.csv("tb.csv", stringsAsFactors = FALSE))
 tb
 ```
@@ -244,6 +246,8 @@ tb %>% pivot_longer(
 The most complicated form of messy data occurs when variables are stored in both rows and columns. The code below loads daily weather data from the Global Historical Climatology Network for one weather station (MX17004) in Mexico for five months in 2010.

 ```{r}
+# To run this on your own:
+# weather <- readr::read_csv("https://raw.githubusercontent.com/tidyverse/tidyr/main/vignettes/weather.csv")
 weather <- as_tibble(read.csv("weather.csv", stringsAsFactors = FALSE))
 weather
 ```
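
For readers following along outside the package source tree, the two commented-out lines added above could be folded into one small helper. This is only a sketch, not part of the commit: the names `vignette_data_url()` and `read_vignette_data()` are hypothetical, the base URL is copied from the added comments, and the `show_col_types` argument assumes readr 2.0 or later. The helper reads a local copy when one exists in the working directory and falls back to the raw GitHub URL otherwise.

```r
# Base URL taken from the comments added in this commit.
base_url <- "https://raw.githubusercontent.com/tidyverse/tidyr/main/vignettes"

# Hypothetical helper: build the raw-file URL for a vignette data set.
vignette_data_url <- function(name) {
  paste0(base_url, "/", name, ".csv")
}

# Hypothetical helper: read "<name>.csv" from the working directory if it
# exists, otherwise download it. readr::read_csv() returns a tibble, so no
# as_tibble() call is needed. Requires readr (and network access for the
# fallback).
read_vignette_data <- function(name) {
  local_path <- paste0(name, ".csv")
  path <- if (file.exists(local_path)) local_path else vignette_data_url(name)
  readr::read_csv(path, show_col_types = FALSE)
}
```

With this in place, `tb <- read_vignette_data("tb")` and `weather <- read_vignette_data("weather")` would stand in for the `read.csv()` calls in the vignette. Note that `readr::read_csv()` and base `read.csv()` guess column types differently, so the printed column types may not match the vignette output exactly.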