Commit 8c59082

version bump to v0.4.0
1 parent 9597a8a commit 8c59082

197 files changed

+2068
-415
lines changed

DESCRIPTION

+1 -1
@@ -4,7 +4,7 @@ Description: Datasets and code published by the data journalism website
  'FiveThirtyEight' available at <https://github.com/fivethirtyeight/data>.
  Note that while we received guidance from editors at 'FiveThirtyEight', this
  package is not officially published by 'FiveThirtyEight'.
- Version: 0.3.0.9000
+ Version: 0.4.0
  Authors@R: c(
  person("Albert Y.", "Kim", email = "[email protected]", role = c("aut", "cre")),
  person("Chester", "Ismay", email = "[email protected]", role = "aut"),

NEWS.md

+15 -3
@@ -1,6 +1,7 @@
- # fivethirtyeight 0.3.0.9000
+ # fivethirtyeight 0.4.0

- * Added new and missing data sets:
+ * Package website now at <http://fivethirtyeight-r.netlify.com/>
+ * Added new data sets:
  + By Meredith Manley: `ahca_polls`, `bachelorette`, `candy_rankings`,
  `chess_transfers`, `generic_polllist`, `generic_topline`, `mlb_elo`,
  `ratings`, `spi_global_rankings`, `spi_matches`, `tweets`
@@ -10,9 +11,20 @@
  `tv_hurricanes`, `tv_hurricanes_by_network`, `tv_states`, `sandy_311`,
  `senators`, `trump_approval_poll`, `trump_approval_trend`,
  `trumpworld_issues`, `trumpworld_polls`, `twitter_presidents`, `undefeated`
- * Package website now at <http://fivethirtyeight-r.netlify.com/>
+ * Note for now `mayweather_mcgregor_tweets`, `mlb_elo`, `nba_carmelo`,
+ `nfl_elo`, `senators`, only include the first 10 rows of the data because of R
+ package file size restrictions (< 5MB tarballs); code to download/process the
+ entire dataset are included in help files. We hope to include full versions of
+ these datasets in v0.5.0 of package using the `drat` package.
  * Converted all ordinal categorical variables to `factor(..., ordered = TRUE)`.
  Ex: `bechdel$clean_test`
+ * Vignette notes:
+ + Added a vignette that has a link to all user-contributed vignettes, which
+ areonly available on development version of package and on GitHub due to
+ CRAN package size restrictions.
+ + For all datasets that have a user contributed vignette, add a link to it
+ in the corresponding help/roxygen code file.
+ * Added `CITATION`


  # fivethirtyeight 0.3.0
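
The NEWS entry above says ordinal categorical variables were converted to `factor(..., ordered = TRUE)`. A minimal base R sketch of what that conversion does is shown here. The `ratings` vector and its levels are hypothetical and are not taken from the package.

```r
# Hypothetical ordinal values, for illustration only (not package data).
ratings <- c("low", "high", "medium", "low")

# Declare the level order explicitly; ordered = TRUE makes the factor ordinal.
ratings_ord <- factor(ratings,
                      levels = c("low", "medium", "high"),
                      ordered = TRUE)

is.ordered(ratings_ord)   # checks that the factor carries an ordering
sort(ratings_ord)         # sorts by level order, not alphabetically
ratings_ord < "high"      # element-wise comparison against a level
max(ratings_ord)          # largest level present in the data
```

With an ordered factor, comparisons, `sort()`, and `max()` follow the declared level order, which is the likely reason for converting a column such as `bechdel$clean_test`.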

R/data_albert.R

+14 -8
@@ -139,7 +139,7 @@
  #' \item{wine_servings}{Servings of wine in average serving sizes per person}
  #' \item{total_litres_of_pure_alcohol}{Total litres of pure alcohol per person}
  #' }
- #' @source World Health Organisation, Global Information System on Alcohol and Health (GISAH), 2010.
+ #' @source World Health Organization, Global Information System on Alcohol and Health (GISAH), 2010.
  #' @examples
  #' # To convert data frame to tidy data (long) format, run:
  #' library(tidyverse)
@@ -159,6 +159,8 @@
  #' The raw data behind the story
  #' "'Straight Outta Compton' Is The Rare Biopic Not About White Dudes"
  #' \url{https://fivethirtyeight.com/features/straight-outta-compton-is-the-rare-biopic-not-about-white-dudes/}.
+ #' An analysis using this data was contributed by Pradeep Adhokshaja as a package vignette at
+ #' \url{http://fivethirtyeight-r.netlify.com/articles/biopics.html}.
  #'
  #' @format A data frame with 761 rows representing movies and 14 variables:
  #' \describe{
@@ -187,6 +189,8 @@
  #' The raw data behind the story
  #' "A Statistical Analysis of the Work of Bob Ross"
  #' \url{https://fivethirtyeight.com/features/a-statistical-analysis-of-the-work-of-bob-ross/}.
+ #' An analysis using this data was contributed by Jonathan Bouchet as a package vignette at
+ #' \url{http://fivethirtyeight-r.netlify.com/articles/bob_ross.html}.
  #'
  #' @format A data frame with 403 rows representing episodes and 71 variables:
  #' \describe{
@@ -335,7 +339,7 @@
  #' \item{employed_fulltime_yearround}{Employed at least 50 weeks (WKW == 1) and at least 35 hours (WKHP >= 35)}
  #' \item{unemployed}{Number unemployed (ESR == 3)}
  #' \item{unemployment_rate}{Unemployed / (Unemployed + Employed)}
- #' \item{p25th}{25th percentile of earnigns}
+ #' \item{p25th}{25th percentile of earnings}
  #' \item{median}{Median earnings of full-time, year-round workers}
  #' \item{p75th}{75th percentile of earnings}
  #' }
@@ -362,15 +366,15 @@
  #' \item{grad_employed_fulltime_yearround}{Employed at least 50 weeks (WKW == 1) and at least 35 hours (WKHP >= 35)}
  #' \item{grad_unemployed}{Number unemployed (ESR == 3)}
  #' \item{grad_unemployment_rate}{Unemployed / (Unemployed + Employed)}
- #' \item{grad_p25th}{25th percentile of earnigns}
+ #' \item{grad_p25th}{25th percentile of earnings}
  #' \item{grad_median}{Median earnings of full-time, year-round workers}
  #' \item{grad_p75th}{75th percentile of earnings}
  #' \item{nongrad_total}{Total number of people with major}
  #' \item{nongrad_employed}{Number employed (ESR == 1 or 2)}
  #' \item{nongrad_employed_fulltime_yearround}{Employed at least 50 weeks (WKW == 1) and at least 35 hours (WKHP >= 35)}
  #' \item{nongrad_unemployed}{Number unemployed (ESR == 3)}
  #' \item{nongrad_unemployment_rate}{Unemployed / (Unemployed + Employed)}
- #' \item{nongrad_p25th}{25th percentile of earnigns}
+ #' \item{nongrad_p25th}{25th percentile of earnings}
  #' \item{nongrad_median}{Median earnings of full-time, year-round workers}
  #' \item{nongrad_p75th}{75th percentile of earnings}
  #' \item{grad_share}{grad_total / (grad_total + nongrad_total)}
@@ -405,7 +409,7 @@
  #' \item{employed_fulltime_yearround}{Employed at least 50 weeks (WKW == 1) and at least 35 hours (WKHP >= 35)}
  #' \item{unemployed}{Number unemployed (ESR == 3)}
  #' \item{unemployment_rate}{Unemployed / (Unemployed + Employed)}
- #' \item{p25th}{25th percentile of earnigns}
+ #' \item{p25th}{25th percentile of earnings}
  #' \item{median}{Median earnings of full-time, year-round workers}
  #' \item{p75th}{75th percentile of earnings}
  #' \item{college_jobs}{Number with job requiring a college degree}
@@ -424,6 +428,8 @@
  #' The raw data behind the story
  #' "Comic Books Are Still Made By Men, For Men And About Men"
  #' \url{https://fivethirtyeight.com/features/women-in-comic-books/}.
+ #' An analysis using this data was contributed by Jonathan Bouchet as a package vignette at
+ #' \url{http://fivethirtyeight-r.netlify.com/articles/comics_gender.html}.
  #'
  #' @format A data frame with 23272 rows representing characters and 16 variables:
  #' \describe{
@@ -476,7 +482,7 @@
  #' \item{care_data}{How much, if at all, do you care about the debate over the use of the word "data" as a singular or plural noun?}
  #' \item{care_proper_grammar}{In your opinion, how important or unimportant is proper use of grammar?}
  #' }
- #' @source See \url{https://github.com/fivethirtyeight/data/tree/master/comma-survey-data}.
+ #' @source See \url{https://github.com/fivethirtyeight/data/tree/master/comma-survey}.
  "comma_survey"


@@ -537,9 +543,9 @@
  #' \item{google_knowledge_occupation}{Their occupation or office, according to Google's Knowledge Graph or, if they're not in there, how Stewart introduced them on the program.}
  #' \item{show}{Air date of episode. Not unique, as some shows had more than one guest}
  #' \item{group}{A larger group designation for the occupation. For instance, us senators, us presidents, and former presidents are all under "politicians"}
- #' \item{raw_guest_list}{The person or list of people who appeared on the show, according to Wikipedia. The GoogleKnowlege_Occupation only refers to one of them in a given row.}
+ #' \item{raw_guest_list}{The person or list of people who appeared on the show, according to Wikipedia. The GoogleKnowledge_Occupation only refers to one of them in a given row.}
  #' }
- #' @source Google Knowlege Graph, The Daily Show clip library, Wikipedia.
+ #' @source Google Knowledge Graph, The Daily Show clip library, Wikipedia.
  "daily_show_guests"


R/data_chester.R

+4 -2
@@ -196,7 +196,7 @@
  #' \item{gamenum}{Order of All-Star Game for the season (in years w/ multiple ASGs; set to 0 when only 1 per year)}
  #' \item{gameid}{Game ID at Baseball-Reference.com}
  #' \item{lgid}{League of All-Star team}
- #' \item{startingpos}{Postion (according to baseball convention; 1=pitcher, 2=catcher, etc.) if starter}
+ #' \item{startingpos}{Position (according to baseball convention; 1=pitcher, 2=catcher, etc.) if starter}
  #' \item{off600}{Estimate of offensive talent, in runs above league average per 600 plate appearances}
  #' \item{def600}{Estimate of fielding talent, in runs above league average per 600 plate appearances}
  #' \item{pitch200}{Estimate of pitching talent, in runs above league average per 200 innings pitched}
@@ -257,6 +257,8 @@
  #' The raw data behind the story
  #' "Projecting The Top 50 Players In The 2015 NBA Draft Class"
  #' \url{https://fivethirtyeight.com/features/projecting-the-top-50-players-in-the-2015-nba-draft-class/}.
+ #' An analysis using this data was contributed by G. Elliott Morris as a package vignette at
+ #' \url{http://fivethirtyeight-r.netlify.com/articles/nba.html}.
  #'
  #' @format A data frame with 1090 rows representing National Basketball Association players/prospects and 9 variables:
  #' \describe{
@@ -333,7 +335,7 @@
  #' \item{name}{first initial.last name}
  #' \item{team}{team at time of suspension}
  #' \item{games}{number of games suspended (one regular season = 16 games)}
- #' \item{category}{personal conduct, substance abuse, peformance enhancing drugs or in-game violence}
+ #' \item{category}{personal conduct, substance abuse, performance enhancing drugs or in-game violence}
  #' \item{description}{description of suspension}
  #' \item{year}{year of suspension}
  #' \item{source}{news source}

R/data_jen.R

+7 -2
@@ -215,6 +215,9 @@
  #' The raw data behind the story
  #' "A Complete Catalog Of Every Time Someone Cursed Or Bled Out In A Quentin Tarantino Movie"
  #' \url{https://fivethirtyeight.com/features/complete-catalog-curses-deaths-quentin-tarantino-films/}.
+ #' An analysis using this data was contributed by Olivia Barrows, Jojo Miller, and Jayla Nakayama
+ #' as a package vignette at
+ #' \url{http://fivethirtyeight-r.netlify.com/articles/tarantino_swears.html}.
  #'
  #' @format A data frame with 1894 rows representing curse/death instances and 4 variables:
  #' \describe{
@@ -303,7 +306,9 @@
  #' The raw data behind the story
  #' "The World's Favorite Donald Trump Tweets"
  #' \url{https://fivethirtyeight.com/features/the-worlds-favorite-donald-trump-tweets/}.
- #' Tweets posted on twitter by Donald Trump (@@realDonaldTrump)
+ #' Tweets posted on twitter by Donald Trump (@@realDonaldTrump).
+ #' An analysis using this data was contributed by Adam Spannbauer as a package vignette at
+ #' \url{http://fivethirtyeight-r.netlify.com/articles/trump_twitter.html}.
  #'
  #' @format A data frame with 448 rows representing tweets and 3 variables:
  #' \describe{
@@ -461,7 +466,7 @@
  #' \item{mehs}{Mehs}
  #' \item{league_average_gpct}{League-average goose percentage}
  #' \item{ppf}{Pitcher park factor}
- #' \item{replacement_gpct}{Replacement-level goose pecentage}
+ #' \item{replacement_gpct}{Replacement-level goose percentage}
  #' \item{gwar}{Goose Wins Above Replacement}
  #' \item{key_retro}{Retrosheet unique player identifier}
  #' }
