Skip to content

Releases: dhicks/comp-HOPOS

v2.0

12 Nov 18:36
Compare
Choose a tag to compare

[2.0] - 2019-11-11

Added

library(UNF)

## UNF value for publications-philosophy of science-Rds v2.0
unf_value = 'nJaKSRjMpMV1zYGoOPFRlQ=='

pub_level = readRDS('publications_philsci.Rds')
pub_level_unf = unf(pub_level, version = 6, digits = 3, timezone = 'UTC')

identical(pub_level_unf$unf, unf_value)

Removed

  • Several redundant or (almost entirely) empty/NA columns were removed.
    • Redundant URL column; cf #11
    • member, prefix, score, source, subject, archive, authenticated.orcid, affiliation1.name, affiliation2.name, affiliation3.name, affiliation4.name, name, funder, assertion
  • Evelyn Brister manually identified and removed numerous non-article documents, such as tables of contents and book reviews.
  • Evelyn Brister manually identified authors who qualified as philosophers of science using the threshold criterion (i.e., 2 or more papers in a primary venue) but who primarily worked in other areas of philosophy. These authors are:
    • E. J. Lowe (metaphysics, phil mind, and phil lang.)
    • H B Acton (political philosophy)
    • Alasdair MacIntyre (ethics)
    • V. J. McGill
    • Jan Narveson (political theory)
    • Patrick Nowell-Smith (moral theory)
    • Daniel J O’Connor (philosophy of education)

Fixed

  • Evelyn Brister manually reviewed names and gender attribution, fixing issues related to initialization, misspellings, and incorrect or missing gender attribution (based on presentation on faculty websites, etc.).

Changed

  • The "philosophy of science" dataset size is now filtered by year, and includes only documents published between 1930 and 2017. The first primary philosophy of science venue (the first version of Erkenntnis) began publication in 1930, so our approach identifies very few "philosophers of science" prior to this year.

v1.1

26 Aug 23:28
Compare
Choose a tag to compare

This release fixes a substantial error that appeared when combing the gender attributions with the article metadata.

In v1.0, problems with the join logic when combining the results of the gender attribution algorithms (in script 06) meant that ~150 rows in the gender attribution dataframe had NA for both given and family names. All ~150 then matched to NA/NA author names in the article dataframe. The result was a massive inflation in the size of the dataset, and a mean of 26 authors per paper. Anyone familiar with philosophy should recognize this is incorrect.

Fixing the join logic in 06 appears to have solved the problem. Author inflation has disappeared. (In script 07, authors_unfltd has the same number of rows as authors_full.) In the full dataset, about 78% of papers have just 1 author; this is about 92% in the philosophy of science dataset.

v1.0

20 Aug 14:33
Compare
Choose a tag to compare

This is the first official release of the scripts used to construct the Comp HOPOS dataset.