Identify open content license URLs used for open access articles in hybrid journals
Here's a reproducible example (reprex) to obtain the license URLs used by all hybrid journals covered by the Open APC initiative.
# required libraries
library(dplyr) # data transformation
#> Attaching package: 'dplyr'
#> The following objects are masked from 'package:stats':
#> filter, lag
#> The following objects are masked from 'package:base':
#> intersect, setdiff, setequal, union
library(tidyr) # working with list-columns
library(jsonlite) # working with json files
# load the most recent data dump, which also includes data from Jan and Feb 2020
license_df <- jsonlite::stream_in(url("https://raw.githubusercontent.com/subugoe/hybrid_oa_dashboard/update_jan_feb_20/data/jn_facets_df.json"), verbose = FALSE)
# prepare a summary table in which all license URL variants are broken down by publisher
license_df %>%
select(license_refs, journal_title, publisher) %>%
unnest(license_refs) %>%
# distinct cases per publisher
group_by(.id, publisher) %>%
summarise(n_cases = sum(V1))
#> # A tibble: 567 x 3
#> # Groups: .id [216]
#> .id publisher n_cases
#> <chr> <chr> <int>
#> 1 http:// creativecommons.org/licenses/… Cambridge University Pre… 3
#> 2 http://academic.oup.com/journals/page… Elsevier BV 1
#> 3 http://academic.oup.com/journals/page… Oxford University Press … 4734
#> 4 http://academic.oup.com/journals/page… Oxford University Press … 23
#> 5 http://aspb.org/publications/aspb-jou… American Society of Plan… 4039
#> 6 http://avs.scitation.org/jvb/authors/… American Vacuum Society 35
#> 7 http://creative commons.org/licenses/… Cambridge University Pre… 1
#> 8 http://creative commons.org/licenses/… Cambridge University Pre… 1
#> 9 http://creative commons.org/licenses/… Cambridge University Pre… 1
#> 10 http://creative%20commons.org/license… Cambridge University Pre… 2
#> # … with 557 more rows
Created on 2020-03-06 by the reprex package (v0.3.0)
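
The summary above lists several malformed variants of what appear to be the same license URL (stray spaces, `%20`). A minimal sketch of how such variants could be collapsed before counting is shown below. It runs on toy data only: the `toy_df` rows, the publisher name, the helper `normalise_license_url()`, and the choice to unify everything to `https://` without a trailing slash are assumptions made for illustration, not part of the original analysis, and the URLs are not the truncated values from the output.

```r
library(dplyr)

# Toy data in the same shape as the summary table (.id, publisher, n_cases).
# All values are made up for illustration.
toy_df <- data.frame(
  .id = c(
    "http://creativecommons.org/licenses/by/4.0/",
    "http:// creativecommons.org/licenses/by/4.0/",
    "http://creative commons.org/licenses/by/4.0",
    "http://creative%20commons.org/licenses/by/4.0/",
    "https://creativecommons.org/licenses/by-nc/4.0/"
  ),
  publisher = "Example Press",
  n_cases = c(10L, 3L, 1L, 2L, 5L),
  stringsAsFactors = FALSE
)

# Hypothetical helper: strip spaces and unify scheme and trailing slash.
normalise_license_url <- function(x) {
  x <- gsub("%20", "", x, fixed = TRUE)  # drop percent-encoded spaces
  x <- gsub("[[:space:]]+", "", x)       # drop literal whitespace
  x <- sub("^https?://", "https://", x)  # treat http and https as the same license
  sub("/+$", "", x)                      # drop trailing slashes
}

# Re-aggregate the counts on the normalised URL.
normalised_df <- toy_df %>%
  mutate(license_url = normalise_license_url(.id)) %>%
  group_by(license_url, publisher) %>%
  summarise(n_cases = sum(n_cases)) %>%
  ungroup()

normalised_df
```

In this toy example the four CC BY variants collapse into a single row with 16 cases, next to one CC BY-NC row with 5 cases. Whether such a normalisation is appropriate for the real data depends on how the malformed URLs should be reported, so the original variants are worth keeping alongside the cleaned column.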