fastRhockey icon indicating copy to clipboard operation
fastRhockey copied to clipboard

Boxscore and play-by-play stat scraper for the Premier Hockey Federation (PHF, formerly known as the NWHL).

fastRhockey

CRAN
version CRAN
downloads Version-Number R-CMD-check Lifecycle:maturing Contributors

fastRhockey is an R Package that is designed to pull play-by-play (and boxscore) data from the newest version of the Professional Women’s Hockey League website. In the past, there have been a few scrapers for the PHF (formerly the NWHL), but they have all been deprecated since the formation of the new PWHL league changed websites.

With the first season of the league kicking off on January 1st, and games being broadcast on ESPN+, this package was created to allow access to play-by-play data to continue pushing women’s hockey analytics forward.

In Spring of 2021, the Big Data Cup and the data they made available revolutionized what we were able to thanks to the detailed play-by-play data for the season and the x/y location data. That wave continued with the inaugural WHKYHAC conference in July that produced some amazing conversations and projects in the women’s hockey space.

In the past, the lack of data and poor access to data have been the biggest barrier to entry in women’s hockey analytics, a barrier that this package intends to alleviate.


Installation

You can install the CRAN version of fastRhockey with:

install.packages("fastRhockey")

You can install the released version of fastRhockey from GitHub with:

# You can install using the pacman package using the following code:
if (!requireNamespace('pacman', quietly = TRUE)){
  install.packages('pacman')
}
pacman::p_load_current_gh("sportsdataverse/fastRhockey", dependencies = TRUE, update = TRUE)

If you would prefer the devtools installation:

if (!requireNamespace('devtools', quietly = TRUE)){
  install.packages('devtools')
}
devtools::install_github(repo = "sportsdataverse/fastRhockey")

Documentation

You can find the documentation for fastRhockey on GitHub pages.

You can view CSVs of historical boxscore and play-by-play on the fastRhockey data repo, as well as the process for scraping that historical data.


Breaking Changes

Full News on Releases


Follow the SportsDataverse (@sportsdataverse) on Twitter and star this repo

GitHub
stars

Our Authors

  • Ben Howell (@BenHowell71)

@BenHowell71

  • Saiem Gilani (@saiemgilani)

@saiemgilani

Our Contributors (they’re awesome)

  • Alyssa Longmuir (@alyssastweeting)

@Aklongmuir

  • Tan Ho (@_TanHo)
    @tanho63

Citations

To cite the fastRhockey R package in publications, use:

BibTex Citation

@misc{howell_gilani_fastRhockey_2021,
  author = {Ben Howell and Saiem Gilani},
  title = {fastRhockey: The SportsDataverse's R Package for Hockey Data.},
  url = {https://fastRhockey.sportsdataverse.org/},
  year = {2021}
}