dsci-benchmark
dsci-benchmark copied to clipboard
R scripts for benchmarking next word prediction algorithms developed for the Coursera Data Science Capstone Project.
Next word prediction benchmark
A simple R script for benchmarking a next word prediction algorithm.
Usage:
- Download the repository
- Extract data.zip into the current folder (password is provided in the Coursera forum)
- Open benchmark.R and run the code up to section 03
- (optional) create a wrapper function for your prediction function (section 03)
- Perform the benchmark (section 04)
- Report your results in the Coursera Forum
File description:
- data.zip Archive containing the benchmark datasets.
- benchmark.R Script needed to perform the benchmark (see above).
- generate_dataset.R Script used to generate the benchmark datasets (this should not be re-run and is provided for reference only)