Szilard Pafka

17 repositories owned by Szilard Pafka

benchm-ml

Stars: 1.9k, Forks: 335

A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algor...

GBM-perf

Stars: 211, Forks: 27

Performance of various open source GBM implementations

benchm-databases

Stars: 90, Forks: 17

A minimal benchmark of various tools (statistical software, databases etc.) for working with tabular data of moderately large sizes (interactive data analysis).

benchm-dl

Stars: 69, Forks: 11

Playing with various deep learning tools and network architectures

datascience-latency

Stars: 20, Forks: 4

Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for the most common analytical tasks (SQL-like data munging, linea...

dataset-sizes-kdnuggets

Stars: 16, Forks: 2

Size of datasets used for analytics based on 10 years of surveys by KDnuggets.

GBM-intro

Stars: 17, Forks: 4

GBM intro talk (with R and Python code)

GBM-multicore

Stars: 20, Forks: 1

GBM multicore scaling: h2o, xgboost and lightgbm on multicore and multi-socket systems

GBM-tune

Stars: 22, Forks: 3

Tuning GBMs (hyperparameter tuning) and impact on out-of-sample predictions

ml-prod

Stars: 72, Forks: 11

Some thoughts on how to use machine learning in production