Radim Řehůřek

Results 11 repositories owned by Radim Řehůřek

gensim

15.3k
Stars
4.4k
Forks
Watchers

Topic Modelling for Humans

sqlitedict

1.1k
Stars
130
Forks
Watchers

Persistent dict, backed by sqlite3 and pickle, multithread-safe.

smart_open

3.1k
Stars
379
Forks
Watchers

Utils for streaming large files (S3, HDFS, gzip, bz2...)

gensim-data

953
Stars
127
Forks
Watchers

Data repository for pretrained NLP models and NLP corpora.

bounter

935
Stars
47
Forks
Watchers

Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.

gensim-simserver

109
Stars
61
Forks
Watchers

[NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]

sparsesvd

56
Stars
18
Forks
Watchers

Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition

data_science_python

58
Stars
40
Forks
Watchers

Source code for the "Practical Data Science in Python" tutorial

sim-shootout

99
Stars
28
Forks
Watchers

Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neighbours-intro

topic_modeling_tutorial

108
Stars
52
Forks
Watchers

Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit"