(base) jiangyinzuo@kiwi:/usr3/jiangyinzuo_data/ann-benchmarks$ python3 create_dataset.py --dataset nytimes-256-angular
building matrix...
normalizing matrix entries with tfidf...
reducing dimensionality...
Splitting 300000*None into train/test
train size: 290000 * 256
test size: 10000 * 256
/usr3/jiangyinzuo_data/ann-benchmarks/ann_benchmarks/algorithms/bruteforce/module.py:47: RuntimeWarning: invalid value encountered in divide
X /= numpy.sqrt(lens)[..., numpy.newaxis]
0/10000...
/usr3/jiangyinzuo_data/ann-benchmarks/ann_benchmarks/distance.py:42: RuntimeWarning: invalid value encountered in scalar divide
distance=lambda a, b: 1 - np.dot(a, b) / (norm(a) * norm(b)),
1000/10000...
2000/10000...
3000/10000...
4000/10000...
5000/10000...
6000/10000...
7000/10000...
8000/10000...
9000/10000...
- python version: 3.12
- numpy version: 1.26.3