big-ann-benchmarks icon indicating copy to clipboard operation
big-ann-benchmarks copied to clipboard

OPtANNe

Open sdongaonkar opened this issue 2 years ago • 7 comments

BIGANN submission for GraphANN using Intel Optane Persistent Memory (high QPS HW)

sdongaonkar avatar Oct 22 '21 21:10 sdongaonkar

@sdongaonkar Hey, catching up with the PRs. Are the two OptANNE PR's intended to be different submissions? I think we already discussed this in email, but I just wanted to make sure:

https://github.com/harsha-simhadri/big-ann-benchmarks/pull/64

https://github.com/harsha-simhadri/big-ann-benchmarks/pull/63

sourcesync avatar Nov 27 '21 19:11 sourcesync

@sdongaonkar I sent you an email on the same topic. We are seeing improvement in 2 datasets (text2image and msspacev) but big decrease in the rest ( on the recall benchmark ). Was that intended?

sourcesync avatar Nov 29 '21 01:11 sourcesync

@sdongaonkar I sent you an email on the same topic. We are seeing improvement in 2 datasets (text2image and msspacev) but big decrease in the rest ( on the recall benchmark ). Was that intended?

Thanks for the heads up. I just sent a reply. It seems for 3 of the datasets the last search window size value was picked for the summary. The attached file has all the values, and I've highlighted the intended search_window_size values for recall and QPS rankings.

Please let me know if your numbers are still different to these ones.

recall_and_qps_results_highlighted.xlsx y.

sdongaonkar avatar Nov 29 '21 01:11 sdongaonkar

@sdongaonkar I sent you an email on the same topic. We are seeing improvement in 2 datasets (text2image and msspacev) but big decrease in the rest ( on the recall benchmark ). Was that intended?

Thanks for the heads up. I just sent a reply. It seems for 3 of the datasets the last search window size value was picked for the summary. The attached file has all the values, and I've highlighted the intended search_window_size values for recall and QPS rankings.

Please let me know if your numbers are still different to these ones.

recall_and_qps_results_highlighted.xlsx y.

OK @sdongaonkar, lets keep it as a github issue/conversation for now. I'd like to track down what went wrong and what might be wrong. The following is the algos.yaml we used when we ran the eval. Does it look correct?

https://github.com/harsha-simhadri/big-ann-benchmarks/blob/t3/eval_optanne_graphann/t3/optanne_graphann/algos.yaml

If it is correct, then something else besides the yaml is wrong. Thanks.

sourcesync avatar Nov 29 '21 03:11 sourcesync

@sdongaonkar Hey, we are currently unable to run any more evaluations due to this error. Any thoughts on what to do here?

Trying to instantiate benchmark.algorithms.graphann.GraphANN(['euclidean', {'index_file': '/mnt/data/competition_indexes/bigann1b_index127-superflat.index', 'vectors_file': '/mnt/data/competition_indexes/bigann1b_vectors.bin', 'vectors_location': 'HUGE'}]) Activating environment at ~/BigANN/GraphANN/contrib/PyANN/Project.toml Precompiling project...

✓ PyCall 7 dependencies successfully precompiled in 217 seconds (58 already precompiled) 1 dependency precompiled but a different version is currently loaded. Restart julia to access the new version [ Info: Running in an exclusive environment. Populating thread affinities. [ Info: Running in an exclusive environment. Populating thread affinities. [ Info: Running in an exclusive environment. Populating thread affinities. Running graphann on bigann-1B Got 10000 queries Copying index to PMem...

Process Process-1: Traceback (most recent call last): File "/home/bigann/.pyenv/versions/3.8.12/lib/python3.8/multiprocessing/process.py", line 315, in _bootstrap self.run() File "/home/bigann/.pyenv/versions/3.8.12/lib/python3.8/multiprocessing/process.py", line 108, in run self._target(*self._args, **self._kwargs) File "/home/bigann/BigANN/big-ann-benchmarks/benchmark/main.py", line 45, in run_worker run_no_docker(definition, args.dataset, args.count, File "/home/bigann/BigANN/big-ann-benchmarks/benchmark/runner.py", line 339, in run_no_docker run_from_cmdline(cmd) File "/home/bigann/BigANN/big-ann-benchmarks/benchmark/runner.py", line 230, in run_from_cmdline run(definition, args.dataset, args.count, args.runs, args.rebuild, File "/home/bigann/BigANN/big-ann-benchmarks/benchmark/runner.py", line 95, in run elif rebuild or not algo.load_index(dataset): File "/home/bigann/BigANN/big-ann-benchmarks/benchmark/algorithms/graphann.py", line 85, in load_index self.create_index_dir(ds), File "/home/bigann/BigANN/big-ann-benchmarks/benchmark/algorithms/graphann.py", line 151, in create_index_dir shutil.copy(self._index_file, graph_file0) File "/home/bigann/.pyenv/versions/3.8.12/lib/python3.8/shutil.py", line 418, in copy copyfile(src, dst, follow_symlinks=follow_symlinks) File "/home/bigann/.pyenv/versions/3.8.12/lib/python3.8/shutil.py", line 275, in copyfile _fastcopy_sendfile(fsrc, fdst) File "/home/bigann/.pyenv/versions/3.8.12/lib/python3.8/shutil.py", line 166, in _fastcopy_sendfile raise err from None File "/home/bigann/.pyenv/versions/3.8.12/lib/python3.8/shutil.py", line 152, in _fastcopy_sendfile sent = os.sendfile(outfd, infd, offset, blocksize) OSError: [Errno 28] No space left on device: '/mnt/data/competition_indexes/bigann1b_index127-superflat.index' -> '/mnt/pm0/public/graph.bin'

sourcesync avatar Nov 29 '21 22:11 sourcesync

I was running the benchmark script on it. I just logged out. It should work now.

sdongaonkar avatar Nov 29 '21 22:11 sdongaonkar

I was running the benchmark script on it. I just logged out. It should work now.

Ooops. Apologies...I should have checked if someone was logged in. Thanks!

sourcesync avatar Nov 29 '21 22:11 sourcesync