aduana
Dumping of PageRank scores for external use
It would be nice to offer users a flexible scoring concept. To achieve that, we need to be able to dump from Aduana the top N requests (N can be 10K or 100K), sorted by PageRank, by calling some BFScheduler function. No exclusion mechanism is needed, just dumping.
Here are the fields that need to be dumped:
- url,
- fingerprint (if stored),
- PageRank score.
It would be nice to have a C function for this in the library and a Python binding for it.
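
To make the request concrete, here is a minimal Python sketch of the intended behaviour: select the top N records by PageRank score and write the three fields out. It is not Aduana's API. `PageRecord`, `top_n_by_score` and `dump_csv` are hypothetical names invented for illustration, and the CSV output format is an assumption as well.

```python
import csv
import heapq
from typing import Iterable, List, NamedTuple, Optional, TextIO


class PageRecord(NamedTuple):
    """Hypothetical record holding the three fields requested above."""
    url: str
    fingerprint: Optional[str]  # None when no fingerprint is stored
    score: float                # PageRank score


def top_n_by_score(records: Iterable[PageRecord], n: int) -> List[PageRecord]:
    """Return up to n records with the highest score, best first.

    heapq.nlargest keeps only n items in memory at a time, so the full
    set of pages does not have to be sorted.
    """
    return heapq.nlargest(n, records, key=lambda r: r.score)


def dump_csv(records: Iterable[PageRecord], out: TextIO) -> None:
    """Write records as CSV rows: url, fingerprint, score."""
    writer = csv.writer(out)
    writer.writerow(["url", "fingerprint", "score"])
    for r in records:
        # An absent fingerprint is written as an empty field.
        writer.writerow([r.url, r.fingerprint or "", r.score])
```

A caller would pass an iterable of records and a value of N such as 10000 or 100000, then hand the result to `dump_csv` together with an open file. A C implementation could follow the same shape, with a bounded min-heap over the stored pages.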