Benjamin Ooghe-Tabanou
Benjamin Ooghe-Tabanou
I've checked what's already handled or in the pipe and commented a few points initalic inline. Don't dream it doesn't mean all boxes are meant to be checked! :) Some...
there is a compromise to find between a rich search engine and a simple one. The API allows to do more complex search queries than what the front allows but...
This is already what we do for the solr indexation in https://github.com/medialab/hyphe2solr In Hyphe itself, considering how the indexation already lags compared to the crawls, I'm not sure including full...
referring to #150 here
first mockups from CPH coding retreat https://twitter.com/jacomyma/status/1187418935429386242/photo/1
Can you give examples and highlight the fields and kind of information you feel would be useful ? Also doing it on demand or for crawled domains would sound doable,...
Comme l'indique la page EXPORT, celle-ci vise uniquement à exporter les métadonnées des webentités, pas de récupérer les données ni sur le réseau ni sur les pages web. À ce...
@g-arcas une fois en python2 il faut que tus utilises l'environnement python de hyphe pour avoir toutes les dépendances. Mais sinon effectivement avec minet cela devrait marcher plus simplement. Le...
id ≠ nom :)
Two other options might be considerable : - a "complete crawls" feature could run a crawl with depth 0 on all known uncrawled pages of each IN WebEntity - a...