python-goose
Algorithm used in goose?
Hi,
I am working on my undergrad research thesis and using the goose extractor. Goose is really a commendable tool. However, I have a midterm presentation on my thesis, and I will have to explain the algorithm used by goose.
Can you please tell me the algorithm, or how goose extracts information from HTML pages?
Thanks, Faisal
Read the source. If you have any specific questions about the implementation, I'm sure someone will be more willing to assist you.
OK, thanks.