TumblThree
Adapt the Twitter crawler to output better text files
The new Twitter crawler can download pictures, videos and text.
"But its text representation could be more effective with twitter id, date, time etc. as Twint does, otherwise it's a great tool to me."
Originally posted by @cr1zydog in https://github.com/TumblThreeApp/TumblThree/issues/161#issuecomment-886141526
That's correct: the text file lacks some contextual information, especially compared to the Tumblr crawlers' output format:
Post id:
Date:
Post url:
Slug:
Reblog key:
Reblog url:
Reblog name:
Summary:
Title:
Tags:
The Twitter crawler should be adapted to output text files similar to those of the Tumblr crawlers, in either JSON or text format depending on the chosen setting.
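To make the idea concrete, here is a rough sketch of what such output could look like. It is written in Python purely for illustration and is not TumblThree's actual code. The `TweetRecord` type, its field names, and the `output_format` switch are all assumptions; the real set of fields would depend on what the Twitter crawler has available.

```python
import json
from dataclasses import dataclass, asdict, field
from typing import List


@dataclass
class TweetRecord:
    # Hypothetical fields, loosely mirroring the Tumblr text layout.
    post_id: str
    date: str
    post_url: str
    user: str
    text: str
    tags: List[str] = field(default_factory=list)


def format_as_text(tweet: TweetRecord) -> str:
    """Render one tweet as labeled 'Key: value' lines."""
    lines = [
        f"Post id: {tweet.post_id}",
        f"Date: {tweet.date}",
        f"Post url: {tweet.post_url}",
        f"User: {tweet.user}",
        f"Text: {tweet.text}",
        f"Tags: {', '.join(tweet.tags)}",
    ]
    return "\n".join(lines) + "\n"


def format_as_json(tweet: TweetRecord) -> str:
    """Render one tweet as a JSON object with the same fields."""
    return json.dumps(asdict(tweet), ensure_ascii=False, indent=2)


def format_record(tweet: TweetRecord, output_format: str = "text") -> str:
    # 'output_format' stands in for the user's chosen setting (assumed name).
    if output_format == "json":
        return format_as_json(tweet)
    return format_as_text(tweet)
```

Calling `format_record(tweet)` would give the labeled text block, and `format_record(tweet, "json")` the same data as JSON, so both settings stay in sync from one record type.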
Would you like to take care of this issue?