ntscraper icon indicating copy to clipboard operation
ntscraper copied to clipboard

Certain re-tweets are not successfully scraped

Open morris-y opened this issue 1 year ago • 4 comments

It worked. But there are cases where certain re-tweets are not successfully scraped.

And this feels like fixable, because the instance I choose did include these re-tweets when I checked (instance = 'https://nitter.salastil.com'), and I tried different instances, it all show up same amounts of tweets in a certain time range.

morris-y avatar Oct 04 '23 15:10 morris-y

After testing, it seems that retweets over a certain time frame are not successfully scraped from the nitter. Recent retweets are often successfully scraped.

morris-y avatar Oct 05 '23 05:10 morris-y

Hi, the issue appears to be on nitter's part. I tried the advanced search on the instance you provided and other instances as well, on different accounts, but it seems that nitter either does not show retweets or only shows the most recent. For example, on X's profile, if you try to filter the search results between 01 July 2023 and 31 August 2023, you will notice that there are no retweets. But, if you check the same period through the main "tweets" page for the profile, there are 2 retweets. The same thing happens on nitter's official instance, nitter.net.

Looking through the issues on nitter's repo there is an open issue that describes the retweet filter as broken, so they are aware of the bug. For now we'll have to wait until it's solved.

bocchilorenzo avatar Oct 06 '23 19:10 bocchilorenzo

Can't you scrape them indirectly - through the "is-retweet" descriptor of tweets? Maybe I am understanding something wrong...

TomatoGreen2 avatar Dec 30 '23 14:12 TomatoGreen2

Just checked - not possible... sorry...

TomatoGreen2 avatar Dec 30 '23 14:12 TomatoGreen2