dmi-tcat icon indicating copy to clipboard operation
dmi-tcat copied to clipboard

retweet_count or count(retweet_id)

Open psegovias opened this issue 9 years ago • 3 comments

Example:

random tweet got 40 RT, in table retweet_count i have 6 RT, but if i search retweet_id i got 40 results, Any idea why?

psegovias avatar Mar 06 '15 13:03 psegovias

There are a couple of scenarios here:

  • legacy: retweet_id did not exist, this is why the 'identical tweet frequency' export counts identical tweets. Note that some tweets may be truncated or modified, and these will appear as separate rows + counts in the export.
  • track (streaming API): each tweet object sent through the streaming API also has a retweet_count. Whenever a tweet is posted, it is sent immediately over the stream to TCAT. For tweets captured through the streaming API the retweet_count in the TCAT tables will thus always be 0.
  • search API, lookup of IDs, etcetera: whenever you retrieve older tweets, it might have been retweeted in the mean time. When querying 'historical' tweets, the retweet_count in the tweet object might thus be non zero
  • Some of the export modules count the number of times a retweet_id appears.

I'll thus turn this question into a feature request: add an export module which displays tweets and the number of times their id appears as a retweet_id in the status object of other tweets in the query bin.

ErikBorra avatar Mar 06 '15 16:03 ErikBorra

Thanks @ErikBorra

psegovias avatar Mar 09 '15 15:03 psegovias

This should be added to the FAQ, and is related to issue #74

ErikBorra avatar Mar 18 '15 10:03 ErikBorra