Retweets `TEXT` is truncated, retweeted_status not available

Open rmoff opened this issue 5 years ago • 2 comments

Per https://developer.twitter.com/en/docs/tweets/data-dictionary/overview/tweet-object, the `text` field is truncated for retweets, with the full value available in `retweeted_status`. Can `retweeted_status` be added to the schema for this connector?

At the moment, the text of any RT is only available up to roughly its first 140 characters (give or take, depending on user name length etc.):

ksql> select retweet, max(len(text)) as max_tweet_length  from all_tweets group by retweet emit changes;
+--------------------------------------------------------------------+--------------------------------------------------------------------+
|RETWEET                                                             |MAX_TWEET_LENGTH                                                    |
+--------------------------------------------------------------------+--------------------------------------------------------------------+
|false                                                               |411                                                                 |
|true                                                                |152                                                                 |
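
To illustrate what having `retweeted_status` in the schema would allow downstream, here is a minimal sketch in Python that works on a tweet payload as a plain dict. It is not the connector's code. The helper name `full_text` and the sample payload are hypothetical, and the `extended_tweet` / `full_text` lookups are an assumption based on the v1.1 tweet object, so they may not apply to every payload.

```python
from typing import Any, Dict


def full_text(tweet: Dict[str, Any]) -> str:
    """Return the longest text available in a v1.1-style tweet payload."""
    # For a retweet, the original tweet is nested under `retweeted_status`;
    # fall back to the tweet itself when that key is absent or empty.
    source = tweet.get("retweeted_status") or tweet
    # Assumption: long tweets may carry their body in `extended_tweet.full_text`.
    extended = source.get("extended_tweet") or {}
    return extended.get("full_text") or source.get("full_text") or source.get("text", "")


# Hypothetical retweet: the top-level text is cut off, the nested one is not.
retweet = {
    "text": "RT @someone: The start of a long tweet that gets cut off…",
    "retweeted_status": {
        "text": "The start of a long tweet that gets cut off here, but not in the nested object."
    },
}
print(full_text(retweet))
```

Note that the value returned for a retweet is the original tweet's text, so it does not include the `RT @user:` prefix.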

rmoff, Dec 03 '19 11:12

@rmoff In the API I am using, getRetweetedStatus and getQuotedStatus return full Status objects. If I pulled these in, would that work for you?

jcustenborder, Dec 03 '19 14:12

Yes, that would be 💯 what I need 👍

rmoff, Dec 03 '19 17:12