coordination-network-toolkit
coordination-network-toolkit copied to clipboard
False positives in co-link network?
I'm seeing a big difference between two outputted networks when I preprocess a .csv file with and without tweets which contains urls.
When I preprocess a .csv file which contains tweets without urls, I get more than 40 pairs of source/target combinations between profiles. But when I remove tweets without links in them, my network gets reduced to just a handful of profiles.
This makes me wonder if I am processing my data in a wrong way when I'm generating the .csv file? This is the current format of my csv file:
message_id,user_id,username,repost_id,reply_id,message,timestamp,urls
id,id,username,,id,"Tweet content.",1645488019,
id,id,username2,,id,Tweet content,1645488035,
id,id,username3,,id,Tweet content,1645488035,url
id,id,username4,,id,Tweet content,1645488035,url1 url2
Is this the correct way of doing it?