coordination-network-toolkit icon indicating copy to clipboard operation
coordination-network-toolkit copied to clipboard

False positives in co-link network?

Open havardl opened this issue 2 years ago • 3 comments

I'm seeing a big difference between two outputted networks when I preprocess a .csv file with and without tweets which contains urls.

When I preprocess a .csv file which contains tweets without urls, I get more than 40 pairs of source/target combinations between profiles. But when I remove tweets without links in them, my network gets reduced to just a handful of profiles.

This makes me wonder if I am processing my data in a wrong way when I'm generating the .csv file? This is the current format of my csv file:

message_id,user_id,username,repost_id,reply_id,message,timestamp,urls
id,id,username,,id,"Tweet content.",1645488019,
id,id,username2,,id,Tweet content,1645488035,
id,id,username3,,id,Tweet content,1645488035,url
id,id,username4,,id,Tweet content,1645488035,url1 url2

Is this the correct way of doing it?

havardl avatar May 06 '22 13:05 havardl