TY - JOUR
T1 - Exploring the effect of streamed social media data variations on social network analysis
AU - Weber, Derek
AU - Nasim, Mehwish
AU - Mitchell, Lewis
AU - Falzon, Lucia
PY - 2021/12
Y1 - 2021/12
N2 - To study the effects of online social network (OSN) activity on real-world offline events, researchers need access to OSN data, the reliability of which has particular implications for social network analysis. This relates not only to the completeness of any collected dataset, but also to constructing meaningful social and information networks from them. In this multidisciplinary study, we consider the question of constructing traditional social networks from OSN data and then present several measurement case studies showing how variations in collected OSN data affect social network analyses. To this end, we developed a systematic comparison methodology, which we applied to five pairs of parallel datasets collected from Twitter in four case studies. We found considerable differences in several of the datasets collected with different tools and that these variations significantly alter the results of subsequent analyses. Our results lead to a set of guidelines for researchers planning to collect online data streams to infer social networks.
AB - To study the effects of online social network (OSN) activity on real-world offline events, researchers need access to OSN data, the reliability of which has particular implications for social network analysis. This relates not only to the completeness of any collected dataset, but also to constructing meaningful social and information networks from them. In this multidisciplinary study, we consider the question of constructing traditional social networks from OSN data and then present several measurement case studies showing how variations in collected OSN data affect social network analyses. To this end, we developed a systematic comparison methodology, which we applied to five pairs of parallel datasets collected from Twitter in four case studies. We found considerable differences in several of the datasets collected with different tools and that these variations significantly alter the results of subsequent analyses. Our results lead to a set of guidelines for researchers planning to collect online data streams to infer social networks.
KW - Dataset reliability
KW - Social media analytics
KW - Social network analysis
UR - http://www.scopus.com/inward/record.url?scp=85109213597&partnerID=8YFLogxK
U2 - 10.1007/s13278-021-00770-y
DO - 10.1007/s13278-021-00770-y
M3 - Article
AN - SCOPUS:85109213597
SN - 1869-5450
VL - 11
JO - Social Network Analysis and Mining
JF - Social Network Analysis and Mining
IS - 1
M1 - 62
ER -