Community Topic Usage in Social Networks

Authors: 
Publication Type: 
Refereed Conference Meeting Proceeding
Abstract: 
When studying large social media data sets, it is useful to reduce the dimensionality of both the network (e.g. by finding communities) and user-generated data such as text (e.g. using topic models). Algorithms exist for both tasks; however, their combination has received little attention, and the models proposed to date are not scalable (e.g. [4]). One approach to such combined modelling is to perform community and topic modelling independently and later combine the results. In the case of overlapping communities, this combination requires a method for attributing each user's topic usage to the communities in which she participates. This paper presents a Bayesian model for attributing individual documents to communities, which balances the user's proportional community membership with community topic coherence. Community topic usage is modelled with a Dirichlet distribution with a fixed concentration parameter, leading to a well-defined conjugate prior. Though the prior is computationally expensive, the already reduced dimensionality in both topics and communities makes a tractable algorithm feasible, even for large data sets. The model is applied to a corpus of tweets and Twitter follower relations collected on hashtags used by people with eating disorders [14].
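The attribution idea in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm. It assumes each community's mean topic distribution is already known, whereas the paper places a conjugate prior over it. A document's topic mixture is scored under each community's Dirichlet distribution (mean times a fixed concentration), weighted by the user's membership proportion, and normalised. The names `dirichlet_logpdf`, `attribute_document`, `community_means`, and `concentration` are hypothetical and chosen for illustration only.

```python
import math


def dirichlet_logpdf(theta, alpha):
    """Log density of Dirichlet(alpha) at theta (entries > 0, summing to 1)."""
    log_norm = math.lgamma(sum(alpha)) - sum(math.lgamma(a) for a in alpha)
    return log_norm + sum((a - 1.0) * math.log(t) for a, t in zip(alpha, theta))


def attribute_document(doc_topics, membership, community_means, concentration):
    """Posterior probability of each community for a single document.

    Assumption (not from the paper): community_means holds a known mean
    topic distribution per community, and concentration is a fixed scalar.
    """
    log_post = {}
    for community, weight in membership.items():
        if weight <= 0.0:
            continue  # the user does not participate in this community
        alpha = [concentration * m for m in community_means[community]]
        log_post[community] = math.log(weight) + dirichlet_logpdf(doc_topics, alpha)

    # Normalise in log space for numerical stability.
    peak = max(log_post.values())
    unnorm = {c: math.exp(v - peak) for c, v in log_post.items()}
    total = sum(unnorm.values())
    return {c: v / total for c, v in unnorm.items()}


# Illustrative, made-up inputs: one user split evenly between two communities.
posterior = attribute_document(
    doc_topics=[0.7, 0.2, 0.1],
    membership={"a": 0.5, "b": 0.5},
    community_means={"a": [0.7, 0.2, 0.1], "b": [0.1, 0.2, 0.7]},
    concentration=10.0,
)
print(posterior)
```

With equal membership weights, the document is attributed mostly to the community whose topic usage it resembles. Unequal weights shift that balance toward the community in which the user participates more.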
Conference Name: 
2015 Workshop on Topic Models: Post-Processing and Applications (colocated with CIKM 2015)
Proceedings: 
Proceedings of the 2015 Workshop on Topic Models: Post-Processing and Applications
Digital Object Identifier (DOI): 
10.1145/2809936.2809937
Publication Date: 
24/10/2015
Conference Location: 
Australia
Research Group: 
Institution: 
National University of Ireland, Galway (NUIG)
Open access repository: 
No
Publication document: