Date of Original Version
Proceedings of the NIPS Workshop on Machine Learning for Social Computing
Abstract or Description
We propose a Bayesian generative model of how demographic social factors in- fluence lexical choice. We apply the method to a corpus of geo-tagged Twitter messages originating from mobile phones, cross-referenced against U.S. Census demographic data. Our method discovers communities jointly defined by linguistic and demographic properties.
Proceedings of the NIPS Workshop on Machine Learning for Social Computing.