A multiple feature integration model to infer occupation from social media records

Xiang Wang, Lele Yu, Junjie Yao, Bin Cui

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

With the rapid development of more and more social media applications, lots of users are connected with friends and their daily life and opinions are recorded. Social media provides us an unprecedented way to collect and analyze billions of users' information. Proper user attribute identification or profile inference becomes more and more attractive and feasible. However, the flourishing social records also pose great challenge in effective feature selection and integration for user profile inference. This is mainly caused by the text sparsity and complex community structures. In this paper, we propose a comprehensive framework to infer user's occupation from his/her social activities recorded in micro-blog message streams. A multi-source integrated classification model is set up with some fine selected features. We first identify some beneficial basic content features, and then we proceed to tailor a community discovery based latent dimension solution to extract community features. Extensive empirical studies are conducted on a large real micro-blog dataset. Not only we demonstrate the integrated model shows advantages over several baseline methods, but also we verify the effect of homophily in users' interaction records. The different effects of heterogeneous interactive networks are also revealed.

Original languageEnglish
Title of host publicationWeb Information Systems Engineering, WISE 2013 - 14th International Conference, Proceedings
Pages137-150
Number of pages14
EditionPART 2
DOIs
StatePublished - 2013
Externally publishedYes
Event14th International Conference on Web Information Systems Engineering, WISE 2013 - Nanjing, China
Duration: 13 Oct 201315 Oct 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume8181 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th International Conference on Web Information Systems Engineering, WISE 2013
Country/TerritoryChina
CityNanjing
Period13/10/1315/10/13

Keywords

  • Feature Selection
  • Heterogeneous Network
  • Micro-blog
  • Occupation Inference
  • User Profile Modeling

Fingerprint

Dive into the research topics of 'A multiple feature integration model to infer occupation from social media records'. Together they form a unique fingerprint.

Cite this