A Unified Model for Stable and Temporal Topic Detection from.ppt
《A Unified Model for Stable and Temporal Topic Detection from.ppt》由会员分享,可在线阅读,更多相关《A Unified Model for Stable and Temporal Topic Detection from.ppt(46页珍藏版)》请在麦多课文档分享上搜索。
1、A Unified Model for Stable and Temporal Topic Detection from Social Media Data,Hongzhi Yin, Bin Cui, Hua Lu, Yuxin Huang and Junjie Yao Peking University Aalobrg University,Outline,Motivation Problem Formulation A Basic Solution A User-Temporal Mixture Model Enhancement of the basic solution Regular
2、ization Technique Burst-Weighted Boosting Experiments Q/A,Outline,Motivation Problem Formulation A Basic Solution A User-Temporal Mixture Model Enhancement of the basic solution Regularization Technique Burst-Weighted Boosting Experiments Q/A,Motivation,Motivation (Cont.),Two different types of topi
3、cs are mixed up in the social media platforms such as Twitter, Weibo and Delicious; Temporal Topics are temporally coherent meaningful themes. They are time-sensitive and often on popular real-life events or hot spots, i.e., breaking events in the real world. Stable Topics are often on users regular
4、 interests and their daily routine discussions, e.g., their moods and statuses.,One Example in Twitter,Temporal Topic : Dead pigs in Shanghai,Stable Topic : Big Data,Another Example in Twitter,Temporal Topic: Independence Day,Stable Topic: Animal Adoption,We can tell the difference between temporal
5、and Stable topics from their temporal distributions and their description words.,Motivation (Cont.),Discovering different topics of events that are coherent in temporal space Detecting bursty events, such as disaster (e.g., earthquakes), politics (e.g., election), and public events (e.g., Olympics)
6、Analyzing topic trends Extracting stable topics that are coherent in user-interest space. Finding user intrinsic interests and better modeling user preference,Outline,Motivation Problem Formulation A Basic Solution A User-Temporal Mixture Model Enhancement of the basic solution Regularization Techni
7、que Burst-Weighted Smoothing Experiments Q/A,Problem Formulation,A user-time-associated document d is a text document associated with a time stamp and a user. A temporal topic is a temporally coherent theme. In other words, the words that are emerging in the close time dimension are clustered in a t
8、opic. An example of temporal topics: Given a collection of user-time-associated tweets, the desired temporal topics are the events happening in different times. Formally, a temporal/stable topic is represented by a word distribution where,Problem Formulation (Cont.),A topic distribution in time dime
9、nsion is the distribution of topics given a specific time interval. Formally, is the probability of temporal topic given time interval t. A topic distribution in user space is the distribution of topics given a specific user. Formally, is the probability of stable topic given user u.,Problem Formula
10、tion (Cont.),A User-Time-Keyword Matrix M is a hyper-matrix whose three dimensions refer to user, time and keyword. A cell in Mu, t, w stores the frequency of word w generated by user u within time interval t. Given a collection of user-time-associated documents C, we first formulate matrix M Detect
11、ing Temporal Topics Extracting Stable Topics,Task 1,Task 2,Problem Formulation (Cont.),Detecting a set of temporal topics that are event-driven. Detecting bursty events, such as disaster (e.g., earthquakes), politics (e.g., election), and public events (e.g., Olympics) Analyzing topic trends Extract
12、ing a set of stable topics that are interest-driven. Finding user intrinsic interests and better modeling user preference,Outline,Motivation Problem Formulation A Basic Solution A User-Temporal Mixture Model Enhancement of the basic solution Regularization Technique Burst-Weighted Boosting Experimen
13、ts Q/A,A User-Time Mixture Model,Main InsightsTo find both temporal and stable topics in a unified manner, we propose a topic model that simultaneously captures two observations: Words generated around the same time are more likely to have the same event-driven temporal topicWords generated by the s
14、ame user are more likely to have the same interest-driven stable topic. The former helps find event-driven temporal topics while the latter helps identify interest-driven stable topics.,Combine user and time information We assume that when a user u generates a word w at time t, he/she is probably in
15、fluenced by two factors: the breaking news/events occurring in time t and his/her intrinsic interests. Breaking events are modeled by temporal topics and user intrinsic interests are modeled by stable topics.,The likelihood that user u generates word w at time t is as follows:Parameters and are mixi
- 1.请仔细阅读文档,确保文档完整性,对于不预览、不比对内容而直接下载带来的问题本站不予受理。
- 2.下载的文档,不会出现我们的网址水印。
- 3、该文档所得收入(下载+内容+预览)归上传者、原创作者;如果您是本文档原作者,请点此认领!既往收益都归您。
下载文档到电脑,查找使用更方便
2000 积分 0人已下载
下载 | 加入VIP,交流精品资源 |
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- AUNIFIEDMODELFORSTABLEANDTEMPORALTOPICDETECTIONFROMPPT

链接地址:http://www.mydoc123.com/p-373198.html