CN107122481A - News temperature real-time online Forecasting Methodology - Google Patents

News temperature real-time online Forecasting Methodology Download PDF

Info

Publication number
CN107122481A
CN107122481A CN201710308998.5A CN201710308998A CN107122481A CN 107122481 A CN107122481 A CN 107122481A CN 201710308998 A CN201710308998 A CN 201710308998A CN 107122481 A CN107122481 A CN 107122481A
Authority
CN
China
Prior art keywords
hot
news
calorific value
word
vocabulary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710308998.5A
Other languages
Chinese (zh)
Other versions
CN107122481B (en
Inventor
余军
卢品吟
刘盾
张汨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Hua Seiun Technology Co Ltd
Original Assignee
Chengdu Hua Seiun Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Hua Seiun Technology Co Ltd filed Critical Chengdu Hua Seiun Technology Co Ltd
Priority to CN201710308998.5A priority Critical patent/CN107122481B/en
Publication of CN107122481A publication Critical patent/CN107122481A/en
Application granted granted Critical
Publication of CN107122481B publication Critical patent/CN107122481B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/40Business processes related to social networking or social networking services

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Operations Research (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • Quality & Reliability (AREA)
  • Marketing (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Tourism & Hospitality (AREA)
  • Game Theory and Decision Science (AREA)
  • Development Economics (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Computer And Data Communications (AREA)

Abstract

本发明公开了一种新闻热度实时在线预测方法包括两大部分,热点事件分析与建模和最新新闻热度预测,将所有事件中的热词和热词对合并到一起形成热词和热词对的热值表,将热词在各个事件中的热值相加,得到热词和热词对的当前热值;不断更新上述热词和热词对的热值表;利用热点事件分析与建模步骤中得到的热词和热词对的热值表对中得到的词汇和词汇组合进行热度打分,即在热值表中查询词汇的热值和词汇组合的热值,将相同词汇和词汇组合的热值累加,得到每个词汇和词汇组合在当前新闻中的热值;将新闻中所有词汇和词汇组合的热度相加,得到新闻热度,此热度即为预测的新闻热度。本发明能够全面分析热点话题并及时更新热点新闻。The invention discloses a real-time online prediction method for news popularity, which includes two parts, hot event analysis and modeling and latest news popularity prediction, and hot words and hot word pairs in all events are combined to form hot words and hot word pairs The calorific value table of hot words in each event is added to obtain the current calorific value of hot words and hot word pairs; the above hot words and hot word pairs are constantly updated; The calorific value table of the hot words and hot word pairs obtained in the modeling step is used to score the vocabulary and vocabulary combinations obtained in the calorific value table, that is, the calorific value of the vocabulary and the calorific value of the vocabulary combination are queried in the calorific value table, and the same vocabulary and vocabulary The heat value of the combination is accumulated to obtain the heat value of each word and word combination in the current news; the heat value of all words and word combinations in the news is added to obtain the news heat, which is the predicted news heat. The invention can comprehensively analyze hot topics and update hot news in time.

Description

新闻热度实时在线预测方法Real-time Online Prediction Method of News Popularity

技术领域technical field

本发明涉及新闻资讯领域,具体涉及一种新闻热度实时在线预测方法。The invention relates to the field of news information, in particular to a method for real-time online prediction of news popularity.

背景技术Background technique

随着互联网技术的快速发展,网络舆情越来越影响社会的稳定发展,监控网络舆情是政府维护社会安定的一个重要环节。作为舆情监控其中的一个环节,热点新闻的预测显得尤其关键。微博以其独特的传播特性和实时交互特性改变着传统新闻信息的传播方式。尤其微博和移动终端的的结合,使微博信息能够更加快速的被转发或评论,微博平台上大量的用户评论和交流信息能够快速汇集为观点,从而形成一定的舆论走向。微博天然的开放性、实时性、交互性、海量性和易检性,构成了热点新闻预测的基础。通过综合分析新闻在微博平台的话题量判断新闻的热度。With the rapid development of Internet technology, Internet public opinion is increasingly affecting the stable development of society. Monitoring Internet public opinion is an important link for the government to maintain social stability. As one of the links in public opinion monitoring, the prediction of hot news is particularly critical. With its unique dissemination characteristics and real-time interaction characteristics, Weibo has changed the way of dissemination of traditional news information. In particular, the combination of Weibo and mobile terminals enables Weibo information to be forwarded or commented on more quickly, and a large number of user comments and exchanged information on the Weibo platform can be quickly gathered into opinions, thereby forming a certain trend of public opinion. Weibo's natural openness, real-time, interactive, massive and easy-to-check nature constitute the basis for hot news forecasting. Judging the popularity of news by comprehensively analyzing the topic volume of news on the Weibo platform.

传统的舆情热点话题仅仅是通过点击数、转发数、评论数等数据进行判断,但这种热点话题预测技术并不能够全面分析热点话题的特征,而且对热点新闻的提取不够及时。Traditional public opinion hot topics are judged only by data such as number of clicks, forwarding numbers, and comments, but this hot topic prediction technology cannot fully analyze the characteristics of hot topics, and the extraction of hot news is not timely enough.

发明内容Contents of the invention

本发明的目的在于克服现有技术的不足,提供一种新闻热度实时在线预测方法,能够全面分析热点话题并及时更新热点新闻。The purpose of the present invention is to overcome the deficiencies of the prior art and provide a real-time online prediction method for news popularity, which can comprehensively analyze hot topics and update hot news in time.

本发明的目的是通过以下技术方案来实现的:The purpose of the present invention is achieved through the following technical solutions:

一种新闻热度实时在线预测方法,包括以下两大部分:A real-time online prediction method for news popularity, including the following two parts:

热点事件分析与建模,包括以下步骤:Analysis and modeling of hotspot events, including the following steps:

S01:对发生过的热点事件,人工确定关键词,基于人工确定的关键词,从网络爬取各种该热点事件相关的资讯;S01: For the hot events that have occurred, manually determine the keywords, and based on the manually determined keywords, crawl various information related to the hot events from the Internet;

S02:事件热度值评估,利用网络爬取的信息总量,给事件的热度打分,信息总量越大的,分值越高,上不封顶;S02: Evaluation of event heat value, using the total amount of information crawled from the web to score the heat of the event. The larger the total amount of information, the higher the score, and there is no upper limit;

S03:对爬取的信息进行热词分析,找出20%热度最高的词汇;S03: Analyze hot words on the crawled information, and find out 20% of the most popular words;

S04:对已知的热点事件进行热点事件建模,分析各种词条对热度的贡献率,以及词条组合对事件热度的联合贡献率;S04: Perform hot event modeling on known hot events, analyze the contribution rate of various entries to the popularity, and the joint contribution rate of the combination of entries to the event popularity;

S05:利用事件热度值和热词对事件的贡献率,以及热词组合对事件的贡献率,计算热词的热值,计算公式为:热词热值=事件热值×热词频率/所有热词的频率之和;热词对热值=事件热值×热词对频率/所有热词对的频率之和;S05: Use the popularity value of the event, the contribution rate of the hot word to the event, and the contribution rate of the hot word combination to the event to calculate the heat value of the hot word. The calculation formula is: hot word heat value = event heat value × frequency of hot words / all The sum of the frequency of the hot words; the heat value of the hot word pair = the heat value of the event × the frequency of the hot word pair / the sum of the frequencies of all the hot word pairs;

S06:将所有事件中的热词和热词对合并到一起形成热词和热词对的热值表,将热词在各个事件中的热值相加,得到热词和热词对的当前热值;S06: Combine the hot words and hot word pairs in all events to form a hot word and hot word pair heat value table, add the hot words in each event to get the current hot word and hot word pair calorific value;

S07:不断更新上述热词和热词对的热值表;S07: constantly updating the calorific value table of the above-mentioned hot words and hot word pairs;

最新新闻热度预测,包括以下步骤:The hottest prediction of the latest news includes the following steps:

S11:实时采集各种来源的资讯,包括但不限于新闻,微博,论坛,贴吧的内容;S11: Collect information from various sources in real time, including but not limited to news, Weibo, forums, and post bars;

S12:对上述采集到的资讯进行分词,去掉停用词,得到新闻相关的词汇;S12: Segment the information collected above, remove stop words, and obtain news-related vocabulary;

S13:利用热点事件分析与建模步骤中得到的热词和热词对的热值表对中得到的词汇和词汇组合进行热度打分,即在热值表中查询词汇的热值和词汇组合的热值,将相同词汇和词汇组合的热值累加,得到每个词汇和词汇组合在当前新闻中的热值;S13: Use the heat value table of hot words and hot word pairs obtained in the hot event analysis and modeling step to score the words and word combinations obtained in the heat value table, that is, query the heat value of words and the number of word combinations in the heat value table Calorific value, adding up the calorific value of the same vocabulary and vocabulary combination to obtain the calorific value of each vocabulary and vocabulary combination in the current news;

S14:将新闻中所有词汇和词汇组合的热度相加,得到新闻热度,此热度即为预测的新闻热度。S14: Add the popularity of all words and word combinations in the news to obtain the news popularity, which is the predicted news popularity.

进一步的,所述的步骤S01中的网络包括各大新闻网站、微博、微信、论坛、贴吧,政府网站等不同渠道包含有该关键词的文章、微博、微信的内容。Further, the network in the step S01 includes articles, microblogs, and WeChat content containing the keyword in various channels such as major news websites, Weibo, WeChat, forums, post bars, and government websites.

进一步的,所述的步骤S02中事件热度值的计算公式是Hotvalue=sum[count×k],其中count表示(舆情总数),k为权重其取值为1~100。Further, the formula for calculating the heat value of an event in step S02 is Hotvalue=sum[count×k], where count represents (the total number of public opinions), and k is a weight whose value ranges from 1 to 100.

进一步的,所述的步骤S03中热词分析包括以下步骤,首先需要去除停用词,然后采用出现频率对词汇进行打分,词频指该词汇在所有内容中出现的次数,按得分搞定找出热度最高的20%的词汇。Further, the hot word analysis in the step S03 includes the following steps. First, stop words need to be removed, and then the vocabulary is scored by frequency of occurrence. Word frequency refers to the number of times the vocabulary appears in all content, and the popularity can be found according to the score. Top 20% of vocabulary.

本发明的有益效果是:本发明通过对当下热点事件进行分析,对个热点事件进行评分梳理事件热度形成热点事件表,然后针对热点事件表对采集的热点词汇进行打分,并通过一系列的计算得到相应的热度值,从而实现了全面实时的分析当下热点事件及时更新新闻热点。The beneficial effects of the present invention are: the present invention analyzes the current hot events, scores and sorts out the hot events to form a hot event table, and then scores the collected hot words according to the hot event table, and through a series of calculations Get the corresponding heat value, so as to realize the comprehensive real-time analysis of current hot events and timely update news hotspots.

具体实施方式detailed description

一种新闻热度实时在线预测方法,包括以下两大部分:A real-time online prediction method for news popularity, including the following two parts:

热点事件分析与建模,包括以下步骤:Analysis and modeling of hotspot events, including the following steps:

S01:对发生过的热点事件,人工确定关键词,基于人工确定的关键词,从网络爬取各种该热点事件相关的资讯;包括各大新闻网站,微博,微信,论坛,贴吧,政府网站等不同渠道包含有该关键词的文章,微博,微信等内容。S01: For the hot events that have happened, manually determine the keywords, and based on the manually determined keywords, crawl various information related to the hot events from the Internet; including major news websites, Weibo, WeChat, forums, post bars, government Different channels such as the website contain articles, Weibo, WeChat, etc. that contain the keyword.

S02:事件热度值评估,利用网络爬取的信息总量,给事件的热度打分,信息总量越大的,分值越高,上不封顶;不同的信息来源,可以人工设定权值,越重要的信息来源,对评分影响的权值越高,可以根据业务场景进行不断调整,例如对于新闻事件而言,来源于公信力高的网站,例如人民网,新华网等,可以设定较高的权值,对于娱乐事件而言,来源于明星大V的微博信息,可以给予较高的分值。S02: Evaluation of event heat value, using the total amount of information crawled from the network to score the heat of the event. The larger the total amount of information, the higher the score, and there is no upper limit; different information sources can manually set the weight, The more important the source of information, the higher the weight of the impact on the score, which can be continuously adjusted according to the business scenario. For example, for news events, it can be set higher if it comes from a website with high credibility, such as People's Daily Online, Xinhuanet, etc. The weight of , for entertainment events, comes from the Weibo information of celebrities and big Vs, and can be given a higher score.

S03:对爬取的信息进行热词分析,找出20%热度最高的词汇;热词分析,首先需要去除停用词,然后采用出现频率对词汇进行打分,词频指该词汇在所有内容中出现的次数,按得分搞定找出热度最高的20%的词汇,例如在某条体育新闻中,“足球”在文中出现的次数为10,则词汇“足球”的词频值为10,该专题下所有新闻中“足球”词频值相加,在所有单词的词频中排列在前20%,则“足球”为改专题下的热词之一。S03: Perform hot word analysis on the crawled information to find out 20% of the most popular words; hot word analysis, first need to remove stop words, and then use frequency of occurrence to score words, word frequency refers to the word appears in all content The number of times, according to the score to find the most popular 20% of the vocabulary, for example, in a certain sports news, "football" appears in the article 10 times, then the word frequency value of the word "football" is 10, all under this topic The word frequency value of "football" in the news is added, and it ranks in the top 20% of the word frequency of all words, then "football" is one of the hot words under the topic.

S04:对已知的热点事件进行热点事件建模,分析各种词条对热度的贡献率,以及词条组合对事件热度的联合贡献率;S04: Perform hot event modeling on known hot events, analyze the contribution rate of various entries to the popularity, and the joint contribution rate of the combination of entries to the event popularity;

S05:利用事件热度值和热词对事件的贡献率,以及热词组合对事件的贡献率,计算热词的热值,计算公式为:热词热值=事件热值×热词频率/所有热词的频率之和;热词对热值=事件热值×热词对频率/所有热词对的频率之和;S05: Use the popularity value of the event, the contribution rate of the hot word to the event, and the contribution rate of the hot word combination to the event to calculate the heat value of the hot word. The calculation formula is: hot word heat value = event heat value × frequency of hot words / all The sum of the frequency of the hot words; the heat value of the hot word pair = the heat value of the event × the frequency of the hot word pair / the sum of the frequencies of all the hot word pairs;

S06:将所有事件中的热词和热词对合并到一起形成热词和热词对的热值表,将热词在各个事件中的热值相加,得到热词和热词对的当前热值;S06: Combine the hot words and hot word pairs in all events to form a hot word and hot word pair heat value table, add the hot words in each event to get the current hot word and hot word pair calorific value;

S07:不断更新上述热词和热词对的热值表;S07: constantly updating the calorific value table of the above-mentioned hot words and hot word pairs;

进一步的,所述的步骤S02中事件热度值的计算公式是Hotvalue=sum[count×k],其中count表示(舆情总数),k为权重其取值为1~100,例如人民网,新华网等高权值网站来源新闻k值为100,微博等自媒体渠道普通用户新闻k值为1。Further, the formula for calculating the heat value of an event in step S02 is Hotvalue=sum[count×k], where count represents (the total number of public opinions), and k is a weight whose value ranges from 1 to 100, such as People's Network, Xinhuanet The k-value of news from websites with equal weight is 100, and the k-value of ordinary user news from we-media channels such as Weibo is 1.

最新新闻热度预测,包括以下步骤:The hottest prediction of the latest news includes the following steps:

S11:实时采集各种来源的资讯,包括但不限于新闻,微博,论坛,贴吧的内容;S11: Collect information from various sources in real time, including but not limited to news, Weibo, forums, and post bars;

S12:对上述采集到的资讯进行分词,去掉停用词,得到新闻相关的词汇;S12: Segment the information collected above, remove stop words, and obtain news-related vocabulary;

S13:利用热点事件分析与建模步骤中得到的热词和热词对的热值表对中得到的词汇和词汇组合进行热度打分,计算公式是Hotvalue=sum[count×k],其中count表示(舆情总数),k为权重其取值为1~100。即在热值表中查询词汇的热值和词汇组合的热值,将相同词汇和词汇组合的热值累加,得到每个词汇和词汇组合在当前新闻中的热值;S13: Use the heat value table of hot words and hot word pairs obtained in the hot event analysis and modeling step to score the words and word combinations obtained in the heat value. The calculation formula is Hotvalue=sum[count×k], where count means (the total number of public opinions), k is the weight and its value is 1~100. That is, query the calorific value of the vocabulary and the calorific value of the vocabulary combination in the calorific value table, accumulate the calorific value of the same vocabulary and vocabulary combination, and obtain the calorific value of each vocabulary and vocabulary combination in the current news;

S14:将新闻中所有词汇和词汇组合的热度相加,得到新闻热度,此热度即为预测的新闻热度。S14: Add the popularity of all words and word combinations in the news to obtain the news popularity, which is the predicted news popularity.

以上所述仅是本发明的优选实施方式,应当理解本发明并非局限于本文所披露的形式,不应看作是对其他实施例的排除,而可用于各种其他组合、修改和环境,并能够在本文所述构想范围内,通过上述教导或相关领域的技术或知识进行改动。而本领域人员所进行的改动和变化不脱离本发明的精神和范围,则都应在本发明所附权利要求的保护范围内。The above descriptions are only preferred embodiments of the present invention, and it should be understood that the present invention is not limited to the forms disclosed herein, and should not be regarded as excluding other embodiments, but can be used in various other combinations, modifications and environments, and Modifications can be made within the scope of the ideas described herein, by virtue of the above teachings or skill or knowledge in the relevant art. However, changes and changes made by those skilled in the art do not depart from the spirit and scope of the present invention, and should all be within the protection scope of the appended claims of the present invention.

Claims (4)

1. a kind of news temperature real-time online Forecasting Methodology, it is characterised in that including following two large divisions:
Focus incident is analyzed with modeling, and is comprised the following steps:
S01:To the focus incident occurred, keyword is manually determined, based on the keyword manually determined, is crawled from network various The related information of the focus incident;
S02:Event hot value is assessed, the informational capacity crawled using network, and the temperature to event is given a mark, and informational capacity is bigger , score value is higher, no ceiling;
S03:Hot word analysis is carried out to the information crawled, 20% temperature highest vocabulary is found out;
S04:Focus incident modeling is carried out to known focus incident, contribution rate of the various entries to temperature, and entry is analyzed Combine the joint contribution rate to event temperature;
S05:Heat is calculated to the contribution rate of event using the contribution rate of event hot value and hot word to event, and hot word combination The calorific value of word, calculation formula is:The frequency sum of hot word calorific value=event calorific value × hot word frequency/all hot words;Hot word is to calorific value The frequency sum of=event calorific value × hot word to frequency/all hot words pair;
S06:The calorific value table that hot word and hot word in all events to be formed to hot word and hot word pair to being merged together, hot word is existed Calorific value in each event is added, and obtains the current calorific value of hot word and hot word pair;
S07:Constantly update the calorific value table of above-mentioned hot word and hot word pair;
Latest news temperature is predicted, is comprised the following steps:
S11:The information in various sources, including but not limited to news, microblogging, forum, the content of mhkc are gathered in real time;
S12:Participle is carried out to the above-mentioned information collected, removes stop words, the related vocabulary of news is obtained;
S13:The vocabulary that the calorific value table centering for analyzing hot word and hot word pair with being obtained in modeling procedure using focus incident is obtained Temperature marking is carried out with word combination, i.e., the calorific value of vocabulary and the calorific value of word combination are inquired about in calorific value table, by identical vocabulary Added up with the calorific value of word combination, obtain the calorific value of each vocabulary and word combination in Present News;
S14:All vocabulary in news are added with the temperature of word combination, news temperature is obtained, this temperature is the new of prediction Hear temperature.
2. a kind of news temperature real-time online Forecasting Methodology according to claim 1, it is characterised in that:Described step Network in S01 includes major news websites, microblogging, wechat, forum, mhkc, and the different channels of government website include the pass The article of keyword, microblogging, the content of wechat.
3. a kind of news temperature real-time online Forecasting Methodology according to claim 1, it is characterised in that:Described step The calculation formula of event hot value is Hotvalue=sum [count × k] in S02, and wherein count is represented(Public sentiment sum), k is Its value of weight is 1 ~ 100.
4. a kind of news temperature real-time online Forecasting Methodology according to claim 1, it is characterised in that:Described step Hot word analysis comprises the following steps in S03, it is necessary first to remove stop words, and then vocabulary is given a mark using the frequency of occurrences, Word frequency refers to the number of times that the vocabulary occurs in all the elements, and the vocabulary for finding out temperature highest 20% is settled by score.
CN201710308998.5A 2017-05-04 2017-05-04 Real-time online prediction method of news popularity Active CN107122481B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710308998.5A CN107122481B (en) 2017-05-04 2017-05-04 Real-time online prediction method of news popularity

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710308998.5A CN107122481B (en) 2017-05-04 2017-05-04 Real-time online prediction method of news popularity

Publications (2)

Publication Number Publication Date
CN107122481A true CN107122481A (en) 2017-09-01
CN107122481B CN107122481B (en) 2020-06-30

Family

ID=59726634

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710308998.5A Active CN107122481B (en) 2017-05-04 2017-05-04 Real-time online prediction method of news popularity

Country Status (1)

Country Link
CN (1) CN107122481B (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109344316A (en) * 2018-08-14 2019-02-15 优视科技(中国)有限公司 News temperature calculates method and device
CN109376231A (en) * 2018-09-29 2019-02-22 杭州凡闻科技有限公司 A kind of media hotspot tracking and system
CN109885656A (en) * 2019-02-18 2019-06-14 国家计算机网络与信息安全管理中心 Microblog forwarding prediction method and device based on quantitative popularity
CN110457594A (en) * 2019-08-01 2019-11-15 深圳市顶尖传诚科技有限公司 A prediction method of public opinion hot spots based on big data
CN110750682A (en) * 2018-07-06 2020-02-04 武汉斗鱼网络科技有限公司 Title hot word automatic metering method, storage medium, electronic equipment and system
CN112597280A (en) * 2020-12-28 2021-04-02 上海朝阳永续信息技术股份有限公司 Method for automatically discovering hot keywords and hot news
CN113535956A (en) * 2021-07-26 2021-10-22 北京清博智能科技有限公司 News hotspot prediction method based on medium contribution degree
CN114938477A (en) * 2022-06-23 2022-08-23 阿里巴巴(中国)有限公司 Video topic determination method, device and equipment
CN116304339A (en) * 2023-03-21 2023-06-23 上海第一财经传媒有限公司 A News Popularity Prediction Method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101923544A (en) * 2009-06-15 2010-12-22 北京百分通联传媒技术有限公司 Method for monitoring and displaying Internet hot spots
CN102945290A (en) * 2012-12-03 2013-02-27 北京奇虎科技有限公司 Hot microblog topic digging device and method
US20130226560A1 (en) * 2010-02-05 2013-08-29 Jebu Ittiachen System and method for discovering story trends in real time from user generated content
CN104035960A (en) * 2014-05-08 2014-09-10 东莞市巨细信息科技有限公司 Internet Information Information Hotspot Prediction Method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101923544A (en) * 2009-06-15 2010-12-22 北京百分通联传媒技术有限公司 Method for monitoring and displaying Internet hot spots
US20130226560A1 (en) * 2010-02-05 2013-08-29 Jebu Ittiachen System and method for discovering story trends in real time from user generated content
CN102945290A (en) * 2012-12-03 2013-02-27 北京奇虎科技有限公司 Hot microblog topic digging device and method
CN104035960A (en) * 2014-05-08 2014-09-10 东莞市巨细信息科技有限公司 Internet Information Information Hotspot Prediction Method

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110750682B (en) * 2018-07-06 2022-08-16 武汉斗鱼网络科技有限公司 Title hot word automatic metering method, storage medium, electronic equipment and system
CN110750682A (en) * 2018-07-06 2020-02-04 武汉斗鱼网络科技有限公司 Title hot word automatic metering method, storage medium, electronic equipment and system
CN109344316A (en) * 2018-08-14 2019-02-15 优视科技(中国)有限公司 News temperature calculates method and device
CN109376231A (en) * 2018-09-29 2019-02-22 杭州凡闻科技有限公司 A kind of media hotspot tracking and system
CN109885656A (en) * 2019-02-18 2019-06-14 国家计算机网络与信息安全管理中心 Microblog forwarding prediction method and device based on quantitative popularity
CN109885656B (en) * 2019-02-18 2021-06-29 国家计算机网络与信息安全管理中心 Microblog forwarding prediction method and device based on quantitative popularity
CN110457594B (en) * 2019-08-01 2021-06-01 深圳市顶尖传诚科技有限公司 Big data-based public opinion hotspot prediction method
CN110457594A (en) * 2019-08-01 2019-11-15 深圳市顶尖传诚科技有限公司 A prediction method of public opinion hot spots based on big data
CN112597280A (en) * 2020-12-28 2021-04-02 上海朝阳永续信息技术股份有限公司 Method for automatically discovering hot keywords and hot news
WO2022141803A1 (en) * 2020-12-28 2022-07-07 上海朝阳永续信息技术股份有限公司 Method for automatically discovering hot keywords and hot news
CN113535956A (en) * 2021-07-26 2021-10-22 北京清博智能科技有限公司 News hotspot prediction method based on medium contribution degree
CN114938477A (en) * 2022-06-23 2022-08-23 阿里巴巴(中国)有限公司 Video topic determination method, device and equipment
CN114938477B (en) * 2022-06-23 2024-05-03 阿里巴巴(中国)有限公司 Video topic determination method, device and equipment
CN116304339A (en) * 2023-03-21 2023-06-23 上海第一财经传媒有限公司 A News Popularity Prediction Method

Also Published As

Publication number Publication date
CN107122481B (en) 2020-06-30

Similar Documents

Publication Publication Date Title
CN107122481A (en) News temperature real-time online Forecasting Methodology
US11847612B2 (en) Social media profiling for one or more authors using one or more social media platforms
Raisi et al. Cyberbullying identification using participant-vocabulary consistency
CN106980692B (en) An Influence Calculation Method Based on Weibo Specific Events
CN104156436B (en) Social association cloud media collaborative filtering and recommending method
CN103116605B (en) A kind of microblog hot event real-time detection method based on monitoring subnet and system
CN102999507B (en) The recommendation process method and apparatus of network microblog famous person's information
CN104298765B (en) The Dynamic Recognition and method for tracing of a kind of internet public feelings topic
Vosecky et al. Searching for quality microblog posts: Filtering and ranking based on content analysis and implicit links
CN103853700B (en) A kind of event method for early warning found based on region and object information
CN105893484A (en) Microblog Spammer recognition method based on text characteristics and behavior characteristics
CN108228745A (en) A kind of proposed algorithm and device based on collaborative filtering optimization
CN104850647A (en) Microblog group discovering method and microblog group discovering device
CN108228586A (en) A kind of cross-platform network leader of opinion ranks method
Steurer et al. Predicting interactions in online social networks: an experiment in second life
CN102270240A (en) Method for discovering hot views in network forum and analyzing evolvement trend thereof
CN105468780A (en) Normalization method and device of product name entity in microblog text
Xiao et al. Predicting user influence in social media
Abdullah et al. Reputation measurement based on a hybrid sentiment analysis approach for Saudi telecom companies
CN110210927A (en) A kind of IT books recommender system design based on collaborative filtering
Purohit et al. Finding influential authors in brand-page communities
CN113536086B (en) Model training method, account scoring method, device, equipment, medium and product
Sharma et al. A ranking algorithm for online social network search
CN114218474A (en) Social network rumor early detection system based on level perception attention GRU network
Wei et al. The Bayesian network and trust model based movie recommendation system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant