CN103365904B - A kind of advertising message searching method and system - Google Patents
- Publication number: CN103365904B (CN201210097984.0A)
- Authority
- CN
- China
- Prior art keywords
- advertiser
- words
- word
- effective
- candidate word
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
Abstract
The present application provides an advertisement information search method and system, relating to the field of network technology. The method comprises: expanding a query word input by a client to obtain an expanded candidate word set; analyzing a real-time search engine log and/or advertiser status information at the current time to obtain an effective word set, where the advertiser status information includes display condition information for the keywords selected by advertisers, and the effective word set includes the keywords that meet their display conditions at the current time; using the effective word set to filter out the words in the candidate word set that are not in the effective word set; and selecting at least one candidate word from the filtered candidate word set to retrieve advertisement information. By filtering out, in real time, expanded candidate words that cannot recall any advertisement, the application ensures that the search keywords submitted to the retrieval engine at the current time can recall advertisements for display promptly, so that advertisement display efficiency is not impaired and the limited display resources are fully used.
Description
Technical field
The present application relates to the field of network technology, and in particular to an advertisement information search method and system.
Background art
With the development of the Internet, the information it provides has become increasingly diverse, and advertising over the Internet has become an important choice for advertisers. Internet advertising services not only promote advertisers but also give users more service information. With Internet search engines serving as information service platforms, placing advertisements through a search engine has become a delivery method widely adopted by advertisers.
When searching, an Internet user submits a query word to a search engine, and the search engine returns to the client, as a search result page, links to the group of web pages most relevant to the query word; the user can click these links to visit the pages. In the prior art, when advertisements are placed through a search engine, an advertiser registers, through a search keyword bidding and ranking system, a group of search keywords closely related to the website or web page it wants to promote. After the client inputs a query word, the search engine first expands the query word to obtain candidate words, then selects the candidate words most relevant to the query word as search keywords, and then performs retrieval with each of these search keywords; if a keyword registered by an advertiser matches a search keyword, the advertiser's advertisement can be displayed. For example, for the query word "Nokia mobile phone", candidate words such as "Nokia", "mobile phone", "Nokia N97" and "Nokia 5230" can be expanded, and those whose relevance to "Nokia mobile phone" exceeds a threshold are selected as search keywords for retrieval.
In practice, however, the prior art is subject to three constraints. First, there is an upper limit on the number of search words a search engine can accept; if the number exceeds this limit, the engine's efficiency and performance suffer considerably, so when the engine expands a query word, the number of expanded search keywords it selects does not exceed the maximum it can accept. Second, the number of placement positions that can effectively display advertisements is limited. Third, advertisers generally restrict the display of their selected search keywords according to their own circumstances (for example, to a display time period). Consequently, if the search engine finds an advertiser's advertisement in the advertisement library using the expanded search keywords, but the display conditions set by the advertiser are not met at that moment (for example, the current time is outside the display time period the advertiser set), the search engine will not recall that advertisement for placement in an advertisement position.
Given the above, the prior art has the following disadvantages:
In the prior art, the search keywords that the search engine expands from the query word input by the user are chosen on relevance alone. Given the limit on the number of search words the engine can accept while performing optimally, the limited number of effective advertisement positions, and each advertiser's display conditions for its own advertisements, the expanded search keywords may include one or more keywords that cannot recall any advertisement for display after the keyword advertisement index is searched. Because the number of search keywords the engine can accept is limited, each such keyword occupies retrieval resources, so other search words related to the query word enter the retrieval process late or not at all. This hinders the display of other advertisements that are related to the query word and do meet their display conditions, and the expanded search keywords therefore cannot make full use of the limited advertisement positions to display advertisers' advertisements.
Summary of the invention
The technical problem to be solved by the present application is to provide an advertisement information search method and system that address two prior-art problems: expanded words may fail to recall any advertisement for display after the keyword advertisement index is searched, and advertisement display efficiency is low.
To solve the above problems, the present application discloses an advertisement information search method, comprising:
Expanding a query word input by a client to obtain an expanded candidate word set;
Analyzing a real-time search engine log and/or advertiser status information at the current time to obtain an effective word set; the advertiser status information includes display condition information for the keywords selected by advertisers; the effective word set includes the keywords that meet their display conditions at the current time;
Using the effective word set to filter out the words in the candidate word set that are not in the effective word set;
Selecting at least one candidate word from the filtered candidate word set to retrieve advertisement information.
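The four steps above can be read as a small pipeline. The following is a minimal sketch under stated assumptions, not the patented implementation: the function name `search_ads`, the `max_terms` cap, the toy expansion table and the toy advertisement index are all hypothetical, and the effective word set is passed in as if it had already been computed.

```python
from typing import Callable, List, Set


def search_ads(
    query: str,
    expand: Callable[[str], List[str]],
    effective_words: Set[str],
    retrieve: Callable[[str], List[str]],
    max_terms: int = 3,
) -> List[str]:
    """Expand the query, drop candidates outside the effective word set,
    and retrieve advertisements for at most max_terms surviving candidates."""
    candidates = expand(query)                                    # expansion step
    filtered = [w for w in candidates if w in effective_words]    # filtering step
    ads: List[str] = []
    for word in filtered[:max_terms]:                             # retrieval step
        ads.extend(retrieve(word))
    return ads


# Hypothetical toy data, for illustration only.
EXPANSIONS = {
    "Nokia mobile phone": ["Nokia", "mobile phone", "Nokia N97", "Nokia 5230"],
}
AD_INDEX = {"mobile phone": ["ad-17"], "Nokia N97": ["ad-42"]}

ads = search_ads(
    "Nokia mobile phone",
    expand=lambda q: EXPANSIONS.get(q, []),
    effective_words={"mobile phone", "Nokia N97"},  # assumed to be computed elsewhere
    retrieve=lambda w: AD_INDEX.get(w, []),
)
print(ads)
```

In this sketch "Nokia" and "Nokia 5230" never reach the retrieval step, so the limited number of retrieval slots is spent only on words that can currently recall an advertisement.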
Preferably, the user status information (that is, the advertiser status information) is obtained through the following step:
Obtaining, from a data storage server, the database tables corresponding to each advertiser, the database tables including the advertiser's advertising budget and the advertiser's advertisement promotion time.
Preferably, the database tables include an advertising budget table and an advertisement promotion schedule table for the advertiser, corresponding to the advertiser's advertising budget and advertisement promotion time respectively.
Preferably, after the database tables corresponding to the advertisers are obtained, the method includes:
Analyzing the advertising budget table and the advertisement promotion schedule table of each advertiser and, for each advertiser whose advertising budget is greater than a threshold and whose promotion time period contains the current time, adding the keywords selected by that advertiser to the effective word set.
Preferably, the effective word set is obtained by analyzing the real-time search engine log through the following steps:
Obtaining a real-time search log from the search engine;
Analyzing whether each search keyword in the real-time search log has returned an advertisement to the client within the current time period;
If it has, adding the corresponding search keyword to the effective word set.
Preferably, after the effective word set is used to filter out the words in the candidate word set that are not in the effective word set, the method further includes:
For each candidate word in the filtered candidate word set, calculating the relevance between the candidate word and the query word;
For each candidate word in the filtered candidate word set, counting the effective advertiser depth of the candidate word; the effective advertiser depth is the number of advertisers that have selected the candidate word and meet the display conditions at the current time;
Re-ranking the candidate words in the filtered candidate word set by combining the weight of each candidate word's relevance with the weight of its effective advertiser depth.
Preferably, the database tables further include an advertiser online status table.
Preferably, after the effective word set is used to filter out the words in the candidate word set that are not in the effective word set, the method further includes:
For each candidate word in the filtered candidate word set, calculating the relevance between the candidate word and the query word;
For each candidate word in the filtered candidate word set, counting the effective advertiser depth of the candidate word; the effective advertiser depth is the number of advertisers that have selected the candidate word and meet the display conditions at the current time;
Counting the number of online advertisers that have selected the candidate word and meet the display conditions at the current time;
Re-ranking the candidate words in the filtered candidate word set by combining the weight of each candidate word's relevance, the weight of its effective advertiser depth, and the number of online advertisers.
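The text does not state how the three signals are combined. One plausible reading is a weighted linear score, sketched below. The `Candidate` fields, the weights 0.6/0.3/0.1 and the normalization by the largest observed depth and online count are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Candidate:
    word: str
    relevance: float        # relevance to the query, assumed to lie in [0, 1]
    effective_depth: int    # advertisers that can currently display for this word
    online_count: int = 0   # of those advertisers, how many are currently online


def rerank(cands: List[Candidate], w_rel: float = 0.6,
           w_depth: float = 0.3, w_online: float = 0.1) -> List[Candidate]:
    """Sort candidates by a weighted sum of relevance, normalized effective
    advertiser depth, and normalized online advertiser count (highest first)."""
    # Normalize the two counts so that all three terms are on a 0..1 scale.
    max_depth = max((c.effective_depth for c in cands), default=0) or 1
    max_online = max((c.online_count for c in cands), default=0) or 1

    def score(c: Candidate) -> float:
        return (w_rel * c.relevance
                + w_depth * c.effective_depth / max_depth
                + w_online * c.online_count / max_online)

    return sorted(cands, key=score, reverse=True)


ranked = rerank([
    Candidate("Nokia N97", relevance=0.9, effective_depth=1, online_count=0),
    Candidate("mobile phone", relevance=0.7, effective_depth=10, online_count=4),
    Candidate("Nokia 5230", relevance=0.8, effective_depth=2, online_count=1),
])
print([c.word for c in ranked])
```

With these hypothetical numbers, "mobile phone" moves ahead of the more relevant "Nokia N97" because far more advertisers can currently display for it. Setting `w_online` to 0 gives the two-signal variant that uses only relevance and effective advertiser depth.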
Preferably, expanding the query word input by the client to obtain the expanded candidate word set includes:
Obtaining keywords newly selected by advertisers and adding them to an expansion thesaurus;
Calculating the relevance between the query word and each keyword in the expansion thesaurus and, when the relevance is greater than a threshold, adding that keyword to the candidate word set.
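As a rough illustration of the two steps above, the sketch below merges newly selected keywords into the thesaurus and keeps the keywords whose relevance to the query exceeds a threshold. The relevance measure is not specified in the text; token-level Jaccard overlap is used here purely as a stand-in, and the names `token_jaccard` and `build_candidates` and the threshold 0.2 are hypothetical.

```python
from typing import Iterable, List, Set


def token_jaccard(a: str, b: str) -> float:
    """Stand-in relevance: Jaccard overlap of whitespace-separated tokens."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def build_candidates(query: str, thesaurus: Set[str],
                     new_keywords: Iterable[str],
                     threshold: float = 0.2) -> List[str]:
    """Add newly selected keywords to the thesaurus, then return the keywords
    whose relevance to the query is above the threshold, most relevant first."""
    thesaurus.update(new_keywords)
    scored = [(token_jaccard(query, k), k) for k in thesaurus]
    hits = [(s, k) for s, k in scored if s > threshold]
    return [k for s, k in sorted(hits, key=lambda p: (-p[0], p[1]))]


thesaurus = {"Nokia", "mobile phone", "garden hose"}
candidates = build_candidates("Nokia mobile phone", thesaurus, ["Nokia N97"])
print(candidates)
```

Whitespace tokenization is only adequate for this English toy data; Chinese queries would need a word segmenter in its place.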
Correspondingly, the present application discloses an advertisement information search system, comprising:
A query word expansion module, configured to expand a query word input by a client to obtain an expanded candidate word set;
An effective word set acquisition module, configured to analyze a real-time search engine log and/or advertiser status information at the current time to obtain an effective word set; the advertiser status information includes display condition information for the keywords selected by advertisers; the effective word set includes the keywords that meet their display conditions at the current time;
A filtering module, configured to use the effective word set to filter out the words in the candidate word set that are not in the effective word set;
A retrieval module, configured to select at least one candidate word from the filtered candidate word set to retrieve advertisement information.
Preferably, the effective word set acquisition module includes:
A user status information acquisition submodule, configured to obtain, from a data storage server, the database tables corresponding to each advertiser, the database tables including the advertiser's advertising budget and the advertiser's advertisement promotion time.
Preferably, the database tables include an advertising budget table and an advertisement promotion schedule table for the advertiser, corresponding to the advertiser's advertising budget and advertisement promotion time respectively.
Preferably, the effective word set acquisition module further includes:
A user status analysis submodule, configured to analyze the advertising budget table and the advertisement promotion schedule table of each advertiser and, for each advertiser whose advertising budget is greater than a threshold and whose promotion time period contains the current time, to add the keywords selected by that advertiser to the effective word set.
Preferably, the effective word set acquisition module includes:
A real-time log acquisition submodule, configured to obtain a real-time search log from the search engine;
A real-time log analysis submodule, configured to analyze whether each search keyword in the real-time search log has returned an advertisement to the client within the current time period and, if it has, to add the corresponding search keyword to the effective word set.
Compared with the prior art, the present application has the following advantages:
After the system obtains the query word from the client and expands it into a large set of related candidate words, the present application analyzes advertisers' real-time status data or the search engine's real-time log to obtain the keywords that can currently recall advertisements, and adds them to the effective word set as effective keywords. The effective word set is then used to screen the candidate word set, yielding a set of candidate words that can recall advertisements. The screened candidate words are re-ranked, and at least one top-ranked search keyword is taken, up to the number of search keywords at which the search engine maintains optimal performance; retrieval is then performed with these search keywords. By filtering out, in real time, expanded candidate words that cannot recall advertisements, the method ensures that the search keywords submitted to the retrieval engine at the current time can recall advertisements for display promptly, so that advertisement display efficiency is not impaired and the limited display resources are fully used.
Brief description of the drawings
Fig. 1 is a schematic flowchart of an advertisement information search method of the present application;
Fig. 2 is a schematic structural diagram of an advertisement information search system of the present application;
Fig. 3 is a schematic structural diagram of a preferred advertisement information search system of the present application.
Detailed description
To make the above objects, features and advantages of the present application clearer and easier to understand, the application is described in further detail below with reference to the accompanying drawings and specific embodiments.
Referring to Fig. 1, a schematic flowchart of an advertisement information search method of the present application is shown, which may specifically include:
Step 110, expand the query word input by the client to obtain an expanded candidate word set.
Because users and advertisers differ in, for example, their domain knowledge, the keywords an advertiser selects (that is, the bid words it purchases) may not fully match the way users phrase their searches, so the products or advertisements the advertiser wants to promote are not exposed as widely as possible. Search engines therefore generally expand the query word input by the user. Under the keyword advertisement indexing approach commonly used by search engines today (advertisements are matched through the bid words that advertisers have purchased), expanded matching of keyword advertisements generally starts from the original user query and, while preserving a certain degree of relevance, applies various operations (adding words, removing words, substitution and so on) to derive one or more bid words; the original word and the derived bid words are then used as new queries for advertisement matching.
In keyword advertising, expanded matching of bid words generally considers not only relevance but also commercially motivated criteria, such as preferring words with a high PPC (pay-per-click, the fee charged for each click on an advertisement) or a high customer depth (the number of customers that have purchased a given bid word is called its customer depth).
In general, there are three ways to expand a query word. The first is phrase matching: substrings of the user's query word are taken as recommended words according to their relevance. For example, "Nokia mobile phone" can be expanded to "Nokia" and "mobile phone", where "Nokia" is more relevant than "mobile phone".
The second is containment expansion: the query is expanded to more informative words that contain the word the user searched for. For example, when a user searches for "mobile phone", the query can be expanded to "Samsung mobile phone", "HTC mobile phone" and so on. To do this, the original word is segmented, an inverted index of words is consulted to find all words that contain the original word, and the N best words by relevance are taken as the recommendation result.
The third is statistics-based associative recommendation: using the website's search logs, similar search words are found from users' search behavior and clustered, and related words are recommended. For example, a search for "mobile phone" can recommend "iPhone".
The present application likewise expands the query word input by the client, first obtaining the expanded candidate word set.
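Of the three expansion methods, containment expansion is the one described in enough detail to sketch: segment the query, look up an inverted index, and keep the N best words. The code below is an illustrative sketch only. It assumes whitespace tokenization (Chinese text would need a real segmenter), and it ranks by "fewest extra tokens" as a stand-in for the unspecified relevance measure; the function names and bid words are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Set


def build_inverted_index(bid_words: List[str]) -> Dict[str, Set[str]]:
    """Map each token to the set of bid words that contain it."""
    index: Dict[str, Set[str]] = defaultdict(set)
    for word in bid_words:
        for token in word.lower().split():
            index[token].add(word)
    return index


def containment_expand(query: str, index: Dict[str, Set[str]],
                       top_n: int = 3) -> List[str]:
    """Return up to top_n bid words that contain every token of the query."""
    tokens = query.lower().split()
    if not tokens:
        return []
    postings = [index.get(t, set()) for t in tokens]
    # A bid word qualifies only if it appears in the posting list of every token.
    matches = set.intersection(*postings) - {query}
    # Stand-in relevance: fewer extra tokens means closer to the original query.
    return sorted(matches, key=lambda w: (len(w.split()), w))[:top_n]


bid_words = [
    "Samsung mobile phone",
    "HTC mobile phone",
    "mobile phone",
    "Nokia N97",
    "mobile phone case for HTC",
]
index = build_inverted_index(bid_words)
expanded = containment_expand("mobile phone", index, top_n=2)
print(expanded)
```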
Step 120, analyze the real-time search engine log and/or the advertiser status information at the current time to obtain an effective word set; the advertiser status information includes display condition information for the keywords selected by advertisers; the effective word set includes the keywords that meet their display conditions at the current time.
Because a search engine, for performance reasons, limits the maximum number of search words, it selects only the most relevant ones. Among the expanded words obtained in the prior art, some may be keywords already selected by advertisers (that is, bid words). Since an advertiser sets display conditions for the advertisements it wants to promote, the keywords it selects carry those display conditions as well. If the search engine's current display environment does not meet the conditions set by the advertiser, for example because the advertisement display time period set by the advertiser does not include the current time, or because the advertiser's current account balance is below the balance it set for display, then even if the search engine ultimately selects that keyword, it cannot recall the corresponding advertiser's advertisement for display. The retrieval work is wasted and the search engine's efficiency drops.
In the present application, by contrast, a real-time effective word set is first obtained for the candidate word set described above. The effective word set includes the keywords that meet their display conditions at the current time. For "Nokia", for example, this means there is an advertiser that has selected the word, whose advertisement display time period includes the current time, and whose account balance for paying promotion fees is greater than the threshold it set. Promotion on search engines is generally charged to advertisers through a CPC (cost per click) system.
The effective word set is generally obtained by analyzing the real-time search engine log and/or the advertiser status information at the current time.
Preferably, the user status information (that is, the advertiser status information) is obtained through the following steps:
Step S11, obtain, from a data storage server, the database tables corresponding to each advertiser, the database tables including the advertiser's advertising budget and the advertiser's advertisement promotion time.
Further, this includes: obtaining, from the data storage server, the database tables corresponding to each advertiser, the database tables including the advertiser's advertising budget table and advertisement promotion schedule table.
The present application works through associations between database tables. In general, two factors determine whether an advertiser's advertisement is online. The first is budget, that is, the limit the advertiser sets on its account balance; for example, once the account balance falls below 300, the advertisement is no longer allowed to be displayed. The second is promotion time, that is, the time period the advertiser sets for promotion, for example 11:00 to 13:00. The database tables corresponding to each advertiser can therefore be obtained from the data storage server and analyzed; more specifically, the effective word set can be obtained from the advertising budget table and the promotion schedule table, that is, the keywords (bid words) that have effective advertiser depth are obtained. Effective advertiser depth is defined as follows: among the advertisers that have purchased a bid word, the number whose advertisements can be displayed online through that bid word at a given moment, that is, the number of advertisers whose advertisements can be recalled from the advertisement library through that bid word for display at that moment.
Preferably, after the database tables corresponding to the advertisers are obtained, the method includes:
Step S12, analyze the advertising budget table and the advertisement promotion schedule table of each advertiser and, for each advertiser whose advertising budget is greater than a threshold and whose promotion time period contains the current time, add the keywords selected by that advertiser to the effective word set.
That is, the effective word set is obtained by analysis, through the association of the advertising budget table and the promotion schedule table with the keywords the advertisers have selected.
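Steps S11 and S12 amount to joining three per-advertiser tables and keeping the keywords of advertisers that pass both checks. The sketch below models the tables as in-memory dictionaries; this schema, the function name `effective_words_from_status` and the sample advertisers X and Y are assumptions for illustration, since the text does not describe the table layout.

```python
from datetime import time
from typing import Dict, List, Set, Tuple


def effective_words_from_status(
    budget_table: Dict[str, Tuple[float, float]],   # advertiser -> (balance, threshold)
    schedule_table: Dict[str, Tuple[time, time]],   # advertiser -> (start, end)
    keyword_table: Dict[str, List[str]],            # advertiser -> selected keywords
    now: time,
) -> Set[str]:
    """Collect the keywords of every advertiser whose balance exceeds its
    threshold and whose promotion window contains the current time."""
    effective: Set[str] = set()
    for advertiser, (balance, threshold) in budget_table.items():
        window = schedule_table.get(advertiser)
        if window is None:
            continue  # no schedule on record: treated here as not displayable
        start, end = window
        if balance > threshold and start <= now <= end:
            effective.update(keyword_table.get(advertiser, []))
    return effective


effective = effective_words_from_status(
    budget_table={"X": (1000, 300), "Y": (100, 400)},
    schedule_table={"X": (time(9, 0), time(18, 0)), "Y": (time(9, 0), time(18, 0))},
    keyword_table={"X": ["Nokia N97"], "Y": ["Nokia"]},
    now=time(14, 0),
)
print(effective)
```

The window check in this sketch does not handle promotion periods that cross midnight; a real system would also need to decide how often the set is refreshed.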
Alternatively, the effective word set is obtained by analyzing the real-time search engine log through the following steps:
Step S21, obtain a real-time search log from the search engine;
Step S22, analyze whether each search keyword in the real-time search log has returned an advertisement to the client within the current time period;
Step S23, if it has, add the corresponding search keyword to the effective word set.
The search engine records, in real time, the query words users search for and the advertisements the engine displays at that time. This method therefore yields the number of advertisements returned in real time for users' query words during the period, that is, the real-time customer depth. In this process, if advertisements have been returned in real time for a query word or its expanded words, those words can be regarded as effective words and are added to the effective word set.
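Steps S21 to S23 can be sketched as a scan over recent log entries. The log format is not given in the text, so the `LogEntry` fields, the ten-minute window standing in for "the current time period", and the sample timestamps are all hypothetical.

```python
from datetime import datetime, timedelta
from typing import Iterable, NamedTuple, Set


class LogEntry(NamedTuple):
    timestamp: datetime
    keyword: str        # search keyword submitted to the ad retrieval engine
    ads_returned: int   # number of advertisements returned to the client


def effective_words_from_log(entries: Iterable[LogEntry], now: datetime,
                             window: timedelta = timedelta(minutes=10)) -> Set[str]:
    """Keep every keyword that returned at least one advertisement
    within the most recent time window."""
    cutoff = now - window
    return {
        e.keyword
        for e in entries
        if cutoff <= e.timestamp <= now and e.ads_returned > 0
    }


now = datetime(2024, 1, 1, 14, 0)
log = [
    LogEntry(datetime(2024, 1, 1, 13, 55), "Nokia N97", 2),
    LogEntry(datetime(2024, 1, 1, 13, 58), "Nokia", 0),         # recent, but no ads
    LogEntry(datetime(2024, 1, 1, 12, 0), "mobile phone", 3),   # ads, but too old
]
recent_effective = effective_words_from_log(log, now)
print(recent_effective)
```

Under the "and/or" wording of step 120, a set obtained this way could be combined with one derived from advertiser status information; how the two are combined is not specified in the text.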
Step 130: use the effective word set to filter out the words in the candidate word set that are not in the effective word set.
Once the effective word set has been obtained, the candidate word set can be processed by removing every candidate word that is not in the effective word set. This removes words with no customer depth, that is, words that cannot recall any advertiser's advertisement for display. For example, suppose the current time is 14:00 and the keyword "Nokia" has been purchased by advertisers A, B, and C:
For A, the advertisement display time set for this keyword is 11:00-13:00, the account balance is 1000, and the account balance threshold set by A is 300;
For B, the advertisement display time set for this keyword is 11:00-15:00, the account balance is 100, and the account balance threshold set by B is 400;
For C, the advertisement display time set for this keyword is 18:00-21:00, the account balance is 200, and the account balance threshold set by C is 300;
Then, for A, although the account balance exceeds A's threshold of 300, the current time falls outside A's display period, so the keyword "Nokia" cannot recall A's "Nokia" advertisement for display;
For B, although the current time falls within B's display period, the account balance is below B's threshold of 400, so the keyword "Nokia" cannot recall B's "Nokia" advertisement for display;
For C, the current time falls outside C's display period and the account balance is below C's threshold of 300, so the keyword "Nokia" cannot recall C's "Nokia" advertisement for display.
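The "Nokia" example can be checked mechanically. The following is a minimal Python sketch, not the patent's implementation: the `AdvertiserState` record, its field names, and the helper functions are hypothetical, and treating the display period as inclusive at both ends is an assumption.

```python
from dataclasses import dataclass
from datetime import time


@dataclass
class AdvertiserState:
    """Hypothetical per-keyword advertiser record (field names are assumptions)."""
    name: str
    start: time       # start of the display period
    end: time         # end of the display period (assumed inclusive)
    balance: float    # current account balance
    threshold: float  # minimum balance set by the advertiser


def can_display(adv, now):
    """True if `now` is inside the display period and the balance exceeds the threshold."""
    return adv.start <= now <= adv.end and adv.balance > adv.threshold


def is_effective(advertisers, now):
    """A keyword is effective if at least one advertiser can display an ad for it now."""
    return any(can_display(adv, now) for adv in advertisers)


def filter_candidates(candidates, effective_words):
    """Keep only the candidate words that are in the effective word set."""
    return [word for word in candidates if word in effective_words]


nokia_buyers = [
    AdvertiserState("A", time(11, 0), time(13, 0), 1000, 300),
    AdvertiserState("B", time(11, 0), time(15, 0), 100, 400),
    AdvertiserState("C", time(18, 0), time(21, 0), 200, 300),
]
now = time(14, 0)
keyword_buyers = {"Nokia": nokia_buyers}
effective_words = {kw for kw, buyers in keyword_buyers.items() if is_effective(buyers, now)}

print([can_display(adv, now) for adv in nokia_buyers])  # [False, False, False]
print(filter_candidates(["Nokia"], effective_words))    # []
```

Because none of A, B, or C meets both conditions at 14:00, "Nokia" does not enter the effective word set in this sketch and would be filtered out of the candidate word set.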
Preferably, after the effective word set has been used to filter out the words in the candidate word set that are not in the effective word set, the method further includes:
Step 131: for each candidate word in the filtered candidate word set, calculate the relevance between the candidate word and the query word;
For the filtered candidate words, the relevance between each candidate word and the query word can be calculated, for example, as literal text similarity plus topic similarity:
Literal text similarity, for example: for "Nokia mobile phone" and "Samsung mobile phone", the two terms are first segmented and tagged as "Nokia/brand", "mobile phone/product", "Samsung/brand", "mobile phone/product". Different tags carry different importance; for example, a product tag outweighs a brand tag, which in turn outweighs a modifier tag. A similarity score is then calculated from the segments the two terms share and the tags of those segments.
Topic similarity, for example: based on the website's own topic taxonomy (such as digital products, hardware tools, sports, entertainment), the topic tendency of each query word can be derived from the information that users click after searching for that query word. Each query word has one or more topics, and the similarity between two words is calculated from the number of topics they share.
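One way to picture the two similarity components is sketched below in Python. The source only states that product tags outweigh brand tags, which outweigh modifier tags, and that topic similarity depends on the number of matching topics; the numeric tag weights, the normalisation by the combined weight of all segments, and the Jaccard-style topic ratio are all assumptions made for illustration. Segmentation and tagging are assumed to have been done already.

```python
# Assumed tag weights: the source only gives the ordering product > brand > modifier.
TAG_WEIGHTS = {"product": 3.0, "brand": 2.0, "modifier": 1.0}


def literal_similarity(a: dict[str, str], b: dict[str, str]) -> float:
    """a and b map each segment of a term to its tag, e.g. {"Nokia": "brand"}."""
    shared = {seg for seg in a if seg in b and a[seg] == b[seg]}
    union = {**a, **b}
    total = sum(TAG_WEIGHTS[tag] for tag in union.values())
    matched = sum(TAG_WEIGHTS[a[seg]] for seg in shared)
    return matched / total if total else 0.0


def topic_similarity(topics_a: set[str], topics_b: set[str]) -> float:
    """Share of matching topics among all topics of the two words (assumed normalisation)."""
    all_topics = topics_a | topics_b
    return len(topics_a & topics_b) / len(all_topics) if all_topics else 0.0


def relevance(a, b, topics_a, topics_b) -> float:
    """Literal text similarity plus topic similarity, as in the example in the text."""
    return literal_similarity(a, b) + topic_similarity(topics_a, topics_b)


nokia_phone = {"Nokia": "brand", "mobile phone": "product"}
samsung_phone = {"Samsung": "brand", "mobile phone": "product"}

# Only "mobile phone/product" matches: weight 3 out of a combined weight of 7.
print(literal_similarity(nokia_phone, samsung_phone))
print(topic_similarity({"digital products"}, {"digital products", "entertainment"}))  # 0.5
```

Under these assumed weights, the shared product segment contributes more to the score than a shared brand segment would, which mirrors the tag ordering described in the text.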
Step 132: for each candidate word in the filtered candidate word set, count the effective advertiser depth of the candidate word; the effective advertiser depth is the number of advertisers who have selected the candidate word and who meet the display conditions at the current time;
Because the filtered candidate words have been screened against the effective word set, every candidate word has an effective advertiser depth, that is, the number of advertisements the word can recall for display at the current time. For example, for "laptop", suppose the current time is 12:00 and the customers who purchased the word are M1, M2, M3, M4, and M5. The display periods that M1, M2, M3, and M4 set for the advertisement corresponding to the word include 12:00, and the account balance of each of M1, M2, M3, and M4 is greater than the minimum threshold it set, whereas the display period set by M5 does not include 12:00. The effective advertiser depth of the word "laptop" is therefore 4.
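The depth count in the "laptop" example can be sketched as follows in Python. The dictionary keys and the concrete display periods and balances are illustrative assumptions; the source only states which advertisers qualify at 12:00.

```python
from datetime import time


def effective_depth(advertisers, now):
    """Count advertisers whose display period includes `now` and whose
    balance exceeds the threshold they set."""
    return sum(
        1
        for adv in advertisers
        if adv["start"] <= now <= adv["end"] and adv["balance"] > adv["threshold"]
    )


# Illustrative values only: M1-M4 qualify at 12:00, M5's period starts later.
laptop_buyers = [
    {"name": "M1", "start": time(9, 0), "end": time(18, 0), "balance": 500, "threshold": 100},
    {"name": "M2", "start": time(10, 0), "end": time(14, 0), "balance": 300, "threshold": 200},
    {"name": "M3", "start": time(0, 0), "end": time(23, 59), "balance": 1000, "threshold": 500},
    {"name": "M4", "start": time(11, 0), "end": time(13, 0), "balance": 250, "threshold": 50},
    {"name": "M5", "start": time(14, 0), "end": time(20, 0), "balance": 900, "threshold": 100},
]

print(effective_depth(laptop_buyers, time(12, 0)))  # 4
```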
Step 133: reorder the candidate words in the filtered candidate word set by combining the weight of each candidate word's relevance with the weight of its effective advertiser depth.
In practice, a weight is set for the relevance between a candidate word and the query word, and another weight is set for the candidate word's effective advertiser depth. Combining the two weights gives a total weight for each candidate word, and the candidate words can then be sorted by total weight. For example, if the relevance weight is a, the effective advertiser depth weight is b, and a candidate word P has relevance m to the query word and effective advertiser depth n, then the total weight of P is W = a*m + b*n.
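The formula W = a*m + b*n translates directly into a sort key. The sketch below is in Python; the default weight values and the sample candidates are assumptions chosen for illustration, and the depth is used as a raw count, so in practice the weights (or a normalisation step) would have to keep the two terms on comparable scales.

```python
def total_weight(m, n, a=1.0, b=0.1):
    """W = a*m + b*n, with m the relevance and n the effective advertiser depth."""
    return a * m + b * n


def rerank(candidates, a=1.0, b=0.1):
    """candidates: list of (word, relevance m, effective advertiser depth n).
    Returns the list sorted by total weight, highest first."""
    return sorted(candidates, key=lambda c: total_weight(c[1], c[2], a, b), reverse=True)


candidates = [("laptop", 0.9, 4), ("notebook", 0.8, 6), ("tablet", 0.4, 1)]
ranked = rerank(candidates)
print([word for word, _, _ in ranked])  # ['notebook', 'laptop', 'tablet']
```

Here "notebook" overtakes the slightly more relevant "laptop" because more advertisers can currently be recalled for it.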
In addition, the database table also includes an advertiser online status table, which indicates the advertiser's online status. Preferably, after the effective word set has been used to filter out the words in the candidate word set that are not in the effective word set, the method further includes:
Step S31: for each candidate word in the filtered candidate word set, calculate the relevance between the candidate word and the query word;
Step S32: for each candidate word in the filtered candidate word set, count the effective advertiser depth of the candidate word; the effective advertiser depth is the number of advertisers who have selected the candidate word and who meet the display conditions at the current time;
Step S33: count the number of online advertisers who have selected the candidate word and who meet the display conditions at the current time;
Step S34: reorder the candidate words in the filtered candidate word set by combining the weight of each candidate word's relevance, the weight of its effective advertiser depth, and its number of online advertisers.
The present application may also set a weight for the number of online advertisers of a candidate word and include it in the total weight calculation.
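Steps S31 to S34 can be read as adding a third term to the total weight. The Python sketch below assumes the simplest form, W = a*m + b*n + c*k, where c and k (the weight and count of online advertisers) are names introduced here for illustration and do not appear in the source; the `Candidate` record and the weight values are likewise hypothetical.

```python
from typing import NamedTuple


class Candidate(NamedTuple):
    """Hypothetical record for a filtered candidate word."""
    word: str
    relevance: float  # m: relevance to the query word
    depth: int        # n: effective advertiser depth
    online: int       # k: online advertisers meeting the display conditions


def total_weight(cand, a=1.0, b=0.1, c=0.05):
    """Assumed extension of the total weight: W = a*m + b*n + c*k."""
    return a * cand.relevance + b * cand.depth + c * cand.online


def rerank(candidates, a=1.0, b=0.1, c=0.05):
    """Sort candidates by the extended total weight, highest first."""
    return sorted(candidates, key=lambda cand: total_weight(cand, a, b, c), reverse=True)


candidates = [
    Candidate("laptop", 0.8, 3, 1),
    Candidate("notebook", 0.8, 3, 3),
]
print([cand.word for cand in rerank(candidates)])  # ['notebook', 'laptop']
```

With equal relevance and depth, the candidate with more online advertisers ranks first.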
Step 140: select at least one candidate word from the filtered candidate word set to retrieve advertisement information.
Preferably, at least one of the top-ranked candidate words is selected from the sorted candidate words described above and used to retrieve advertisement information in the search engine.
Referring to Fig. 2, which shows a schematic structural diagram of an advertisement information search system of the present application, the system includes:
a query word expansion module 210, used to expand the query word input by the user terminal to obtain an expanded candidate word set;
an effective word set acquisition module 220, used to analyze the search engine real-time log and/or the advertiser status information at the current time to obtain an effective word set; the advertiser status information includes display condition information for the keywords selected by advertisers; the effective word set includes the keywords that meet the display conditions at the current time;
a filtering module 230, used to filter out, by means of the effective word set, the words in the candidate word set that are not in the effective word set;
a retrieval module 240, used to select at least one candidate word from the filtered candidate word set to retrieve advertisement information.
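The four modules form a simple pipeline: expand, obtain the effective word set, filter, retrieve. The Python sketch below wires them together under stated assumptions: `search_ads`, the `expand` and `retrieve` stubs, and the in-memory `EXPANSIONS` and `AD_INDEX` tables are all hypothetical stand-ins, and the effective word set is passed in as if module 220 had already produced it.

```python
def search_ads(query, expand, effective_words, retrieve, top_n=1):
    """Run the expand -> filter -> retrieve flow for one query."""
    candidates = expand(query)                                   # module 210
    filtered = [w for w in candidates if w in effective_words]   # module 230
    selected = filtered[:top_n]                                  # choose at least one word
    ads = []
    for word in selected:                                        # module 240
        ads.extend(retrieve(word))
    return ads


# Hypothetical stand-ins for the expansion dictionary and the ad index.
EXPANSIONS = {"phone": ["Nokia", "Samsung", "mobile phone"]}
AD_INDEX = {"Samsung": ["samsung-ad-1"], "mobile phone": ["phone-ad-1", "phone-ad-2"]}


def expand(query):
    return EXPANSIONS.get(query, [query])


def retrieve(word):
    return AD_INDEX.get(word, [])


# "Nokia" is assumed to be absent from the effective word set at the current time.
effective_words = {"Samsung", "mobile phone"}
print(search_ads("phone", expand, effective_words, retrieve, top_n=2))
# ['samsung-ad-1', 'phone-ad-1', 'phone-ad-2']
```

Filtering before retrieval means no retrieval slot is spent on a word that could not return an advertisement.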
Preferably, the effective word set acquisition module includes:
a user status information acquisition sub-module, used to obtain from the data storage server the database table corresponding to each advertiser; the database table includes the advertiser's advertising expense budget and the advertiser's advertising promotion time.
The database table includes the advertiser's advertising expense budget table and the advertiser's advertising promotion schedule, which correspond to the advertiser's advertising expense budget and advertising promotion time respectively.
The effective word set acquisition module also includes:
a user status analysis sub-module, used to analyze the advertiser's advertising expense budget table and the advertiser's advertising promotion schedule and, for each advertiser whose advertising expense budget is greater than the threshold and whose advertising promotion period includes the current time, to add the keywords selected by that advertiser to the effective word set.
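The user status analysis sub-module can be sketched as a join over the two tables. In the Python sketch below, the table layouts are assumptions: the budget table is modelled as a mapping from advertiser to a (budget, threshold) pair, and the promotion schedule as rows of (advertiser, keyword, start, end). The function name is hypothetical.

```python
from datetime import time


def build_effective_words(budget_table, schedule_table, now):
    """Add a keyword when its advertiser's budget exceeds the threshold
    and the promotion period includes the current time."""
    effective = set()
    for advertiser, keyword, start, end in schedule_table:
        budget, threshold = budget_table[advertiser]
        if budget > threshold and start <= now <= end:
            effective.add(keyword)
    return effective


# Assumed layouts: advertiser -> (budget, threshold); (advertiser, keyword, start, end).
budget_table = {"A": (1000, 300), "B": (100, 400), "D": (800, 200)}
schedule_table = [
    ("A", "Nokia", time(11, 0), time(13, 0)),
    ("B", "Nokia", time(11, 0), time(15, 0)),
    ("D", "Samsung", time(9, 0), time(17, 0)),
]

print(build_effective_words(budget_table, schedule_table, time(14, 0)))  # {'Samsung'}
```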
The effective word set acquisition module includes:
a real-time log acquisition sub-module, used to obtain the real-time search log from the search engine;
a real-time log analysis sub-module, used to analyze whether each search keyword in the real-time search log returned an advertisement to the user terminal within the current time period; if so, the corresponding search keyword is added to the effective word set.
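The log-based path can be sketched as a scan over recent log entries. The Python sketch below assumes each entry is a (timestamp, keyword, number of ads returned) tuple and that the "current time period" is a fixed look-back window; the entry layout, the function name, and the 10-minute default are all assumptions, not details given in the source.

```python
from datetime import datetime, timedelta


def effective_words_from_log(log, now, window=timedelta(minutes=10)):
    """Collect keywords that returned at least one ad within the look-back window."""
    cutoff = now - window
    return {
        keyword
        for timestamp, keyword, ads_returned in log
        if cutoff <= timestamp <= now and ads_returned > 0
    }


# Assumed entry layout: (timestamp, search keyword, number of ads returned).
log = [
    (datetime(2012, 4, 5, 13, 55), "Samsung", 2),  # recent, returned ads
    (datetime(2012, 4, 5, 13, 58), "Nokia", 0),    # recent, no ads returned
    (datetime(2012, 4, 5, 12, 0), "laptop", 3),    # returned ads, but too old
]
now = datetime(2012, 4, 5, 14, 0)

print(effective_words_from_log(log, now))  # {'Samsung'}
```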
Referring to Fig. 3, which shows a schematic structural diagram of a preferred advertisement information search system of the present application, the system includes:
a user query word processing server 310, a search engine server 320, and a data storage server 330;
The user query word processing server 310 is used to receive the user's original query word obtained by the front end of the search engine server 320, to process the query word using the advertiser status information stored in the data storage server 330 and/or the real-time search log of the search engine server 320, and to return the resulting words to the search engine server 320.
The user query word processing server 310 includes: a query word expansion module, used to expand the query word input by the user terminal to obtain an expanded candidate word set;
an effective word set acquisition module, used to analyze the advertiser status information at the current time to obtain an effective word set; the advertiser status information includes display condition information for the keywords selected by advertisers; the effective word set includes the keywords that meet the display conditions at the current time;
a filtering module, used to filter out, by means of the effective word set, the words in the candidate word set that are not in the effective word set.
The search engine server 320 is used to pass the original query word input by the user to the user query word processing server 310, and to perform the final advertisement retrieval based on the search words returned by the user query word processing server 310. The search engine server includes a retrieval module, used to select at least one candidate word from the filtered candidate word set to retrieve advertisement information.
The data storage server 330 is used to store the advertiser status information.
Because the system embodiment is basically similar to the method embodiment, it is described only briefly; for related details, refer to the corresponding parts of the method embodiment.
The embodiments in this specification are described in a progressive manner. Each embodiment focuses on its differences from the other embodiments, and the parts that are the same or similar across embodiments can be cross-referenced.
The present application is described with reference to flowcharts and/or block diagrams of methods, devices (systems), and computer program products according to embodiments of the present application. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and any combination of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce an apparatus for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to operate in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture that includes an instruction apparatus, and the instruction apparatus implements the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams. These computer program instructions may also be loaded onto a computer or other programmable data processing device, so that a series of operational steps is performed on the computer or other programmable device to produce computer-implemented processing, and the instructions executed on the computer or other programmable device thereby provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
Although preferred embodiments of the present application have been described, those skilled in the art can make additional changes and modifications to these embodiments once they learn of the basic inventive concept. The appended claims are therefore intended to be interpreted as covering the preferred embodiments and all changes and modifications that fall within the scope of the present application.
The advertisement information search method and system provided by the present application have been described in detail above. Specific examples are used herein to explain the principles and implementations of the present application, and the description of the above embodiments is intended only to help in understanding the method of the present application and its core idea. At the same time, a person of ordinary skill in the art may, following the idea of the present application, make changes to the specific implementations and the scope of application. In summary, the content of this specification should not be construed as limiting the present application.
Claims (14)
- 1. An advertisement information search method, characterized by comprising: expanding a query word input by a user terminal to obtain an expanded candidate word set; analyzing the search engine real-time log and/or the advertiser status information at the current time to obtain an effective word set, wherein the advertiser status information includes display condition information for the keywords selected by advertisers, and the effective word set includes the keywords that meet the display conditions at the current time; using the effective word set to filter out the words in the candidate word set that are not in the effective word set, wherein a word not in the effective word set is a word that cannot recall an advertiser's advertisement for display; and selecting at least one candidate word from the filtered candidate word set to retrieve advertisement information.
- 2. The method according to claim 1, characterized in that the user status information is obtained by the following step: obtaining, from a data storage server, the database table corresponding to each advertiser, the database table including the advertiser's advertising expense budget and the advertiser's advertising promotion time.
- 3. The method according to claim 2, characterized in that: the database table includes the advertiser's advertising expense budget table and the advertiser's advertising promotion schedule, which correspond to the advertiser's advertising expense budget and advertising promotion time respectively.
- 4. The method according to claim 3, characterized in that, after the database table corresponding to an advertiser is obtained, the method includes: analyzing the advertiser's advertising expense budget table and the advertiser's advertising promotion schedule and, for each advertiser whose advertising expense budget is greater than a threshold and whose advertising promotion period includes the current time, adding the keywords selected by that advertiser to the effective word set.
- 5. The method according to claim 1, characterized in that the search engine real-time log is analyzed to obtain the effective word set by the following steps: obtaining the real-time search log from the search engine; analyzing whether each search keyword in the real-time search log returned an advertisement to the user terminal within the current time period; and, if so, adding the corresponding search keyword to the effective word set.
- 6. The method according to claim 1, characterized in that, after the effective word set is used to filter out the words in the candidate word set that are not in the effective word set, the method further includes: for each candidate word in the filtered candidate word set, calculating the relevance between the candidate word and the query word; for each candidate word in the filtered candidate word set, counting the effective advertiser depth of the candidate word, the effective advertiser depth being the number of advertisers who have selected the candidate word and who meet the display conditions at the current time; and reordering the candidate words in the filtered candidate word set by combining the weight of each candidate word's relevance with the weight of its effective advertiser depth.
- 7. The method according to claim 3 or 4, characterized in that: the database table also includes an advertiser online status table.
- 8. The method according to claim 1, characterized in that, after the effective word set is used to filter out the words in the candidate word set that are not in the effective word set, the method further includes: for each candidate word in the filtered candidate word set, calculating the relevance between the candidate word and the query word; for each candidate word in the filtered candidate word set, counting the effective advertiser depth of the candidate word, the effective advertiser depth being the number of advertisers who have selected the candidate word and who meet the display conditions at the current time; counting the number of online advertisers who have selected the candidate word and who meet the display conditions at the current time; and reordering the candidate words in the filtered candidate word set by combining the weight of each candidate word's relevance, the weight of its effective advertiser depth, and its number of online advertisers.
- 9. The method according to claim 1, characterized in that expanding the query word input by the user terminal to obtain the expanded candidate word set includes: obtaining the keywords newly selected by advertisers and adding the keywords to an expansion dictionary; and calculating the relevance between the query word and each keyword in the expansion dictionary and, when the relevance is greater than a threshold, adding the keyword to the candidate word set.
- 10. An advertisement information search system, characterized by comprising: a query word expansion module, used to expand a query word input by a user terminal to obtain an expanded candidate word set; an effective word set acquisition module, used to analyze the search engine real-time log and/or the advertiser status information at the current time to obtain an effective word set, wherein the advertiser status information includes display condition information for the keywords selected by advertisers, and the effective word set includes the keywords that meet the display conditions at the current time; a filtering module, used to filter out, by means of the effective word set, the words in the candidate word set that are not in the effective word set, wherein a word not in the effective word set is a word that cannot recall an advertiser's advertisement for display; and a retrieval module, used to select at least one candidate word from the filtered candidate word set to retrieve advertisement information.
- 11. The system according to claim 10, characterized in that the effective word set acquisition module includes: a user status information acquisition sub-module, used to obtain from a data storage server the database table corresponding to each advertiser, the database table including the advertiser's advertising expense budget and the advertiser's advertising promotion time.
- 12. The system according to claim 11, characterized in that: the database table includes the advertiser's advertising expense budget table and the advertiser's advertising promotion schedule, which correspond to the advertiser's advertising expense budget and advertising promotion time respectively.
- 13. The system according to claim 12, characterized in that the effective word set acquisition module also includes: a user status analysis sub-module, used to analyze the advertiser's advertising expense budget table and the advertiser's advertising promotion schedule and, for each advertiser whose advertising expense budget is greater than a threshold and whose advertising promotion period includes the current time, to add the keywords selected by that advertiser to the effective word set.
- 14. The system according to claim 10, characterized in that the effective word set acquisition module includes: a real-time log acquisition sub-module, used to obtain the real-time search log from the search engine; and a real-time log analysis sub-module, used to analyze whether each search keyword in the real-time search log returned an advertisement to the user terminal within the current time period and, if so, to add the corresponding search keyword to the effective word set.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201210097984.0A CN103365904B (en) | 2012-04-05 | 2012-04-05 | A kind of advertising message searching method and system |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201210097984.0A CN103365904B (en) | 2012-04-05 | 2012-04-05 | A kind of advertising message searching method and system |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN103365904A (en) | 2013-10-23 |
| CN103365904B (en) | 2018-01-09 |
Family
ID=49367268
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201210097984.0A Active CN103365904B (en) | 2012-04-05 | 2012-04-05 | A kind of advertising message searching method and system |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN103365904B (en) |
Families Citing this family (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105446984A (en) * | 2014-06-30 | 2016-03-30 | 阿里巴巴集团控股有限公司 | Expansion word pair screening method and device |
| CN104702597A (en) * | 2015-02-09 | 2015-06-10 | 北京博雅立方科技有限公司 | Information obtaining method and device |
| CN104715022B (en) * | 2015-02-28 | 2018-07-31 | 北京奇艺世纪科技有限公司 | A kind of related search method and device |
| CN106557480B (en) * | 2015-09-25 | 2020-07-07 | 阿里巴巴集团控股有限公司 | Implementation method and device for query rewriting |
| CN106708886B (en) * | 2015-11-17 | 2020-08-11 | 北京国双科技有限公司 | Display method and device for in-site search words |
| CN106294688A (en) * | 2016-08-05 | 2017-01-04 | 浪潮软件集团有限公司 | A query expansion method, device and system based on user characteristic analysis |
| CN107038608A (en) * | 2017-04-21 | 2017-08-11 | 北京恒冠网络数据处理有限公司 | A kind of big data analysis system |
| CN110020053A (en) * | 2017-12-20 | 2019-07-16 | 北京奇虎科技有限公司 | Acquisition methods, device and the computer readable storage medium of advertisement main information |
| CN110309355B (en) * | 2018-06-15 | 2023-05-16 | 腾讯科技(深圳)有限公司 | Content tag generation method, device, equipment and storage medium |
| CN109815392A (en) * | 2018-12-17 | 2019-05-28 | 北京三快在线科技有限公司 | Scenario-based merchant recall method, device, electronic device, and readable storage medium |
| CN113763010A (en) * | 2020-11-19 | 2021-12-07 | 北京沃东天骏信息技术有限公司 | Information pushing method and device |
| CN112686717B (en) * | 2021-03-11 | 2021-07-02 | 腾讯科技(深圳)有限公司 | Data processing method and system for advertisement recall |
| CN116976993A (en) * | 2023-07-09 | 2023-10-31 | 郑州易赛诺信息技术有限公司 | An advertising delivery processing method and system based on user matching |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101034997A (en) * | 2006-03-09 | 2007-09-12 | 新数通兴业科技(北京)有限公司 | Method and system for accurately publishing the data information |
| CN101331487A (en) * | 2005-12-15 | 2008-12-24 | 微软公司 | Advertising keyword cross-selling |
| CN101488209A (en) * | 2008-01-18 | 2009-07-22 | 里斯特有限公司 | Advertisement broadcasting method of individual media |
| CN101673385A (en) * | 2009-09-28 | 2010-03-17 | 百度在线网络技术(北京)有限公司 | Consumption preliminary estimate method and device thereof |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101266603B (en) * | 2007-03-12 | 2010-09-08 | 北京搜狗科技发展有限公司 | Webpage information sorting method, system and service system applying the classification |
| CN100507920C (en) * | 2007-05-25 | 2009-07-01 | 清华大学 | Search engine retrieving result reordering method based on user behavior information |
| CN101887437B (en) * | 2009-05-12 | 2016-03-30 | 阿里巴巴集团控股有限公司 | A kind of Search Results generation method and information search system |
Also Published As
| Publication number | Publication date |
|---|---|
| CN103365904A (en) | 2013-10-23 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| CN103365904B (en) | A kind of advertising message searching method and system | |
| US11843651B2 (en) | Personalized recommendation method and system, and terminal device | |
| JP5478889B2 (en) | Advertising keyword cross-selling | |
| US10025807B2 (en) | Dynamic data acquisition method and system | |
| CN102902753B (en) | Method and device for completing search terms and building individual interest models | |
| AU2011240953B2 (en) | Search advertisement selection based on user actions | |
| US8666980B1 (en) | Virtual clickstream recommendations | |
| CN103984705B (en) | A kind of methods of exhibiting of search result, device and system | |
| US20140012840A1 (en) | Generating search results | |
| US20230005022A1 (en) | Structured informational link annotations | |
| CN106062743A (en) | Systems and methods for keyword suggestion | |
| CN101681352A (en) | Identify insufficient search content | |
| CN104036038A (en) | News recommendation method and system | |
| WO2008039768A2 (en) | A system and method for providing medical disposition sensitive content | |
| EP1678573A2 (en) | Automatically targeting web-based advertisements | |
| CN106445963B (en) | Advertisement index keyword automatic generation method and device of APP platform | |
| CN104978368A (en) | Method and device used for providing recommendation information | |
| JP2015521301A (en) | Generate ad campaign | |
| CN104077327B (en) | The recognition methods of core word importance and equipment and search result ordering method and equipment | |
| CN103198098A (en) | Network information transfer method and device | |
| CN106484698A (en) | A kind of method for pushing of search keyword and device | |
| CN113763107A (en) | Object information pushing method, device, equipment and storage medium | |
| CN106844365A (en) | The application message method for pushing and device of a kind of application distribution platform | |
| CN107562756B (en) | Method and equipment for determining object information | |
| US7716209B1 (en) | Automated advertisement publisher identification and selection |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | C06 | Publication | |
| | PB01 | Publication | |
| | C10 | Entry into substantive examination | |
| | SE01 | Entry into force of request for substantive examination | |
| | REG | Reference to a national code | Ref country code: HK; Ref legal event code: DE; Ref document number: 1186796; Country of ref document: HK |
| | GR01 | Patent grant | |
| | REG | Reference to a national code | Ref country code: HK; Ref legal event code: GR; Ref document number: 1186796; Country of ref document: HK |