CN106998262A - A kind of System and method for for recognizing Internet user - Google Patents

A kind of System and method for for recognizing Internet user Download PDF

Info

Publication number
CN106998262A
CN106998262A CN201610882549.7A CN201610882549A CN106998262A CN 106998262 A CN106998262 A CN 106998262A CN 201610882549 A CN201610882549 A CN 201610882549A CN 106998262 A CN106998262 A CN 106998262A
Authority
CN
China
Prior art keywords
user
data
traffic data
user characteristics
identification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610882549.7A
Other languages
Chinese (zh)
Inventor
张大炜
雷葆华
吉晓峰
张洋硕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen World Network Technology Co Ltd
Original Assignee
Shenzhen World Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen World Network Technology Co Ltd filed Critical Shenzhen World Network Technology Co Ltd
Priority to CN201610882549.7A priority Critical patent/CN106998262A/en
Publication of CN106998262A publication Critical patent/CN106998262A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/50Network service management, e.g. ensuring proper service fulfilment according to agreements
    • H04L41/5061Network service management, e.g. ensuring proper service fulfilment according to agreements characterised by the interaction between service providers and their network customers, e.g. customer relationship management

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a kind of method for recognizing Internet user, methods described includes:Obtain the user traffic data gathered in network link equipment;User traffic data to acquisition is handled, and cleans the user traffic data after the data unrelated with user characteristics are handled;The user traffic data after the processing is analyzed, user characteristics therein is extracted;All user characteristicses of unique user are associated, the customer relationship chain of unique user is formed.By gathering the customer flow information on network link, and handled above- mentioned information and analyzed the mark scope expanded for user characteristics, and for being analyzed based on the user characteristics in above-mentioned flow information, the accuracy for user's identification is improved, and has broken limitation in the prior art for user's identification.

Description

A kind of System and method for for recognizing Internet user
Technical field
This patent belongs to internet arena, is related to a kind of System and method for for recognizing Internet user.
Background technology
Currently, the Working Life of internet and people, all trades and professions in society are closely merged.Each user is using mutual More or less " vestige " is all left when in networking, such as user passes through internet in the different time, using different equipment Similar " vestige " can all be left with website by accessing different applications.
If we can be based on these " vestiges ", different features is extracted from every vestige to identify this user, Set up feature recognition storehouse for the user, this will draw a portrait in accurate user, the precision marketing of inter-network station and application, market survey, with And the field such as network air control possesses very high value.
In internet, there is the technological means of mark unique subscriber in the prior art, for example:Used at PC ends Cookie mapping carry out user's mark, i.e., the user surfed the Net by cookie mapping methods to PC ends enters mark; In mobile terminal user is identified using MAC Address, iOS IDFA, Android ID and IMEI etc..
Although the above method is solving the problem of user identifies, skill of the prior art with scope to a certain degree Art scheme still has following shortcoming:
First, method coverage of the prior art is not enough:If dependent on SSP, DSP, Ad Exchange and media net Information when carrying out advertising business of standing exchanges to set up cookie mapping, it is contemplated that the channel of advertisement putting, media covering Rate etc., the program is difficult that the cookie mapping of progress user's universe internet access and inter-network station are got through.On the other hand, Tripartite's statistics company can not also cover all websites and code implant, so as to carry out universe cookie mapping.Secondly, use To identify the ID species deficiency of user:Except cookie and mobile device unique mark (IMEI, IDFA, Android ID, Mac), there are many strong Property IDs can be for mark user, for example, user is in online media sites, social network sites, electric business website Register account number, the cell-phone number of user, E-mail address account, broadband internet-access account number etc., in the future as the increase of terminal form, can also There are more strong Property IDs to be used for identifying user.Many strong schemes of the Property ID based on prior art can not all be recognized completely.The Three, unrealized striding equipment gets through that scheme of the prior art all can only be based respectively on PC ends and mobile terminal is identified to user, And Internet user may use plurality of devices in actual conditions.In addition, the technical scheme of existent technique can not also ensure User characteristics home banking upgrades in time, and for an online media sites, it is to have a life cycle in the cookie of user terminal , if the cookie rules of certain online media sites are changed, the scheme of prior art is necessarily dependent upon again to the user at this Media deliver advertisement, feature recognition storehouse could be updated accordingly, this will influence the development of related service.
The content of the invention
This patent is based on the demand of the prior art and proposed, this patent technical problem to be solved is to carry For a kind of System and method for for recognizing Internet user, in order to be able to improve the accuracy of identification user and expand what is be applicable Scope.
In order to solve the above problems, the technical scheme that this patent is provided includes:
A kind of method for recognizing Internet user, methods described includes:Obtain the customer flow gathered in network link equipment Data;User traffic data to acquisition is handled, the user's stream cleaned after the data unrelated with user characteristics are handled Measure data;The user traffic data after the processing is analyzed, user characteristics therein is extracted;Associate all users of unique user Feature, forms the customer relationship chain of unique user.
Preferably, this method also includes, and above-mentioned customer relationship chain is updated based on user characteristics.
Preferably, methods described includes:The user traffic data is included by setting flow optical splitter or using end The user traffic data that the mode of port mirror image is collected from interchanger.
Preferably, the user characteristics includes:Cookie signature identifications, device identification, user account mark, statistics engine Mark, SSP advertisement engines mark and geographical position mark;The user traffic data after the processing is analyzed, therein use is extracted Family feature includes, and the user traffic data after processing is analyzed, and identifies cookie signature identifications therein, equipment Mark, user account mark, statistics engine mark, SSP advertisement engines mark and geographical position mark, and extracted.
Preferably, all user characteristicses of association unique user include, and each user in the customer flow is special Relevance in the relevance levied and each user characteristics content associates the user characteristics.
A kind of device for recognizing Internet user is additionally provided according to the other side of this patent, described device includes:Number According to acquisition module, the user traffic data gathered in network link equipment is obtained;Data processing module, to the customer flow of acquisition Data are handled, and clean the user traffic data after the data unrelated with user characteristics are handled;User characteristics extracts mould Block, analyzes the user traffic data after the processing, extracts user characteristics therein;User characteristics relating module, is associated single All user characteristicses of user, form the customer relationship chain of unique user.
According to the other side of this patent there is provided the method for another identification Internet user, methods described includes: User traffic data is gathered in network link;User traffic data to collection is handled, including the user collected is flowed Amount data are pre-processed and cleaned, and wrong and invalid data are removed in the pretreatment, and the pretreatment is weeded out and user The unrelated data of feature;So as to the user traffic data after being handled;The user traffic data after the processing is analyzed, is extracted User characteristics therein;All user characteristicses of unique user are associated, the customer relationship chain of unique user is formed.
According to the other side of this patent, a kind of network information control method is additionally provided, methods described includes:Step First, user characteristics is obtained, the user characteristics includes the characteristic information for recognizing Internet user;Step 2: based on the user Feature and the customer relationship chain identification Internet user as any one of claim 1-11;Step 3: being used according to the online User characteristics in the customer relationship chain at family, controls the relevant information of the Internet user.
This patent is handled above- mentioned information and analyzed expansion by gathering the customer flow information on network link For the mark scope of user characteristics, and for being analyzed based on the user characteristics in above-mentioned flow information, improve For the accuracy of user's identification, and limitation in the prior art for user's identification is broken.
Brief description of the drawings
Fig. 1 is a kind of flow chart for the method for recognizing Internet user in this patent embodiment
Fig. 2 is the structural representation of customer relationship chain in this patent embodiment.
Embodiment
Specific implementation of the patent mode is described in detail below in conjunction with the accompanying drawings.It should be noted that the specific reality It is only the citing to this patent optimal technical scheme to apply mode, can not be interpreted as the limitation to this patent protection domain.
Embodiment one
A kind of method for recognizing Internet user is present embodiments provided, the scene that methods described is realized includes but is not limited to one Equipment is planted, for example, server, PC or mobile device;Cooperation either between the said equipment.Each of which is set It can include assisting work to realize this method between multiple individuals, each individual in standby.
As shown in figure 1, methods described comprises the following steps:
Step 1: obtaining the user traffic data gathered in network link equipment
In this step, the data of acquisition are collected in network link, for example, setting flow on switches Optical splitter has collected data on internet by the way of Port Mirroring.It can be obtained by way of directly transmitting Take above-mentioned data;For example the data collected in network link can also be stored and obtained again by indirect mode Obtained data.
It is further preferred that above-mentioned data on flows can be gathered in the present embodiment in multiple switch, then will be above-mentioned Data are collected, and then expand the scope of data acquisition.
Data in the network link are distinct from the data collected in user terminal, and it comes from network link and set It is standby, when same user is surfed the Net using different terminal devices, although information produced by each terminal device has certain area Not, but the data on each terminal device can be delivered in the network link equipment, thus in network link equipment The user traffic data collected can comprehensively reflect internet information of the user under each equipment or each environment, so that As the basis analyzed comprehensively.
In the present embodiment, the data for coming from the network link equipment can be by pretreated, certainly Can be without pretreated initial data.This can not influence the implementation of the present embodiment.
Step 2: the user traffic data to acquisition is handled
The data that network link equipment is collected contain the substantial amounts of information unrelated with identification Internet user, for example, The information related to operator, information related with network environment etc., above- mentioned information it is generally unrelated with user or with The degree of association at family is smaller, thus needs to clean for the above in the information that obtains in network link equipment.Cleaning The data volume of correlation is reduced afterwards, consequently facilitating carrying out the identification of user using effective data.
Therefore, carrying out processing to the user traffic data of acquisition in this step includes data cleansing, and data cleansing can be with Realized by rule-based mode, for example, pre-setting corresponding cleaning rule, the regular data will not met and picked Remove, and retain and meet the regular data.All there is the feature in more obvious content due to the information related to user, and And the information unrelated with user also has the feature in obvious content;Thus those skilled in the art can be according to specific feelings Condition sets the content of respective rule, therefore does not carry out detailed expansion to the cleaning rule in this embodiment.
In addition, in this step, the user traffic data progress processing to acquisition can also be included besides cleaning It is other be easy to analysis operations, for example characterize, or compression etc. processing.
The user traffic data after processing is produced after being handled the user traffic data of acquisition.
Step 3: strategy and rule base based on identification user carry out user characteristics identification
Include substantial amounts of user's characteristic information in user traffic data after processing, for the customer flow number after processing The basis as identification unique user is identified in each user characteristics in.
The identification for the user characteristics is strategy and rule base based on identification user to realize in this step. The strategy and rule base refer to the typelib of the predetermined feature related to user and recognize the plan of this feature type Slightly.For example, these characteristic types include but is not limited to:Cookie signature identifications, device identification, user account mark, statistics are drawn Mark and geographical position mark are held up, etc..These characteristic types all have respective data characteristicses, by for the data characteristicses Analysis so that it is determined that recognize the strategy of these user characteristicses, it is special with user in order to be extracted from the network link data of magnanimity Levy the related user characteristics of identification.
Specifically, for example:
The cookie is that the data for feeding back to user terminal (be usually browser) are generated by server end, and user terminal can be by Cookie data are saved in the text under some catalogue, ask next time just to send the cookie to clothes during same website Business device.The feature of some user is can be identified for that by cookie, thus extracts the cookie numbers that network link equipment is collected It is identified according to and to it, it is meaningful for identification user.
The device identification includes but is not limited to mobile device, generally has different device identifications on different devices Number, for example, there is the coding for uniquely recognizing the mobile phone on a certain mobile phone, thus extraction and identification for device identification It is related to identification user.And device identification often has specific data format, thus it can be incited somebody to action by the analysis of data format The said equipment is identified.
The statistics engine mark, SSP advertisement engines mark refer to a user in statistics engine or SSP advertisement engines Corresponding data, are either applied due to counting solicitous and SSP advertisement engines for a user in the website of a certain scope On carried out the identification and push of information.Because the source that statistics engine is identified and SSP advertisement engines are identified is with significant special Levy, it is thus possible to being identified in the data on flows for collect from network link equipment it, and based on statistics engine data It is identified with SSP advertisement engine data also meaningful for identification user.
The user account mark, refers to identify the account information of the user, and for identifying user.Due to Account of the user in some website either application is often to determine, thus identifies that the account can be identified for that out the use Family.The characteristics of account of user has each specific in different websites and different application, can be according to using under specific environment User account information in the setting Rule Extraction flow of family account.
The geographical position mark, geographical location information produced by referring to user in different websites or application, this A little information can be that geographic coordinate information can also be geographical location information (such as city selection) after selection etc..This A little geographical location information have reference significance for identification user.
In addition, the user characteristics can also include broadband account, cell-phone number, Mac addresses etc., these information can The flow analysis obtained as the characteristic information of identification user from network link is obtained, so as to be used as the feature of identification user.
Carried out due to information that can be all on statistics network, thus by the data on flows obtained from network link Feature recognition is stated, can realize that the user-association in the range of universe recognizes user in the range of universe.And it can also count on User is on online media sites, social network sites, the register account number of electric business website, the cell-phone number of user, E-mail address account and broadband The strong Property ID such as net account, for recognizing that the raising of user's degree of accuracy has significant meaning.In addition, user uses distinct device When (such as using PC and mobile phone), (such as email accounts QQ number) has identical content in some identification features, because And the user surfed the Net using distinct device can be recognized by the data on flows analyzed in network link.
Step 4: the related user characteristics of association sets up customer relationship chain
After the user characteristics for being extracted correlation, you can describe a certain user with the information reflected according to user characteristics, By represent this with various user characteristicses associate, so as to set up customer relationship chain.The customer relationship chain is to institute The accurate portrait of user is stated, so as to complete the identification to the user.
Wherein, associate related user characteristics to refer to, a variety of user characteristicses for representing same user are associated.Association Features described above can be determined by predetermined rule, for example, understood based on the excavation to user characteristics, in certain time period, The user characteristics that data traffic includes in same IP, it is by analyzing the content in each user characteristics that related user is special Levy and associate one user of description.Analysis can also be passed through in the user characteristics included by the data on flows in a certain equipment Related user characteristics is associated one user of description by the content in each user characteristics.Account etc. can additionally be passed through The ID of strong attribute corresponding relation, carrys out multiple user characteristicses in associate traffic data so as to describe a user.
By setting up the association of user characteristics, the mark for user can be both realized.The association of the user characteristics is shown Example property, as shown in Figure 2.After analysis data on flows, following customer relationship chain can be obtained, by taking user Mike as an example, Under Mike network data, by Mike PC, Mike mobile phone association gets up to get through the boundary between equipment, while by Mike Microblog account, QQ accounts, the cookie of website such as Baidu, Sohu etc. associates, and forms user Mike relation chain.
Step 5: updating customer relationship chain based on user characteristics
Due to the change of various factors, user characteristics can produce change in different times, and such as user can change mobile phone, more Change number etc..The change of these user characteristicses needs to be updated for customer relationship chain, in order to improve the standard of user's identification True degree.
In this step, in the renewal of customer relationship chain can be by determining to need to(for) the analysis of user characteristics content Content, the analysis of these contents can determine according to the data characteristicses in specific user characteristics.For example work as user characteristics When middle device identification changes, can be identified by analyzing the strong ID such as telephone number, account related to device identification so that Determine that the change is due to caused by user has changed mobile device, so that it is related to update device identification in customer relationship chain etc. User characteristics.The mode that customer relationship chain is updated in certain the present embodiment is not limited to that, when user characteristics changes Associated user's feature in customer relationship chain can also be replaced, increases or deleted according to other rules.
Embodiment two
A kind of method for recognizing Internet user is provided in the present embodiment, can be by means of in network link in this method Multiple equipment is realized to realize, or in part steps using the multiple equipment in the network link.The lattice chain Equipment in road includes light splitting machine, server etc..
Method in the present embodiment comprises the following steps:
Step 1: gathering user traffic data in network link
The user traffic data is collected in network link, for example, on the interchanger in network link Flow optical splitter is set or data on internet have been collected by the way of Port Mirroring.Preferably, in the present embodiment Above-mentioned data on flows can be gathered in multiple switch, is then collected above-mentioned data, and then expands data acquisition Scope.Data in the network link are distinct from the data collected in user terminal, and it comes from network link equipment, when When same user is surfed the Net using different terminal devices, although information produced by each terminal device has certain difference, but It is that data on each terminal device can be delivered in the network link equipment, thus is collected in network link equipment User traffic data can comprehensively reflect internet information of the user under each equipment or each environment so that as complete The basis of surface analysis.
Step 2: being handled for the user traffic data
In this step, processing is carried out to user traffic data includes pre-processing for data, that is, rejects error number According to, invalid data etc. substantially brings the data of noise, or masks and be substantially related to individual privacy or private data, so that It is easy to follow-up analysis and processing.
In addition, the user traffic data, which is handled, also to be included filtering out from pretreated data with recognizing The related data of network users.Such as pretreated data contain the information related to operator, related with network environment Information etc., above- mentioned information is generally unrelated with user or the degree of association of with user is smaller, thus needs for network link The above in the information obtained in equipment is cleaned.The data volume of correlation is reduced after cleaning, consequently facilitating using having The data of effect carry out the identification of user.
Step 3: strategy and rule base the identification user characteristics based on identification user
Include substantial amounts of user's characteristic information in user traffic data after processing, for the customer flow number after processing The basis as identification unique user is identified in each user characteristics in.
The identification for the user characteristics is strategy and rule base based on identification user to realize in this step. The strategy and rule base refer to the typelib of the predetermined feature related to user and recognize the plan of this feature type Slightly.For example, these characteristic types include but is not limited to:Cookie signature identifications, device identification, user account mark, statistics are drawn Mark, SSP advertisement engines mark and geographical position mark are held up, etc..These characteristic types all have respective data characteristicses, pass through For the data characteristicses analysis so that it is determined that recognize the strategy of these user characteristicses, in order to from the lattice chain way of magnanimity The user characteristics related to user characteristics identification is extracted according to middle.
Specifically, for example:
The cookie is that the data for feeding back to user terminal (be usually browser) are generated by server end, and user terminal can be by Cookie data are saved in the text under some catalogue, ask next time just to send the cookie to clothes during same website Business device.The feature of some user is can be identified for that by cookie, thus extracts the cookie numbers that network link equipment is collected It is identified according to and to it, it is meaningful for identification user.
The device identification includes but is not limited to mobile device, generally has different device identifications on different devices Number, for example, there is the coding for uniquely recognizing the mobile phone on a certain mobile phone, thus extraction and identification for device identification It is related to identification user.And device identification often has specific data format, thus it can be incited somebody to action by the analysis of data format The said equipment is identified.
The statistics engine mark, SSP advertisement engines mark refer to a user in statistics engine or SSP advertisement engines Corresponding data, are either applied due to counting solicitous and SSP advertisement engines for a user in the website of a certain scope On carried out the identification and push of information.Because the source that statistics engine is identified and SSP advertisement engines are identified is with significant special Levy, it is thus possible to being identified in the data on flows for collect from network link equipment it, and based on statistics engine data It is identified with SSP advertisement engine data also meaningful for identification user.
The user account mark, refers to identify the account information of the user, and for identifying user.Due to Account of the user in some website either application is often to determine, thus identifies that the account can be identified for that out the use Family.The characteristics of account of user has each specific in different websites and different application, can be according to using under specific environment User account information in the setting Rule Extraction flow of family account.
The geographical position mark, geographical location information produced by referring to user in different websites or application, this A little information can be that geographic coordinate information can also be geographical location information (such as city selection) after selection etc..This A little geographical location information have reference significance for identification user.
In addition, the user characteristics can also include broadband account, cell-phone number, Mac addresses etc., these information can The flow analysis obtained as the characteristic information of identification user from network link is obtained, so as to be used as the feature of identification user.
Carried out due to information that can be all on statistics network, thus by the data on flows obtained from network link Feature recognition is stated, can realize that the user-association in the range of universe recognizes user in the range of universe.And it can also count on User is on online media sites, social network sites, the register account number of electric business website, the cell-phone number of user, E-mail address account and broadband The strong Property ID such as net account, for recognizing that the raising of user's degree of accuracy has significant meaning.In addition, user uses distinct device When (such as using PC and mobile phone), (such as email accounts QQ number) has identical content in some identification features, because And the user surfed the Net using distinct device can be recognized by the data on flows analyzed in network link.
Step 4: the related user characteristics of association sets up customer relationship chain
After the user characteristics for being extracted correlation, you can describe a certain user with the information reflected according to user characteristics, By represent this with various user characteristicses associate, so as to set up customer relationship chain.The customer relationship chain is to institute The accurate portrait of user is stated, so as to complete the identification to the user.
Wherein, associate related user characteristics to refer to, a variety of user characteristicses for representing same user are associated.Association Features described above can be determined by predetermined rule, for example, understood based on the excavation to user characteristics, in certain time period, The user characteristics that data traffic includes in same IP, it is by analyzing the content in each user characteristics that related user is special Levy and associate one user of description.Analysis can also be passed through in the user characteristics included by the data on flows in a certain equipment Related user characteristics is associated one user of description by the content in each user characteristics.Account etc. can additionally be passed through The ID of strong attribute corresponding relation, carrys out multiple user characteristicses in associate traffic data so as to describe a user.
By setting up the association of user characteristics, the mark for user can be both realized.The association of the user characteristics is shown Example property, as shown in Figure 2.After analysis data on flows, following customer relationship chain can be obtained, by taking user Mike as an example, Under Mike network data, by Mike PC, Mike mobile phone association gets up to get through the boundary between equipment, while by Mike Microblog account, QQ accounts, the cookie of website such as Baidu, Sohu etc. associates, and forms user Mike relation chain.
Step 5: updating customer relationship chain based on user characteristics
Due to the change of various factors, user characteristics can produce change in different times, and such as user can change mobile phone, more Change number etc..The change of these user characteristicses needs to be updated for customer relationship chain, in order to improve the standard of user's identification True degree.
In this step, in the renewal of customer relationship chain can be by determining to need to(for) the analysis of user characteristics content Content, the analysis of these contents can determine according to the data characteristicses in specific user characteristics.For example work as user characteristics When middle device identification changes, can be identified by analyzing the strong ID such as telephone number, account related to device identification so that Determine that the change is due to caused by user has changed mobile device, so that it is related to update device identification in customer relationship chain etc. User characteristics.The mode that customer relationship chain is updated in certain the present embodiment is not limited to that, when user characteristics changes Associated user's feature in customer relationship chain can also be replaced, increases or deleted according to other rules.
Embodiment three
The present embodiment is related to a kind of network information control method, and this method is based on the identification for Internet user so as to this The relevant information of user is controlled.Methods described comprises the following steps:
Step 1: obtaining user characteristics
When user surfs the Net, user characteristics can be obtained by various modes, such as website can be by account either Cookie obtains user characteristics, using can pass through the acquisition of information user characteristics such as account.And may be used also on other network equipments To obtain user characteristics by analyzing user traffic data.
Step 2: based on the user characteristics and customer relationship chain identification Internet user
After user characteristics is got, you can to recognize Internet user, the customer relationship by user's relation chain Chain is the customer relationship chain that the method in embodiment one, embodiment two is set up.Identifying can both obtain after Internet user Part or all of feature of the Internet user in the customer relationship chain.
Step 3: the user characteristics in the customer relationship chain, controls the relevant information of the Internet user
The control includes the related control measure such as information push, Information Statistics, or information safety protection, for example When it is minor to identify the user, the web site contents browsed to the user or the scope browsed web sites are controlled System.In another example, when identifying the shopping preferences of the Internet user, it is controlled for the ad content for being pushed to the user. The relevant information includes all information that can control relevant with the network user, and those skilled in the art are for specific Specific purposes under environment, it may be determined that the content of the mode of the control and related relevant information.
It is only above this patent preferred embodiment, the protection domain of this patent should not limited to this. Every conversion under invention design for this patent application environment, and replacing for wherein particular technique means Generation, increase and omission should be all brought within the protection domain of this patent.

Claims (10)

1. a kind of method for recognizing Internet user, it is characterised in that methods described includes:
Obtain the user traffic data gathered in network link equipment;
User traffic data to acquisition is handled, and cleans the customer flow after the data unrelated with user characteristics are handled Data;
The user traffic data after the processing is analyzed, user characteristics therein is extracted;
All user characteristicses of unique user are associated, the customer relationship chain of unique user is formed.
2. according to the method described in claim 1, it is characterised in that methods described also includes:Update above-mentioned based on user characteristics Customer relationship chain.
3. method according to claim 1 or 2, it is characterised in that methods described includes:The user traffic data includes Setting flow optical splitter or using the user traffic data collected by way of Port Mirroring from interchanger.
4. the method according to any one of claim 1-3, it is characterised in that
The user characteristics includes:Cookie signature identifications, device identification, user account mark, statistics engine mark, SSP are wide Accuse engine identification and geographical position mark;
The user traffic data after the processing is analyzed, extracting user characteristics therein includes, and the user after processing is flowed Amount data analyzed, identify cookie signature identifications therein, device identification, user account mark, statistics engine identify, SSP advertisement engines are identified and geographical position mark, and are extracted.
5. the method according to any one of claim 1-4, it is characterised in that all user characteristicses of association unique user Including the relevance in the relevance of each user characteristics in the customer flow and each user characteristics content is closed Join the user characteristics.
6. a kind of device for recognizing user's online, it is characterised in that described device includes:
Data acquisition module, obtains the user traffic data gathered in network link equipment;
Data processing module, the user traffic data to acquisition is handled, and is cleaned the data unrelated with user characteristics and is obtained everywhere User traffic data after reason;
User characteristics extraction module, analyzes the user traffic data after the processing, extracts user characteristics therein;
User characteristics relating module, associates all user characteristicses of unique user, forms the customer relationship chain of unique user.
7. device according to claim 6, it is characterised in that described device also includes:
Customer relationship chain update module, above-mentioned customer relationship chain is updated based on user characteristics.
8. the method according to claim 6 or 7, it is characterised in that methods described includes:The user traffic data includes Setting flow optical splitter or using the user traffic data collected by way of Port Mirroring from interchanger.
9. the method according to any one of claim 6-8, it is characterised in that
The user characteristics includes:Cookie signature identifications, device identification, user account mark, statistics engine mark, SSP are wide Accuse engine identification and geographical position mark;
The user traffic data after the processing is analyzed, extracting user characteristics therein includes, and the user after processing is flowed Amount data analyzed, identify cookie signature identifications therein, device identification, user account mark, statistics engine identify, SSP advertisement engines are identified and geographical position mark, and are extracted.
10. the method according to any one of claim 6-9, it is characterised in that all users of association unique user are special Levy including the relevance in the relevance of each user characteristics in the customer flow and each user characteristics content Associate the user characteristics.
CN201610882549.7A 2016-10-10 2016-10-10 A kind of System and method for for recognizing Internet user Pending CN106998262A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610882549.7A CN106998262A (en) 2016-10-10 2016-10-10 A kind of System and method for for recognizing Internet user

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610882549.7A CN106998262A (en) 2016-10-10 2016-10-10 A kind of System and method for for recognizing Internet user

Publications (1)

Publication Number Publication Date
CN106998262A true CN106998262A (en) 2017-08-01

Family

ID=59431163

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610882549.7A Pending CN106998262A (en) 2016-10-10 2016-10-10 A kind of System and method for for recognizing Internet user

Country Status (1)

Country Link
CN (1) CN106998262A (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107563810A (en) * 2017-08-31 2018-01-09 江苏省公用信息有限公司 A kind of advertisement placement method based on broadband account
CN107578272A (en) * 2017-08-10 2018-01-12 上海斐讯数据通信技术有限公司 A kind of method and device for kinsfolk's portrait
CN107682344A (en) * 2017-10-18 2018-02-09 南京邮数通信息科技有限公司 A kind of ID collection of illustrative plates method for building up based on DPI data interconnection net identifications
CN108038714A (en) * 2017-11-29 2018-05-15 链家网(北京)科技有限公司 Advertisement promotion processing method and processing device
CN109995605A (en) * 2018-01-02 2019-07-09 中国移动通信有限公司研究院 A traffic identification method and device, and a computer-readable storage medium
CN110020166A (en) * 2017-12-21 2019-07-16 腾讯科技(深圳)有限公司 A kind of data analysing method and relevant device
CN110502697A (en) * 2019-08-26 2019-11-26 武汉斗鱼网络科技有限公司 A kind of target user's recognition methods, device and electronic equipment
CN110519263A (en) * 2019-08-26 2019-11-29 北京百度网讯科技有限公司 Anti- brush amount method, apparatus, equipment and computer readable storage medium
CN110782222A (en) * 2019-10-11 2020-02-11 厦门谷道集团有限公司 Method, system and equipment for identifying social media account based on big data intelligent mailbox
CN111277453A (en) * 2020-01-14 2020-06-12 恩亿科(北京)数据科技有限公司 End-to-end communication method and data monitoring system
CN112367406A (en) * 2020-11-19 2021-02-12 全知科技(杭州)有限责任公司 Method for identifying account behavior analysis corresponding account correlation attribute in web application system
CN112446748A (en) * 2021-01-29 2021-03-05 上海钐昆网络科技有限公司 Advertisement putting method, device, equipment and storage medium
CN115277106A (en) * 2022-06-30 2022-11-01 北京安博通科技股份有限公司 User identification method and system of network equipment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070036146A1 (en) * 2005-08-10 2007-02-15 Bellsouth Intellectual Property Corporation Analyzing and resolving internet service problems
CN103118382A (en) * 2013-01-21 2013-05-22 北京拓明科技有限公司 Analytical method of data traffic neighborhood ping-pong reselection
CN103634164A (en) * 2013-12-04 2014-03-12 中国联合网络通信集团有限公司 Method and system for acquiring traffic information
CN103906111A (en) * 2012-12-27 2014-07-02 中国移动通信集团内蒙古有限公司 Problem determination method and device for general packet radio service network
CN104951544A (en) * 2015-06-19 2015-09-30 百度在线网络技术(北京)有限公司 User data processing method and system and method and system for providing user data
CN105224593A (en) * 2015-08-25 2016-01-06 中国人民解放军信息工程大学 Frequent co-occurrence account method for digging in a kind of of short duration online affairs

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070036146A1 (en) * 2005-08-10 2007-02-15 Bellsouth Intellectual Property Corporation Analyzing and resolving internet service problems
CN103906111A (en) * 2012-12-27 2014-07-02 中国移动通信集团内蒙古有限公司 Problem determination method and device for general packet radio service network
CN103118382A (en) * 2013-01-21 2013-05-22 北京拓明科技有限公司 Analytical method of data traffic neighborhood ping-pong reselection
CN103634164A (en) * 2013-12-04 2014-03-12 中国联合网络通信集团有限公司 Method and system for acquiring traffic information
CN104951544A (en) * 2015-06-19 2015-09-30 百度在线网络技术(北京)有限公司 User data processing method and system and method and system for providing user data
CN105224593A (en) * 2015-08-25 2016-01-06 中国人民解放军信息工程大学 Frequent co-occurrence account method for digging in a kind of of short duration online affairs

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107578272A (en) * 2017-08-10 2018-01-12 上海斐讯数据通信技术有限公司 A kind of method and device for kinsfolk's portrait
CN107563810A (en) * 2017-08-31 2018-01-09 江苏省公用信息有限公司 A kind of advertisement placement method based on broadband account
CN107682344A (en) * 2017-10-18 2018-02-09 南京邮数通信息科技有限公司 A kind of ID collection of illustrative plates method for building up based on DPI data interconnection net identifications
CN108038714A (en) * 2017-11-29 2018-05-15 链家网(北京)科技有限公司 Advertisement promotion processing method and processing device
CN110020166B (en) * 2017-12-21 2023-02-10 腾讯科技(深圳)有限公司 Data analysis method and related equipment
CN110020166A (en) * 2017-12-21 2019-07-16 腾讯科技(深圳)有限公司 A kind of data analysing method and relevant device
CN109995605B (en) * 2018-01-02 2021-04-13 中国移动通信有限公司研究院 A kind of traffic identification method, device and computer readable storage medium
CN109995605A (en) * 2018-01-02 2019-07-09 中国移动通信有限公司研究院 A traffic identification method and device, and a computer-readable storage medium
CN110519263A (en) * 2019-08-26 2019-11-29 北京百度网讯科技有限公司 Anti- brush amount method, apparatus, equipment and computer readable storage medium
CN110502697A (en) * 2019-08-26 2019-11-26 武汉斗鱼网络科技有限公司 A kind of target user's recognition methods, device and electronic equipment
CN110519263B (en) * 2019-08-26 2022-05-17 北京百度网讯科技有限公司 Anti-brush amount method, apparatus, device and computer readable storage medium
CN110502697B (en) * 2019-08-26 2022-06-21 武汉斗鱼网络科技有限公司 Target user identification method and device and electronic equipment
CN110782222A (en) * 2019-10-11 2020-02-11 厦门谷道集团有限公司 Method, system and equipment for identifying social media account based on big data intelligent mailbox
CN111277453A (en) * 2020-01-14 2020-06-12 恩亿科(北京)数据科技有限公司 End-to-end communication method and data monitoring system
CN112367406A (en) * 2020-11-19 2021-02-12 全知科技(杭州)有限责任公司 Method for identifying account behavior analysis corresponding account correlation attribute in web application system
CN112446748A (en) * 2021-01-29 2021-03-05 上海钐昆网络科技有限公司 Advertisement putting method, device, equipment and storage medium
CN115277106A (en) * 2022-06-30 2022-11-01 北京安博通科技股份有限公司 User identification method and system of network equipment
CN115277106B (en) * 2022-06-30 2024-03-19 北京安博通科技股份有限公司 User identification method and system of network equipment

Similar Documents

Publication Publication Date Title
CN106998262A (en) A kind of System and method for for recognizing Internet user
CN106874266A (en) User's portrait method and the device for user's portrait
CN103218431B (en) A kind ofly can identify the system that info web gathers automatically
CN110337059B (en) Analysis algorithm, server and network system for family relationship of user
CN102663105B (en) The method for building up and system of number information database
CN103402177B (en) A kind of WiFi terminal information transmission system and its implementation
CN110321424B (en) A Deep Learning-Based Behavior Analysis Method for AIDS Personnel
CN105007171A (en) User data analysis system and method based on big data in communication field
CN107515915A (en) User based on user behavior data identifies correlating method
CN111104521B (en) An anti-fraud detection method and detection system based on graph analysis
CN103810623A (en) Real-time automatic marketing method and system
CN104298782B (en) Internet user actively accesses the analysis method of action trail
CN109858919A (en) Determination method and device, online ordering method and the device of abnormal account
CN104951544A (en) User data processing method and system and method and system for providing user data
CN104217346A (en) Precision advertising equipment and precision advertising method
CN104636473A (en) Data processing method and system based on electronic payment behaviors
CN107465739A (en) The method and device of entity channel user drainage
CN107682344A (en) A kind of ID collection of illustrative plates method for building up based on DPI data interconnection net identifications
CN110020161A (en) Data processing method, log processing method and terminal
CN103593769A (en) Telecommunication behavior statistical analysis system
CN107666404A (en) Broadband network user identification method and device
CN119066132A (en) A method for analyzing and completing fraud-related subjects in knowledge graphs based on a large language model
CN111177481A (en) User identifier mapping method and device
CN105373619A (en) User big data based user group analysis method and system
CN107832333B (en) Method and system for constructing user network data fingerprint based on distributed processing and DPI data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20170801