CN106998262A - A kind of System and method for for recognizing Internet user - Google Patents
A kind of System and method for for recognizing Internet user Download PDFInfo
- Publication number
- CN106998262A CN106998262A CN201610882549.7A CN201610882549A CN106998262A CN 106998262 A CN106998262 A CN 106998262A CN 201610882549 A CN201610882549 A CN 201610882549A CN 106998262 A CN106998262 A CN 106998262A
- Authority
- CN
- China
- Prior art keywords
- user
- data
- traffic data
- user characteristics
- identification
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 43
- 238000012545 processing Methods 0.000 claims abstract description 24
- 235000014510 cooky Nutrition 0.000 claims description 29
- 239000000284 extract Substances 0.000 claims description 5
- 238000000605 extraction Methods 0.000 claims description 5
- 230000003287 optical effect Effects 0.000 claims description 5
- 238000004458 analytical method Methods 0.000 description 14
- 238000004140 cleaning Methods 0.000 description 5
- 238000013507 mapping Methods 0.000 description 5
- 238000009412 basement excavation Methods 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000005206 flow analysis Methods 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000007943 implant Substances 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000005211 surface analysis Methods 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/14—Network analysis or design
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/50—Network service management, e.g. ensuring proper service fulfilment according to agreements
- H04L41/5061—Network service management, e.g. ensuring proper service fulfilment according to agreements characterised by the interaction between service providers and their network customers, e.g. customer relationship management
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Business, Economics & Management (AREA)
- General Business, Economics & Management (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Transfer Between Computers (AREA)
Abstract
The invention discloses a kind of method for recognizing Internet user, methods described includes:Obtain the user traffic data gathered in network link equipment;User traffic data to acquisition is handled, and cleans the user traffic data after the data unrelated with user characteristics are handled;The user traffic data after the processing is analyzed, user characteristics therein is extracted;All user characteristicses of unique user are associated, the customer relationship chain of unique user is formed.By gathering the customer flow information on network link, and handled above- mentioned information and analyzed the mark scope expanded for user characteristics, and for being analyzed based on the user characteristics in above-mentioned flow information, the accuracy for user's identification is improved, and has broken limitation in the prior art for user's identification.
Description
Technical field
This patent belongs to internet arena, is related to a kind of System and method for for recognizing Internet user.
Background technology
Currently, the Working Life of internet and people, all trades and professions in society are closely merged.Each user is using mutual
More or less " vestige " is all left when in networking, such as user passes through internet in the different time, using different equipment
Similar " vestige " can all be left with website by accessing different applications.
If we can be based on these " vestiges ", different features is extracted from every vestige to identify this user,
Set up feature recognition storehouse for the user, this will draw a portrait in accurate user, the precision marketing of inter-network station and application, market survey, with
And the field such as network air control possesses very high value.
In internet, there is the technological means of mark unique subscriber in the prior art, for example:Used at PC ends
Cookie mapping carry out user's mark, i.e., the user surfed the Net by cookie mapping methods to PC ends enters mark;
In mobile terminal user is identified using MAC Address, iOS IDFA, Android ID and IMEI etc..
Although the above method is solving the problem of user identifies, skill of the prior art with scope to a certain degree
Art scheme still has following shortcoming:
First, method coverage of the prior art is not enough:If dependent on SSP, DSP, Ad Exchange and media net
Information when carrying out advertising business of standing exchanges to set up cookie mapping, it is contemplated that the channel of advertisement putting, media covering
Rate etc., the program is difficult that the cookie mapping of progress user's universe internet access and inter-network station are got through.On the other hand,
Tripartite's statistics company can not also cover all websites and code implant, so as to carry out universe cookie mapping.Secondly, use
To identify the ID species deficiency of user:Except cookie and mobile device unique mark (IMEI, IDFA, Android ID,
Mac), there are many strong Property IDs can be for mark user, for example, user is in online media sites, social network sites, electric business website
Register account number, the cell-phone number of user, E-mail address account, broadband internet-access account number etc., in the future as the increase of terminal form, can also
There are more strong Property IDs to be used for identifying user.Many strong schemes of the Property ID based on prior art can not all be recognized completely.The
Three, unrealized striding equipment gets through that scheme of the prior art all can only be based respectively on PC ends and mobile terminal is identified to user,
And Internet user may use plurality of devices in actual conditions.In addition, the technical scheme of existent technique can not also ensure
User characteristics home banking upgrades in time, and for an online media sites, it is to have a life cycle in the cookie of user terminal
, if the cookie rules of certain online media sites are changed, the scheme of prior art is necessarily dependent upon again to the user at this
Media deliver advertisement, feature recognition storehouse could be updated accordingly, this will influence the development of related service.
The content of the invention
This patent is based on the demand of the prior art and proposed, this patent technical problem to be solved is to carry
For a kind of System and method for for recognizing Internet user, in order to be able to improve the accuracy of identification user and expand what is be applicable
Scope.
In order to solve the above problems, the technical scheme that this patent is provided includes:
A kind of method for recognizing Internet user, methods described includes:Obtain the customer flow gathered in network link equipment
Data;User traffic data to acquisition is handled, the user's stream cleaned after the data unrelated with user characteristics are handled
Measure data;The user traffic data after the processing is analyzed, user characteristics therein is extracted;Associate all users of unique user
Feature, forms the customer relationship chain of unique user.
Preferably, this method also includes, and above-mentioned customer relationship chain is updated based on user characteristics.
Preferably, methods described includes:The user traffic data is included by setting flow optical splitter or using end
The user traffic data that the mode of port mirror image is collected from interchanger.
Preferably, the user characteristics includes:Cookie signature identifications, device identification, user account mark, statistics engine
Mark, SSP advertisement engines mark and geographical position mark;The user traffic data after the processing is analyzed, therein use is extracted
Family feature includes, and the user traffic data after processing is analyzed, and identifies cookie signature identifications therein, equipment
Mark, user account mark, statistics engine mark, SSP advertisement engines mark and geographical position mark, and extracted.
Preferably, all user characteristicses of association unique user include, and each user in the customer flow is special
Relevance in the relevance levied and each user characteristics content associates the user characteristics.
A kind of device for recognizing Internet user is additionally provided according to the other side of this patent, described device includes:Number
According to acquisition module, the user traffic data gathered in network link equipment is obtained;Data processing module, to the customer flow of acquisition
Data are handled, and clean the user traffic data after the data unrelated with user characteristics are handled;User characteristics extracts mould
Block, analyzes the user traffic data after the processing, extracts user characteristics therein;User characteristics relating module, is associated single
All user characteristicses of user, form the customer relationship chain of unique user.
According to the other side of this patent there is provided the method for another identification Internet user, methods described includes:
User traffic data is gathered in network link;User traffic data to collection is handled, including the user collected is flowed
Amount data are pre-processed and cleaned, and wrong and invalid data are removed in the pretreatment, and the pretreatment is weeded out and user
The unrelated data of feature;So as to the user traffic data after being handled;The user traffic data after the processing is analyzed, is extracted
User characteristics therein;All user characteristicses of unique user are associated, the customer relationship chain of unique user is formed.
According to the other side of this patent, a kind of network information control method is additionally provided, methods described includes:Step
First, user characteristics is obtained, the user characteristics includes the characteristic information for recognizing Internet user;Step 2: based on the user
Feature and the customer relationship chain identification Internet user as any one of claim 1-11;Step 3: being used according to the online
User characteristics in the customer relationship chain at family, controls the relevant information of the Internet user.
This patent is handled above- mentioned information and analyzed expansion by gathering the customer flow information on network link
For the mark scope of user characteristics, and for being analyzed based on the user characteristics in above-mentioned flow information, improve
For the accuracy of user's identification, and limitation in the prior art for user's identification is broken.
Brief description of the drawings
Fig. 1 is a kind of flow chart for the method for recognizing Internet user in this patent embodiment
Fig. 2 is the structural representation of customer relationship chain in this patent embodiment.
Embodiment
Specific implementation of the patent mode is described in detail below in conjunction with the accompanying drawings.It should be noted that the specific reality
It is only the citing to this patent optimal technical scheme to apply mode, can not be interpreted as the limitation to this patent protection domain.
Embodiment one
A kind of method for recognizing Internet user is present embodiments provided, the scene that methods described is realized includes but is not limited to one
Equipment is planted, for example, server, PC or mobile device;Cooperation either between the said equipment.Each of which is set
It can include assisting work to realize this method between multiple individuals, each individual in standby.
As shown in figure 1, methods described comprises the following steps:
Step 1: obtaining the user traffic data gathered in network link equipment
In this step, the data of acquisition are collected in network link, for example, setting flow on switches
Optical splitter has collected data on internet by the way of Port Mirroring.It can be obtained by way of directly transmitting
Take above-mentioned data;For example the data collected in network link can also be stored and obtained again by indirect mode
Obtained data.
It is further preferred that above-mentioned data on flows can be gathered in the present embodiment in multiple switch, then will be above-mentioned
Data are collected, and then expand the scope of data acquisition.
Data in the network link are distinct from the data collected in user terminal, and it comes from network link and set
It is standby, when same user is surfed the Net using different terminal devices, although information produced by each terminal device has certain area
Not, but the data on each terminal device can be delivered in the network link equipment, thus in network link equipment
The user traffic data collected can comprehensively reflect internet information of the user under each equipment or each environment, so that
As the basis analyzed comprehensively.
In the present embodiment, the data for coming from the network link equipment can be by pretreated, certainly
Can be without pretreated initial data.This can not influence the implementation of the present embodiment.
Step 2: the user traffic data to acquisition is handled
The data that network link equipment is collected contain the substantial amounts of information unrelated with identification Internet user, for example,
The information related to operator, information related with network environment etc., above- mentioned information it is generally unrelated with user or with
The degree of association at family is smaller, thus needs to clean for the above in the information that obtains in network link equipment.Cleaning
The data volume of correlation is reduced afterwards, consequently facilitating carrying out the identification of user using effective data.
Therefore, carrying out processing to the user traffic data of acquisition in this step includes data cleansing, and data cleansing can be with
Realized by rule-based mode, for example, pre-setting corresponding cleaning rule, the regular data will not met and picked
Remove, and retain and meet the regular data.All there is the feature in more obvious content due to the information related to user, and
And the information unrelated with user also has the feature in obvious content;Thus those skilled in the art can be according to specific feelings
Condition sets the content of respective rule, therefore does not carry out detailed expansion to the cleaning rule in this embodiment.
In addition, in this step, the user traffic data progress processing to acquisition can also be included besides cleaning
It is other be easy to analysis operations, for example characterize, or compression etc. processing.
The user traffic data after processing is produced after being handled the user traffic data of acquisition.
Step 3: strategy and rule base based on identification user carry out user characteristics identification
Include substantial amounts of user's characteristic information in user traffic data after processing, for the customer flow number after processing
The basis as identification unique user is identified in each user characteristics in.
The identification for the user characteristics is strategy and rule base based on identification user to realize in this step.
The strategy and rule base refer to the typelib of the predetermined feature related to user and recognize the plan of this feature type
Slightly.For example, these characteristic types include but is not limited to:Cookie signature identifications, device identification, user account mark, statistics are drawn
Mark and geographical position mark are held up, etc..These characteristic types all have respective data characteristicses, by for the data characteristicses
Analysis so that it is determined that recognize the strategy of these user characteristicses, it is special with user in order to be extracted from the network link data of magnanimity
Levy the related user characteristics of identification.
Specifically, for example:
The cookie is that the data for feeding back to user terminal (be usually browser) are generated by server end, and user terminal can be by
Cookie data are saved in the text under some catalogue, ask next time just to send the cookie to clothes during same website
Business device.The feature of some user is can be identified for that by cookie, thus extracts the cookie numbers that network link equipment is collected
It is identified according to and to it, it is meaningful for identification user.
The device identification includes but is not limited to mobile device, generally has different device identifications on different devices
Number, for example, there is the coding for uniquely recognizing the mobile phone on a certain mobile phone, thus extraction and identification for device identification
It is related to identification user.And device identification often has specific data format, thus it can be incited somebody to action by the analysis of data format
The said equipment is identified.
The statistics engine mark, SSP advertisement engines mark refer to a user in statistics engine or SSP advertisement engines
Corresponding data, are either applied due to counting solicitous and SSP advertisement engines for a user in the website of a certain scope
On carried out the identification and push of information.Because the source that statistics engine is identified and SSP advertisement engines are identified is with significant special
Levy, it is thus possible to being identified in the data on flows for collect from network link equipment it, and based on statistics engine data
It is identified with SSP advertisement engine data also meaningful for identification user.
The user account mark, refers to identify the account information of the user, and for identifying user.Due to
Account of the user in some website either application is often to determine, thus identifies that the account can be identified for that out the use
Family.The characteristics of account of user has each specific in different websites and different application, can be according to using under specific environment
User account information in the setting Rule Extraction flow of family account.
The geographical position mark, geographical location information produced by referring to user in different websites or application, this
A little information can be that geographic coordinate information can also be geographical location information (such as city selection) after selection etc..This
A little geographical location information have reference significance for identification user.
In addition, the user characteristics can also include broadband account, cell-phone number, Mac addresses etc., these information can
The flow analysis obtained as the characteristic information of identification user from network link is obtained, so as to be used as the feature of identification user.
Carried out due to information that can be all on statistics network, thus by the data on flows obtained from network link
Feature recognition is stated, can realize that the user-association in the range of universe recognizes user in the range of universe.And it can also count on
User is on online media sites, social network sites, the register account number of electric business website, the cell-phone number of user, E-mail address account and broadband
The strong Property ID such as net account, for recognizing that the raising of user's degree of accuracy has significant meaning.In addition, user uses distinct device
When (such as using PC and mobile phone), (such as email accounts QQ number) has identical content in some identification features, because
And the user surfed the Net using distinct device can be recognized by the data on flows analyzed in network link.
Step 4: the related user characteristics of association sets up customer relationship chain
After the user characteristics for being extracted correlation, you can describe a certain user with the information reflected according to user characteristics,
By represent this with various user characteristicses associate, so as to set up customer relationship chain.The customer relationship chain is to institute
The accurate portrait of user is stated, so as to complete the identification to the user.
Wherein, associate related user characteristics to refer to, a variety of user characteristicses for representing same user are associated.Association
Features described above can be determined by predetermined rule, for example, understood based on the excavation to user characteristics, in certain time period,
The user characteristics that data traffic includes in same IP, it is by analyzing the content in each user characteristics that related user is special
Levy and associate one user of description.Analysis can also be passed through in the user characteristics included by the data on flows in a certain equipment
Related user characteristics is associated one user of description by the content in each user characteristics.Account etc. can additionally be passed through
The ID of strong attribute corresponding relation, carrys out multiple user characteristicses in associate traffic data so as to describe a user.
By setting up the association of user characteristics, the mark for user can be both realized.The association of the user characteristics is shown
Example property, as shown in Figure 2.After analysis data on flows, following customer relationship chain can be obtained, by taking user Mike as an example,
Under Mike network data, by Mike PC, Mike mobile phone association gets up to get through the boundary between equipment, while by Mike
Microblog account, QQ accounts, the cookie of website such as Baidu, Sohu etc. associates, and forms user Mike relation chain.
Step 5: updating customer relationship chain based on user characteristics
Due to the change of various factors, user characteristics can produce change in different times, and such as user can change mobile phone, more
Change number etc..The change of these user characteristicses needs to be updated for customer relationship chain, in order to improve the standard of user's identification
True degree.
In this step, in the renewal of customer relationship chain can be by determining to need to(for) the analysis of user characteristics content
Content, the analysis of these contents can determine according to the data characteristicses in specific user characteristics.For example work as user characteristics
When middle device identification changes, can be identified by analyzing the strong ID such as telephone number, account related to device identification so that
Determine that the change is due to caused by user has changed mobile device, so that it is related to update device identification in customer relationship chain etc.
User characteristics.The mode that customer relationship chain is updated in certain the present embodiment is not limited to that, when user characteristics changes
Associated user's feature in customer relationship chain can also be replaced, increases or deleted according to other rules.
Embodiment two
A kind of method for recognizing Internet user is provided in the present embodiment, can be by means of in network link in this method
Multiple equipment is realized to realize, or in part steps using the multiple equipment in the network link.The lattice chain
Equipment in road includes light splitting machine, server etc..
Method in the present embodiment comprises the following steps:
Step 1: gathering user traffic data in network link
The user traffic data is collected in network link, for example, on the interchanger in network link
Flow optical splitter is set or data on internet have been collected by the way of Port Mirroring.Preferably, in the present embodiment
Above-mentioned data on flows can be gathered in multiple switch, is then collected above-mentioned data, and then expands data acquisition
Scope.Data in the network link are distinct from the data collected in user terminal, and it comes from network link equipment, when
When same user is surfed the Net using different terminal devices, although information produced by each terminal device has certain difference, but
It is that data on each terminal device can be delivered in the network link equipment, thus is collected in network link equipment
User traffic data can comprehensively reflect internet information of the user under each equipment or each environment so that as complete
The basis of surface analysis.
Step 2: being handled for the user traffic data
In this step, processing is carried out to user traffic data includes pre-processing for data, that is, rejects error number
According to, invalid data etc. substantially brings the data of noise, or masks and be substantially related to individual privacy or private data, so that
It is easy to follow-up analysis and processing.
In addition, the user traffic data, which is handled, also to be included filtering out from pretreated data with recognizing
The related data of network users.Such as pretreated data contain the information related to operator, related with network environment
Information etc., above- mentioned information is generally unrelated with user or the degree of association of with user is smaller, thus needs for network link
The above in the information obtained in equipment is cleaned.The data volume of correlation is reduced after cleaning, consequently facilitating using having
The data of effect carry out the identification of user.
Step 3: strategy and rule base the identification user characteristics based on identification user
Include substantial amounts of user's characteristic information in user traffic data after processing, for the customer flow number after processing
The basis as identification unique user is identified in each user characteristics in.
The identification for the user characteristics is strategy and rule base based on identification user to realize in this step.
The strategy and rule base refer to the typelib of the predetermined feature related to user and recognize the plan of this feature type
Slightly.For example, these characteristic types include but is not limited to:Cookie signature identifications, device identification, user account mark, statistics are drawn
Mark, SSP advertisement engines mark and geographical position mark are held up, etc..These characteristic types all have respective data characteristicses, pass through
For the data characteristicses analysis so that it is determined that recognize the strategy of these user characteristicses, in order to from the lattice chain way of magnanimity
The user characteristics related to user characteristics identification is extracted according to middle.
Specifically, for example:
The cookie is that the data for feeding back to user terminal (be usually browser) are generated by server end, and user terminal can be by
Cookie data are saved in the text under some catalogue, ask next time just to send the cookie to clothes during same website
Business device.The feature of some user is can be identified for that by cookie, thus extracts the cookie numbers that network link equipment is collected
It is identified according to and to it, it is meaningful for identification user.
The device identification includes but is not limited to mobile device, generally has different device identifications on different devices
Number, for example, there is the coding for uniquely recognizing the mobile phone on a certain mobile phone, thus extraction and identification for device identification
It is related to identification user.And device identification often has specific data format, thus it can be incited somebody to action by the analysis of data format
The said equipment is identified.
The statistics engine mark, SSP advertisement engines mark refer to a user in statistics engine or SSP advertisement engines
Corresponding data, are either applied due to counting solicitous and SSP advertisement engines for a user in the website of a certain scope
On carried out the identification and push of information.Because the source that statistics engine is identified and SSP advertisement engines are identified is with significant special
Levy, it is thus possible to being identified in the data on flows for collect from network link equipment it, and based on statistics engine data
It is identified with SSP advertisement engine data also meaningful for identification user.
The user account mark, refers to identify the account information of the user, and for identifying user.Due to
Account of the user in some website either application is often to determine, thus identifies that the account can be identified for that out the use
Family.The characteristics of account of user has each specific in different websites and different application, can be according to using under specific environment
User account information in the setting Rule Extraction flow of family account.
The geographical position mark, geographical location information produced by referring to user in different websites or application, this
A little information can be that geographic coordinate information can also be geographical location information (such as city selection) after selection etc..This
A little geographical location information have reference significance for identification user.
In addition, the user characteristics can also include broadband account, cell-phone number, Mac addresses etc., these information can
The flow analysis obtained as the characteristic information of identification user from network link is obtained, so as to be used as the feature of identification user.
Carried out due to information that can be all on statistics network, thus by the data on flows obtained from network link
Feature recognition is stated, can realize that the user-association in the range of universe recognizes user in the range of universe.And it can also count on
User is on online media sites, social network sites, the register account number of electric business website, the cell-phone number of user, E-mail address account and broadband
The strong Property ID such as net account, for recognizing that the raising of user's degree of accuracy has significant meaning.In addition, user uses distinct device
When (such as using PC and mobile phone), (such as email accounts QQ number) has identical content in some identification features, because
And the user surfed the Net using distinct device can be recognized by the data on flows analyzed in network link.
Step 4: the related user characteristics of association sets up customer relationship chain
After the user characteristics for being extracted correlation, you can describe a certain user with the information reflected according to user characteristics,
By represent this with various user characteristicses associate, so as to set up customer relationship chain.The customer relationship chain is to institute
The accurate portrait of user is stated, so as to complete the identification to the user.
Wherein, associate related user characteristics to refer to, a variety of user characteristicses for representing same user are associated.Association
Features described above can be determined by predetermined rule, for example, understood based on the excavation to user characteristics, in certain time period,
The user characteristics that data traffic includes in same IP, it is by analyzing the content in each user characteristics that related user is special
Levy and associate one user of description.Analysis can also be passed through in the user characteristics included by the data on flows in a certain equipment
Related user characteristics is associated one user of description by the content in each user characteristics.Account etc. can additionally be passed through
The ID of strong attribute corresponding relation, carrys out multiple user characteristicses in associate traffic data so as to describe a user.
By setting up the association of user characteristics, the mark for user can be both realized.The association of the user characteristics is shown
Example property, as shown in Figure 2.After analysis data on flows, following customer relationship chain can be obtained, by taking user Mike as an example,
Under Mike network data, by Mike PC, Mike mobile phone association gets up to get through the boundary between equipment, while by Mike
Microblog account, QQ accounts, the cookie of website such as Baidu, Sohu etc. associates, and forms user Mike relation chain.
Step 5: updating customer relationship chain based on user characteristics
Due to the change of various factors, user characteristics can produce change in different times, and such as user can change mobile phone, more
Change number etc..The change of these user characteristicses needs to be updated for customer relationship chain, in order to improve the standard of user's identification
True degree.
In this step, in the renewal of customer relationship chain can be by determining to need to(for) the analysis of user characteristics content
Content, the analysis of these contents can determine according to the data characteristicses in specific user characteristics.For example work as user characteristics
When middle device identification changes, can be identified by analyzing the strong ID such as telephone number, account related to device identification so that
Determine that the change is due to caused by user has changed mobile device, so that it is related to update device identification in customer relationship chain etc.
User characteristics.The mode that customer relationship chain is updated in certain the present embodiment is not limited to that, when user characteristics changes
Associated user's feature in customer relationship chain can also be replaced, increases or deleted according to other rules.
Embodiment three
The present embodiment is related to a kind of network information control method, and this method is based on the identification for Internet user so as to this
The relevant information of user is controlled.Methods described comprises the following steps:
Step 1: obtaining user characteristics
When user surfs the Net, user characteristics can be obtained by various modes, such as website can be by account either
Cookie obtains user characteristics, using can pass through the acquisition of information user characteristics such as account.And may be used also on other network equipments
To obtain user characteristics by analyzing user traffic data.
Step 2: based on the user characteristics and customer relationship chain identification Internet user
After user characteristics is got, you can to recognize Internet user, the customer relationship by user's relation chain
Chain is the customer relationship chain that the method in embodiment one, embodiment two is set up.Identifying can both obtain after Internet user
Part or all of feature of the Internet user in the customer relationship chain.
Step 3: the user characteristics in the customer relationship chain, controls the relevant information of the Internet user
The control includes the related control measure such as information push, Information Statistics, or information safety protection, for example
When it is minor to identify the user, the web site contents browsed to the user or the scope browsed web sites are controlled
System.In another example, when identifying the shopping preferences of the Internet user, it is controlled for the ad content for being pushed to the user.
The relevant information includes all information that can control relevant with the network user, and those skilled in the art are for specific
Specific purposes under environment, it may be determined that the content of the mode of the control and related relevant information.
It is only above this patent preferred embodiment, the protection domain of this patent should not limited to this.
Every conversion under invention design for this patent application environment, and replacing for wherein particular technique means
Generation, increase and omission should be all brought within the protection domain of this patent.
Claims (10)
1. a kind of method for recognizing Internet user, it is characterised in that methods described includes:
Obtain the user traffic data gathered in network link equipment;
User traffic data to acquisition is handled, and cleans the customer flow after the data unrelated with user characteristics are handled
Data;
The user traffic data after the processing is analyzed, user characteristics therein is extracted;
All user characteristicses of unique user are associated, the customer relationship chain of unique user is formed.
2. according to the method described in claim 1, it is characterised in that methods described also includes:Update above-mentioned based on user characteristics
Customer relationship chain.
3. method according to claim 1 or 2, it is characterised in that methods described includes:The user traffic data includes
Setting flow optical splitter or using the user traffic data collected by way of Port Mirroring from interchanger.
4. the method according to any one of claim 1-3, it is characterised in that
The user characteristics includes:Cookie signature identifications, device identification, user account mark, statistics engine mark, SSP are wide
Accuse engine identification and geographical position mark;
The user traffic data after the processing is analyzed, extracting user characteristics therein includes, and the user after processing is flowed
Amount data analyzed, identify cookie signature identifications therein, device identification, user account mark, statistics engine identify,
SSP advertisement engines are identified and geographical position mark, and are extracted.
5. the method according to any one of claim 1-4, it is characterised in that all user characteristicses of association unique user
Including the relevance in the relevance of each user characteristics in the customer flow and each user characteristics content is closed
Join the user characteristics.
6. a kind of device for recognizing user's online, it is characterised in that described device includes:
Data acquisition module, obtains the user traffic data gathered in network link equipment;
Data processing module, the user traffic data to acquisition is handled, and is cleaned the data unrelated with user characteristics and is obtained everywhere
User traffic data after reason;
User characteristics extraction module, analyzes the user traffic data after the processing, extracts user characteristics therein;
User characteristics relating module, associates all user characteristicses of unique user, forms the customer relationship chain of unique user.
7. device according to claim 6, it is characterised in that described device also includes:
Customer relationship chain update module, above-mentioned customer relationship chain is updated based on user characteristics.
8. the method according to claim 6 or 7, it is characterised in that methods described includes:The user traffic data includes
Setting flow optical splitter or using the user traffic data collected by way of Port Mirroring from interchanger.
9. the method according to any one of claim 6-8, it is characterised in that
The user characteristics includes:Cookie signature identifications, device identification, user account mark, statistics engine mark, SSP are wide
Accuse engine identification and geographical position mark;
The user traffic data after the processing is analyzed, extracting user characteristics therein includes, and the user after processing is flowed
Amount data analyzed, identify cookie signature identifications therein, device identification, user account mark, statistics engine identify,
SSP advertisement engines are identified and geographical position mark, and are extracted.
10. the method according to any one of claim 6-9, it is characterised in that all users of association unique user are special
Levy including the relevance in the relevance of each user characteristics in the customer flow and each user characteristics content
Associate the user characteristics.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201610882549.7A CN106998262A (en) | 2016-10-10 | 2016-10-10 | A kind of System and method for for recognizing Internet user |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201610882549.7A CN106998262A (en) | 2016-10-10 | 2016-10-10 | A kind of System and method for for recognizing Internet user |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN106998262A true CN106998262A (en) | 2017-08-01 |
Family
ID=59431163
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201610882549.7A Pending CN106998262A (en) | 2016-10-10 | 2016-10-10 | A kind of System and method for for recognizing Internet user |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN106998262A (en) |
Cited By (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107563810A (en) * | 2017-08-31 | 2018-01-09 | 江苏省公用信息有限公司 | A kind of advertisement placement method based on broadband account |
| CN107578272A (en) * | 2017-08-10 | 2018-01-12 | 上海斐讯数据通信技术有限公司 | A kind of method and device for kinsfolk's portrait |
| CN107682344A (en) * | 2017-10-18 | 2018-02-09 | 南京邮数通信息科技有限公司 | A kind of ID collection of illustrative plates method for building up based on DPI data interconnection net identifications |
| CN108038714A (en) * | 2017-11-29 | 2018-05-15 | 链家网(北京)科技有限公司 | Advertisement promotion processing method and processing device |
| CN109995605A (en) * | 2018-01-02 | 2019-07-09 | 中国移动通信有限公司研究院 | A traffic identification method and device, and a computer-readable storage medium |
| CN110020166A (en) * | 2017-12-21 | 2019-07-16 | 腾讯科技(深圳)有限公司 | A kind of data analysing method and relevant device |
| CN110502697A (en) * | 2019-08-26 | 2019-11-26 | 武汉斗鱼网络科技有限公司 | A kind of target user's recognition methods, device and electronic equipment |
| CN110519263A (en) * | 2019-08-26 | 2019-11-29 | 北京百度网讯科技有限公司 | Anti- brush amount method, apparatus, equipment and computer readable storage medium |
| CN110782222A (en) * | 2019-10-11 | 2020-02-11 | 厦门谷道集团有限公司 | Method, system and equipment for identifying social media account based on big data intelligent mailbox |
| CN111277453A (en) * | 2020-01-14 | 2020-06-12 | 恩亿科(北京)数据科技有限公司 | End-to-end communication method and data monitoring system |
| CN112367406A (en) * | 2020-11-19 | 2021-02-12 | 全知科技(杭州)有限责任公司 | Method for identifying account behavior analysis corresponding account correlation attribute in web application system |
| CN112446748A (en) * | 2021-01-29 | 2021-03-05 | 上海钐昆网络科技有限公司 | Advertisement putting method, device, equipment and storage medium |
| CN115277106A (en) * | 2022-06-30 | 2022-11-01 | 北京安博通科技股份有限公司 | User identification method and system of network equipment |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070036146A1 (en) * | 2005-08-10 | 2007-02-15 | Bellsouth Intellectual Property Corporation | Analyzing and resolving internet service problems |
| CN103118382A (en) * | 2013-01-21 | 2013-05-22 | 北京拓明科技有限公司 | Analytical method of data traffic neighborhood ping-pong reselection |
| CN103634164A (en) * | 2013-12-04 | 2014-03-12 | 中国联合网络通信集团有限公司 | Method and system for acquiring traffic information |
| CN103906111A (en) * | 2012-12-27 | 2014-07-02 | 中国移动通信集团内蒙古有限公司 | Problem determination method and device for general packet radio service network |
| CN104951544A (en) * | 2015-06-19 | 2015-09-30 | 百度在线网络技术(北京)有限公司 | User data processing method and system and method and system for providing user data |
| CN105224593A (en) * | 2015-08-25 | 2016-01-06 | 中国人民解放军信息工程大学 | Frequent co-occurrence account method for digging in a kind of of short duration online affairs |
-
2016
- 2016-10-10 CN CN201610882549.7A patent/CN106998262A/en active Pending
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070036146A1 (en) * | 2005-08-10 | 2007-02-15 | Bellsouth Intellectual Property Corporation | Analyzing and resolving internet service problems |
| CN103906111A (en) * | 2012-12-27 | 2014-07-02 | 中国移动通信集团内蒙古有限公司 | Problem determination method and device for general packet radio service network |
| CN103118382A (en) * | 2013-01-21 | 2013-05-22 | 北京拓明科技有限公司 | Analytical method of data traffic neighborhood ping-pong reselection |
| CN103634164A (en) * | 2013-12-04 | 2014-03-12 | 中国联合网络通信集团有限公司 | Method and system for acquiring traffic information |
| CN104951544A (en) * | 2015-06-19 | 2015-09-30 | 百度在线网络技术(北京)有限公司 | User data processing method and system and method and system for providing user data |
| CN105224593A (en) * | 2015-08-25 | 2016-01-06 | 中国人民解放军信息工程大学 | Frequent co-occurrence account method for digging in a kind of of short duration online affairs |
Cited By (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN107578272A (en) * | 2017-08-10 | 2018-01-12 | 上海斐讯数据通信技术有限公司 | A kind of method and device for kinsfolk's portrait |
| CN107563810A (en) * | 2017-08-31 | 2018-01-09 | 江苏省公用信息有限公司 | A kind of advertisement placement method based on broadband account |
| CN107682344A (en) * | 2017-10-18 | 2018-02-09 | 南京邮数通信息科技有限公司 | A kind of ID collection of illustrative plates method for building up based on DPI data interconnection net identifications |
| CN108038714A (en) * | 2017-11-29 | 2018-05-15 | 链家网(北京)科技有限公司 | Advertisement promotion processing method and processing device |
| CN110020166B (en) * | 2017-12-21 | 2023-02-10 | 腾讯科技(深圳)有限公司 | Data analysis method and related equipment |
| CN110020166A (en) * | 2017-12-21 | 2019-07-16 | 腾讯科技(深圳)有限公司 | A kind of data analysing method and relevant device |
| CN109995605B (en) * | 2018-01-02 | 2021-04-13 | 中国移动通信有限公司研究院 | A kind of traffic identification method, device and computer readable storage medium |
| CN109995605A (en) * | 2018-01-02 | 2019-07-09 | 中国移动通信有限公司研究院 | A traffic identification method and device, and a computer-readable storage medium |
| CN110519263A (en) * | 2019-08-26 | 2019-11-29 | 北京百度网讯科技有限公司 | Anti- brush amount method, apparatus, equipment and computer readable storage medium |
| CN110502697A (en) * | 2019-08-26 | 2019-11-26 | 武汉斗鱼网络科技有限公司 | A kind of target user's recognition methods, device and electronic equipment |
| CN110519263B (en) * | 2019-08-26 | 2022-05-17 | 北京百度网讯科技有限公司 | Anti-brush amount method, apparatus, device and computer readable storage medium |
| CN110502697B (en) * | 2019-08-26 | 2022-06-21 | 武汉斗鱼网络科技有限公司 | Target user identification method and device and electronic equipment |
| CN110782222A (en) * | 2019-10-11 | 2020-02-11 | 厦门谷道集团有限公司 | Method, system and equipment for identifying social media account based on big data intelligent mailbox |
| CN111277453A (en) * | 2020-01-14 | 2020-06-12 | 恩亿科(北京)数据科技有限公司 | End-to-end communication method and data monitoring system |
| CN112367406A (en) * | 2020-11-19 | 2021-02-12 | 全知科技(杭州)有限责任公司 | Method for identifying account behavior analysis corresponding account correlation attribute in web application system |
| CN112446748A (en) * | 2021-01-29 | 2021-03-05 | 上海钐昆网络科技有限公司 | Advertisement putting method, device, equipment and storage medium |
| CN115277106A (en) * | 2022-06-30 | 2022-11-01 | 北京安博通科技股份有限公司 | User identification method and system of network equipment |
| CN115277106B (en) * | 2022-06-30 | 2024-03-19 | 北京安博通科技股份有限公司 | User identification method and system of network equipment |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN106998262A (en) | A kind of System and method for for recognizing Internet user | |
| CN106874266A (en) | User's portrait method and the device for user's portrait | |
| CN103218431B (en) | A kind ofly can identify the system that info web gathers automatically | |
| CN110337059B (en) | Analysis algorithm, server and network system for family relationship of user | |
| CN102663105B (en) | The method for building up and system of number information database | |
| CN103402177B (en) | A kind of WiFi terminal information transmission system and its implementation | |
| CN110321424B (en) | A Deep Learning-Based Behavior Analysis Method for AIDS Personnel | |
| CN105007171A (en) | User data analysis system and method based on big data in communication field | |
| CN107515915A (en) | User based on user behavior data identifies correlating method | |
| CN111104521B (en) | An anti-fraud detection method and detection system based on graph analysis | |
| CN103810623A (en) | Real-time automatic marketing method and system | |
| CN104298782B (en) | Internet user actively accesses the analysis method of action trail | |
| CN109858919A (en) | Determination method and device, online ordering method and the device of abnormal account | |
| CN104951544A (en) | User data processing method and system and method and system for providing user data | |
| CN104217346A (en) | Precision advertising equipment and precision advertising method | |
| CN104636473A (en) | Data processing method and system based on electronic payment behaviors | |
| CN107465739A (en) | The method and device of entity channel user drainage | |
| CN107682344A (en) | A kind of ID collection of illustrative plates method for building up based on DPI data interconnection net identifications | |
| CN110020161A (en) | Data processing method, log processing method and terminal | |
| CN103593769A (en) | Telecommunication behavior statistical analysis system | |
| CN107666404A (en) | Broadband network user identification method and device | |
| CN119066132A (en) | A method for analyzing and completing fraud-related subjects in knowledge graphs based on a large language model | |
| CN111177481A (en) | User identifier mapping method and device | |
| CN105373619A (en) | User big data based user group analysis method and system | |
| CN107832333B (en) | Method and system for constructing user network data fingerprint based on distributed processing and DPI data |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170801 |