Nothing Special   »   [go: up one dir, main page]

CN112785315A - Batch registration identification method and device - Google Patents

Batch registration identification method and device Download PDF

Info

Publication number
CN112785315A
CN112785315A CN201911081028.1A CN201911081028A CN112785315A CN 112785315 A CN112785315 A CN 112785315A CN 201911081028 A CN201911081028 A CN 201911081028A CN 112785315 A CN112785315 A CN 112785315A
Authority
CN
China
Prior art keywords
account
registered
registration
community
registration information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911081028.1A
Other languages
Chinese (zh)
Other versions
CN112785315B (en
Inventor
李慧萍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Wodong Tianjun Information Technology Co Ltd
Original Assignee
Beijing Wodong Tianjun Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Wodong Tianjun Information Technology Co Ltd filed Critical Beijing Wodong Tianjun Information Technology Co Ltd
Priority to CN201911081028.1A priority Critical patent/CN112785315B/en
Publication of CN112785315A publication Critical patent/CN112785315A/en
Application granted granted Critical
Publication of CN112785315B publication Critical patent/CN112785315B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/018Certifying business or products
    • G06Q30/0185Product, service or business identity fraud
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0207Discounts or incentives, e.g. coupons or rebates
    • G06Q30/0225Avoiding frauds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • General Physics & Mathematics (AREA)
  • Marketing (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Health & Medical Sciences (AREA)
  • Tourism & Hospitality (AREA)
  • Primary Health Care (AREA)
  • Human Resources & Organizations (AREA)
  • Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Game Theory and Decision Science (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The application provides a batch registration identification method and device. The method comprises the following steps: acquiring registration information of a current registered account, wherein the registration information comprises a network IP address, a registered mobile phone number, a registered mailbox and an identifier of a device used in registration; determining an account having a relationship with the current registered account according to the registration information and the historical account registration information, wherein the account having the relationship with the current registered account is an account having the same one or more of the network IP address, the registered mobile phone number, the registered mailbox and the identifier of the used equipment during registration when the registered information is registered; determining a community where the current registered account is located according to a historical graph and an account which has a relationship with the current registered account, wherein the historical graph is an unauthorized undirected graph generated according to the relationship between historical registered accounts; and identifying whether the current registered account is a batch registered account or not according to the batch registered risk value of the community and a preset threshold value. Therefore, the efficiency and the accuracy of batch registration identification are improved.

Description

Batch registration identification method and device
Technical Field
The present application relates to the field of data processing technologies, and in particular, to a batch registration identification method and apparatus.
Background
At present, various websites become an indispensable part of life of people, and provide great convenience for people, for example, e-commerce websites enable people to purchase and deliver goods to home without leaving home, social networking websites enable people to communicate in real time across tens of millions of distances, and video websites enable people to watch favorite video programs at any time. Meanwhile, in order to acquire more users, the websites can correspondingly provide more preferential conditions for the newly registered users, such as coupon shopping for new people of the e-commerce websites and the like. Therefore, some illegal users perform batch registration on the website, for example, a plurality of accounts are automatically registered by using a registration machine within one day, then new coupons are picked up, and then the new coupons are sent to the website for consumption, so that large asset loss is brought to a website provider. Therefore, a batch registration identification method is required to solve this problem.
In the existing batch registration identification method, one is a manual rule judgment method, and specifically, the risk of the account in the aspect of batch registration is judged by manually checking data to identify whether the mobile number is a commonly-used batch registration mobile number segment, whether the location of commonly-used batch registration information is located or whether the registration time is abnormal. The other method is a blacklist identification method, and specifically, mobile phone numbers, mailboxes and the like in the registration information identified in the historical time period are stored in a blacklist library, and if a user uses the information in the blacklist to register, the user can directly intercept the information.
However, the manual rule judgment method needs an experienced expert to identify and mark data, so that the required time is long and the efficiency is low; the blacklist identification method depends heavily on historical data, and if registration information outside the blacklist exists, effective identification cannot be achieved, and accuracy is low.
Disclosure of Invention
The application provides a batch registration identification method and device, which are used for improving the efficiency and accuracy of batch registration identification.
In a first aspect, the present application provides a batch registration identification method, including:
acquiring registration information of a current registered account, wherein the registration information comprises a network IP address, a registered mobile phone number, a registered mailbox and an identifier of a device used in registration;
determining an account having a relationship with the current registered account according to the registration information and the historical account registration information, wherein the account having the relationship with the current registered account is an account which is the same as one or more of a network IP address, a registered mobile phone number, a registered mailbox and an identifier of a used device during registration when the registered information is registered;
determining a community where the current registered account is located according to a history graph and an account having a relationship with the current registered account, wherein the history graph is an unauthorized undirected graph generated according to the relationship between history registered accounts;
and identifying whether the current registered account is a batch registered account or not according to the batch registered risk value of the community and a preset threshold, wherein the preset threshold is set according to business requirements.
Optionally, the registration information further includes an account identifier, and the determining, according to the history map and the account having a relationship with the current registered account, a community in which the current registered account is located includes:
adding the identifier of the current registered account and the identifier of the account having a relationship with the current registered account into the historical graph as edges of vertexes to obtain a new graph;
and finding out the community in which the current registered account is located from the new graph.
Optionally, the identifying whether the current registered account is a batch registered account according to the batch registration risk value of the community includes:
if the batch registration risk value of the community is larger than the preset threshold value, determining that the current registered account is a batch registered account;
and if the batch registration risk value of the community is smaller than the preset threshold value, and the current registered account and the account registered in the preset time period form a new community, identifying whether the current registered account is a batch registered account according to the account number of the new community and the preset threshold value.
Optionally, the method further includes:
generating the historical map according to the historical account registration information;
dividing the history graph into a plurality of communities by using a community discovery algorithm, wherein each community has no shared vertex with other communities;
and determining the batch registration risk value of each community according to the element number of each community.
Optionally, the generating the history map according to the historical account registration information includes:
finding out accounts which have a relationship with each other in the historical account registration information, wherein the accounts which have the relationship with each other are accounts which are the same with one or more of the network IP address, the registered mobile phone number, the registered mailbox and the identifier of the used equipment during registration;
and generating the history graph according to all the edges and the vertexes by taking the account identifications as the vertexes, connecting the vertexes as edges and taking the accounts with the mutual relation as one edge.
Optionally, before generating the history map according to the historical account registration information, the method further includes:
performing at least one of the following processes on the historical account registration information:
deleting account registration information with abnormal mobile phone number digits in the historical account registration information;
processing the mobile phone numbers in the historical account registration information into the same format; and
and deleting the account registration information of the local area network to which the network IP address belongs when the historical account registration information is registered.
Optionally, the method further includes:
and if the current registered account is identified as the batch registered account, performing interception registration.
In a second aspect, the present application provides a batch registration identification apparatus, including:
the acquisition module is used for acquiring the registration information of the current registered account, wherein the registration information comprises a network IP address, a registered mobile phone number, a registered mailbox and an identifier of a device used in registration;
the first processing module is used for determining an account which has a relationship with the current registered account according to the registration information and the historical account registration information, wherein the account which has the relationship with the current registered account is the account which is the same as one or more of a network IP address, a registered mobile phone number, a registered mailbox and an identifier of a used device during registration when the registered information is registered;
the second processing module is used for determining a community where the current registered account is located according to a historical graph and the account which has a relationship with the current registered account, wherein the historical graph is an unauthorized undirected graph generated according to the relationship between historical registered accounts;
and the identification module is used for identifying whether the current registered account is a batch registered account or not according to the batch registered risk value of the community and a preset threshold, and the preset threshold is set according to business requirements.
Optionally, the registration information further includes an identifier of an account, and the second processing module is configured to:
adding the identifier of the current registered account and the identifier of the account having a relationship with the current registered account into the historical graph as edges of vertexes to obtain a new graph;
and finding out the community in which the current registered account is located from the new graph.
Optionally, the identification module is configured to:
if the batch registration risk value of the community is larger than the preset threshold value, determining that the current registered account is a batch registered account;
and if the batch registration risk value of the community is smaller than the preset threshold value, and the current registered account and the account registered in the preset time period form a new community, identifying whether the current registered account is a batch registered account according to the account number of the new community and the preset threshold value.
Optionally, the first processing module is further configured to:
generating the historical map according to the historical account registration information;
dividing the history graph into a plurality of communities by using a community discovery algorithm, wherein each community has no shared vertex with other communities;
and determining the batch registration risk value of each community according to the element number of each community.
Optionally, the first processing module is configured to:
finding out accounts which have a relationship with each other in the historical account registration information, wherein the accounts which have the relationship with each other are accounts which are the same with one or more of the network IP address, the registered mobile phone number, the registered mailbox and the identifier of the used equipment during registration;
and generating the history graph according to all the edges and the vertexes by taking the account identifications as the vertexes, connecting the vertexes as edges and taking the accounts with the mutual relation as one edge.
Optionally, the first processing module is further configured to:
before generating the history map according to the history account registration information, performing at least one of the following processes on the history account registration information:
deleting account registration information with abnormal mobile phone number digits in the historical account registration information;
processing the mobile phone numbers in the historical account registration information into the same format; and
and deleting the account registration information of the local area network to which the network IP address belongs when the historical account registration information is registered.
Optionally, the identification module is further configured to:
and if the current registered account is identified as the batch registered account, performing interception registration.
According to the batch registration identification method and device, registration information of a current registration account is obtained, an account having a relation with the current registration account is determined according to the registration information and historical account registration information, the account having the relation with the current registration account is an account having one or more of the same network IP address, the same registration mobile phone number, the same registration mailbox and the same identifier of a device used during registration when the account is registered in the registration information, a community where the current registration account is located is determined according to a historical map and the account having the relation with the current registration account, and finally whether the current registration account is a batch registration account is identified according to a batch registration risk value of the community and a preset threshold value. Therefore, batch registration is identified by detecting the aggregation of the account registration information, marking data is not needed in the method, so that the work of manual experience and the like is avoided, and the efficiency and the accuracy of batch registration identification can be improved.
Drawings
In order to more clearly illustrate the technical solutions in the present application or the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present application, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without inventive exercise.
Fig. 1 is a schematic view of an application scenario of the present application;
FIG. 2 is a flowchart of an embodiment of a batch enrollment identification method provided herein;
FIG. 3 is a flowchart of an embodiment of a batch enrollment identification method provided herein;
fig. 4 is a schematic structural diagram of a batch registration recognition apparatus provided in the present application;
fig. 5 is a schematic diagram of a hardware structure of an electronic device provided in the present application.
Detailed Description
To make the purpose, technical solutions and advantages of the present application clearer, the technical solutions in the present application will be clearly and completely described below with reference to the drawings in the present application, and it is obvious that the described embodiments are some, but not all embodiments of the present application. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
First, some terms in the embodiments of the present application are explained below to facilitate understanding by those skilled in the art.
1. The figure is the main research object of graph theory. Graph theory (Graph theory) is a branch of combinatorial mathematics, and has close relation with other mathematical branches, such as group theory, matrix theory and topology. A graph is a graph of a number of given vertices and edges connecting the two vertices, such a graph is often used to describe some specific relationship between something. Vertices are used to represent objects, and edges connecting two vertices are used to represent that there is such a relationship between two objects.
2. Community discovery: the community reflects the local characteristics of individual behaviors in the network and the association relationship between the individual behaviors, and the research on the community in the network plays an important role in understanding the structure and the function of the whole network and is helpful for analyzing and predicting the interaction relationship among elements of the whole network. The social discovery is more widely applied to the Internet Web, and if the social discovery is used for accurate advertisement delivery in the microblog, the social discovery is carried out on the users in the electronic commerce so as to help the users to establish a more reliable recommendation system, and more personalized search results can be provided for the users.
In the existing batch registration identification method, an experienced expert is needed for identifying and marking data in the manual rule judgment method, so that the required time is long and the efficiency is low; the blacklist identification method depends heavily on historical data, and if registration information outside the blacklist exists, effective identification cannot be achieved, and accuracy is low. In order to solve the problem, the application provides a batch registration identification method and device, through the similarity of user registration information and according to the characteristics of batch, a community discovery algorithm based on graph clustering is used for detecting the aggregative property of account registration information, so that batch registration is identified, marking data is not needed, so that the work of manual experience and the like is avoided, the efficiency and the accuracy of batch registration identification can be improved, the application can immediately carry out risk identification after the user registration action is finished, further ticket leading and order placing actions of batch registered users are avoided, and the asset loss of a service party can be effectively reduced. The following describes a specific implementation process of the batch registration identification method according to the embodiment of the present application in detail by using a specific embodiment with reference to the accompanying drawings.
Fig. 1 is a schematic view of an application scenario of the present application, as shown in fig. 1, when a user registers an account through a site or an application, the site or the application collects registration information of a currently registered account in real time, where the registration information includes an account identifier (such as an account number), a network IP address during registration, a registered mobile phone number, a registered mailbox, and an identifier of a device used during registration, when the user clicks a registration button, the site or the application sends the collected registration information of the currently registered account to a server, the server performs batch registration identification on the registration information of the currently registered account according to the batch registration identification method provided by the present application, and if the currently registered account is identified as a batch registered account, the registration is intercepted. If the current registered account is identified not to be the batch registered account, the interception operation is not carried out, and the user can successfully register.
Fig. 2 is a flowchart of an embodiment of a batch registration identification method provided in the present application, and as shown in fig. 2, the method of the present embodiment may include:
s101, obtaining the registration information of the current registered account, wherein the registration information comprises a network IP address, a registered mobile phone number, a registered mailbox and an identifier of a used device during registration.
Specifically, when a user registers an account through a site or an application, the site or the application collects registration information of a current registered account in real time, where the registration information includes a network IP address, a registered mobile phone number, a registered mailbox, and an identifier of a device used during registration, and the registration information may further include an account identifier (such as an account number), where the device used during registration, that is, an electronic device where the site or the application is located, may be an electronic device such as a mobile phone and a computer, and the identifier of the device used during registration is a mobile phone serial number (IMSI), specifically, if the system of the device used during registration is an android system, the identifier of the device is a unique advertisement Identifier (IDFA), and if the system of the device used during registration is an IOS system, the identifier of the device is a unique advertisement identifier (identifier for advertisement, IDFA). Optionally, the registration information may further include a registration site type (pc/app/m station, etc.) and a registration duration.
And S102, determining an account having a relationship with the current registered account according to the registration information and the historical account registration information, wherein the account having the relationship with the current registered account is the account which is the same as one or more of the network IP address, the registered mobile phone number, the registered mailbox and the identifier of the used equipment during registration.
Specifically, if the network IP addresses of the two accounts during registration are the same, it is considered that the two accounts have a relationship, or if the registered mobile phone numbers of the two accounts are the same as the registered mailbox, it is considered that the two accounts have a relationship, or if the network IP addresses of the two accounts during registration, the registered mobile phone numbers, the registered mailbox, and the identifiers of the devices used during registration are the same, it is considered that the two accounts have a relationship, and so on.
S103, determining a community where the current registered account is located according to the historical graph and the account which has a relationship with the current registered account, wherein the historical graph is an unauthorized undirected graph generated according to the relationship between the historical registered accounts.
As an implementation manner, S103 may specifically be: and adding edges taking the identification of the current registered account and the identification of the account having a relationship with the current registered account as vertexes into the historical graph to obtain a new graph, and searching the community where the current registered account is located from the new graph.
And S104, identifying whether the current registered account is a batch registered account or not according to the batch registered risk value of the community and a preset threshold value.
As an implementation manner, S104 may specifically be: if the batch registration risk value of the community is larger than a preset threshold value, determining that the current registered account is a batch registered account, wherein the preset threshold value is set according to business requirements;
and if the batch registration risk value of the community is smaller than the preset threshold value and the current registered account and the account registered in the preset time period form a new community, identifying whether the current registered account is a batch registered account or not according to the account number of the new community and the preset threshold value.
Further, if the current registered account is identified as the batch registered account, intercepting and registering. If the current registered account is identified not to be the batch registered account, the interception operation is not carried out, and the user can successfully register.
Optionally, before S101, the method of this embodiment may further include:
and S105, generating a history map according to the history account registration information.
Specifically, S105 may specifically be: and finding out accounts with a mutual relation in the historical account registration information, wherein the accounts with the mutual relation are accounts with one or more same items in the network IP address, the registered mobile phone number, the registered mailbox and the identification of the used equipment during registration, the account identification is used as a vertex, the vertices are connected together to be an edge, the accounts with the mutual relation are used as one edge, and a historical map is generated according to all the edges and the vertex.
And S106, dividing the history graph into a plurality of communities by using a community discovery algorithm, wherein each community has no shared vertex with other communities, namely each community does not overlap with other communities.
Specifically, a community discovery Algorithm such as Label Propagation Algorithm (LPA), Louvain community discovery, Fast community discovery Algorithm (Fast Unfolding), etc. may be used, and other graph clustering or community discovery algorithms such as weighted LPA Algorithm, community discovery Algorithm (GN), etc. may also be used.
And S107, determining the batch registration risk value of each community according to the element number of each community.
Specifically, the larger the number of elements of a community, the stronger the aggregativeness, and the greater the possibility of batch registration, as a practical way, the number of elements of each community may be used as a batch registration risk value, for example, a threshold value of 10 is set, and accordingly, if the number of elements of a community is 11, 11 is the batch registration risk value of the community, and 11 is greater than 10, the elements in the community are batch registered. As another practical way, the community with the largest number of elements in all the communities is determined, the number of the largest community is used as a denominator, each community is used as a numerator, a probability value of each community is sequentially calculated as a batch registration risk value of each community, for example, there are 5 communities in total, the number of the communities is respectively 10, 15, 35, 40 and 50, and the batch registration risk values of each community are respectively 10/50, 15/50, 35/50, 40/50 and 1. The preset threshold is set according to the business requirement, and can also be set by referring to the batch registration risk value and the business requirement of each community.
Further, before generating the history map according to the history account registration information, the method of this embodiment may further include:
performing at least one of the following processes on the historical account registration information:
deleting account registration information with abnormal mobile phone number digits in historical account registration information;
processing the mobile phone numbers in the historical account registration information into the same format, and removing the national area code identifications if the domestic mobile phone numbers are headed by the national area code identifications ' +086 ' ″, 86 ' and the like; and
and deleting the account registration information of the local area network belonging to the network IP address when the historical account registration information is registered. The local area network, for example 192.168.0.0, etc., deletes such account registration information because the local area network cannot effectively provide the location of the registered user when the registration is performed, and a mismatch is caused when the relationship is generated later, that is, a relationship is generated between registered users in unknown regions.
By processing the historical account registration information, the accuracy of batch registration identification can be further improved.
In the embodiment, through the similarity of the user registration information and according to the characteristics of batch, the community discovery algorithm based on graph clustering is used for detecting the aggregative property of the account registration information, so that batch registration is identified, marking data is not needed in the method, the work of manual experience and the like is avoided, the efficiency and the accuracy of batch registration identification can be improved, risk identification can be immediately carried out after the user registration action is finished, further ticket getting and order placing actions of the batch registered users are avoided, and the asset loss of a business party can be effectively reduced.
The batch registration identification method provided by this embodiment determines, by obtaining registration information of a current registration account, an account having a relationship with the current registration account according to the registration information and registration information of a historical account, where the account having the relationship with the current registration account is an account having the same one or more of a network IP address, a registered mobile phone number, a registered mailbox, and an identifier of a device used during registration when registered in the registration information, determines a community where the current registration account is located according to a historical map and the account having the relationship with the current registration account, and finally identifies whether the current registration account is a batch registration account according to a batch registration risk value of the community and a preset threshold. Therefore, batch registration is identified by detecting the aggregation of the account registration information, marking data is not needed in the method, so that the work of manual experience and the like is avoided, and the efficiency and the accuracy of batch registration identification can be improved.
The following describes the technical solution of the embodiment of the method shown in fig. 2 in detail by using a specific embodiment.
Fig. 3 is a flowchart of an embodiment of a batch registration identification method provided in the present application, and as shown in fig. 3, the method of the present embodiment may include:
s201, collecting historical account registration information and processing the historical account registration information.
Specifically, the historical account registration information is processed by at least one of the following processes:
deleting account registration information with abnormal mobile phone number digits in historical account registration information;
processing the mobile phone numbers in the historical account registration information into the same format, and removing the national area code identifications if the domestic mobile phone numbers are headed by the national area code identifications ' +086 ' ″, 86 ' and the like; and
and deleting the account registration information of the local area network belonging to the network IP address when the historical account registration information is registered. The local area network, for example 192.168.0.0, etc., deletes such account registration information because the local area network cannot effectively provide the location of the registered user when the registration is performed, and a mismatch is caused when the relationship is generated later, that is, a relationship is generated between registered users in unknown regions.
By processing the historical account registration information, the accuracy of the historical account registration information can be further improved, and a graph generated according to the historical account registration information is more accurate.
And S202, generating a history map according to the history account registration information.
Specifically, accounts with a relationship in the historical account registration information are found, the accounts with the relationship are accounts with the same one or more of the network IP address, the registered mobile phone number, the registered mailbox and the identifier of the used equipment during registration, the account identifier is used as a vertex, the vertices are connected to form an edge, the accounts with the relationship are used as one edge, and a historical map is generated according to all the edges and the vertex. Changing the above found relationship into the form of account Identification (ID) pair, for example, the relationship between account 1(ID1) and account 2(ID2) generates a ID pair (ID1, ID2) between them, and stores them in the graph database as an edge of the graph (an unauthorized undirected edge), and the account ID is a vertex point where the edges are connected, and the unauthorized undirected graph composed of the above edges and points is a history graph.
S203, dividing the history graph into a plurality of communities by using a community discovery algorithm, wherein each community has no shared vertex with other communities, namely each community does not overlap with other communities.
Specifically, community discovery algorithms such as LPA, Louvain community discovery, Fast Unfolding algorithm, etc. may be used, and other graph clustering or community discovery algorithms such as weighted LPA algorithm, GN community discovery algorithm, etc. may also be used.
And S204, determining the batch registration risk value of each community according to the element number of each community.
As one practical way, the number of elements of each community may be used as a batch registration risk value, and as another practical way, a community with the largest number of elements in all the communities is determined, and with the number of the largest community as a denominator and each community as a numerator, a probability value of each community is sequentially calculated to be used as a batch registration risk value of each community.
After the batch registration risk values of a plurality of communities are obtained, the community can be put into use, and a specific use process is carried out.
S205, obtaining the registration information of the current registered account, wherein the registration information comprises a network IP address, a registered mobile phone number, a registered mailbox and an identifier of a used device during registration.
And S206, determining an account which has a relationship with the current registered account according to the registration information and the historical account registration information, wherein the account which has the relationship with the current registered account is the account which has the same one or more of the network IP address, the registered mobile phone number, the registered mailbox and the identifier of the used equipment during registration when the registered information is registered.
And S207, determining a community where the current registered account is located according to the historical graph and the account having the relationship with the current registered account.
And S208, identifying whether the current registered account is a batch registered account or not according to the batch registered risk value of the community and a preset threshold, wherein the preset threshold is set according to business requirements.
Specifically, if the batch registration risk value of the community is greater than a preset threshold value, determining that the current registered account is a batch registered account; and if the batch registration risk value of the community is smaller than the preset threshold value and the current registered account and the account registered in the preset time period form a new community, identifying whether the current registered account is a batch registered account or not according to the account number of the new community and the preset threshold value.
And if the current registered account is identified as the batch registered account, performing interception registration. If the current registered account is identified not to be the batch registered account, the interception operation is not carried out, and the user can successfully register.
For example, when a user registers an account through a site or an application program, the site or the application program collects registration information of a current registered account in real time, the registration information includes account identification (such as an account number), a network IP address during registration, a registered mobile phone number, a registered mailbox and identification of a device used during registration, when the user clicks a registration button, the site or the application program sends the collected registration information of the current registered account to a server, the server determines an account having a relationship with the current registered account according to the registration information and historical account registration information, generates a relationship between the current registered account and the account, generates new edges, and adds the new edges into a historical map. Then, a community discovery algorithm is carried out to determine the community where the current registered account is located, if the current registered account is classified into a certain historical community, the risk value of the account is equal to the risk value of the community, the batch registration risk value of the community where the current registered account is located is assumed to be 0.9, the preset threshold value is 0.5, and obviously 0.9 is greater than 0.5, so that the batch registration risk of the current registered account is judged, and the intercepting registration operation is carried out; if the batch registration risk value of the community in which the current registration account is located is smaller than 0.5, for example, 0.4 and the like, it is determined that the batch registration risk of the current registration account is relatively small, no interception operation is performed, and the user can successfully register.
Fig. 4 is a schematic structural diagram of a batch registration identification apparatus provided in the present application, and as shown in fig. 4, the apparatus of this embodiment may include: an acquisition module 11, a first processing module 12, a second processing module 13 and an identification module 14, wherein,
the obtaining module 11 is configured to obtain registration information of a current registered account, where the registration information includes a network IP address during registration, a registered mobile phone number, a registered mailbox, and an identifier of a device used during registration;
the first processing module 12 is configured to determine, according to the registration information and the historical account registration information, an account that has a relationship with the current registered account, where the account that has a relationship with the current registered account is an account that is the same as one or more of a network IP address, a registered mobile phone number, a registered mailbox, and an identifier of a device used during registration when registered in the registration information;
the second processing module 13 is configured to determine a community in which the current registered account is located according to a history map and an account having a relationship with the current registered account, where the history map is an unauthorized undirected map generated according to a relationship between history registered accounts;
the identification module 14 is configured to identify whether the current registered account is a batch registered account according to the batch registration risk value of the community and a preset threshold, where the preset threshold is set according to a business requirement.
Further, the registration information further includes an identification of the account, and the second processing module 13 is configured to:
adding the edge taking the identifier of the current registered account and the identifier of the account having a relationship with the current registered account as a vertex into the historical graph to obtain a new graph;
and finding out the community in which the current registered account is located from the new graph.
Further, the identification module 14 is configured to:
if the batch registration risk value of the community is larger than a preset threshold value, determining that the current registered account is a batch registered account;
and if the batch registration risk value of the community is smaller than the preset threshold value and the current registered account and the account registered in the preset time period form a new community, identifying whether the current registered account is a batch registered account or not according to the account number of the new community and the preset threshold value.
Further, the first processing module 12 is further configured to:
generating a history graph according to the registration information of the history account;
dividing the history graph into a plurality of communities by using a community discovery algorithm, wherein each community has no shared vertex with other communities;
and determining the batch registration risk value of each community according to the element number of each community.
Further, the first processing module 12 is configured to:
finding out accounts which have a relationship with each other in the historical account registration information, wherein the accounts which have the relationship with each other are accounts which are the same with one or more of the network IP address, the registered mobile phone number, the registered mailbox and the identifier of the used equipment during registration;
and (4) taking account identifications as vertexes, connecting the vertexes to form edges, taking the accounts with mutual relation as one edge, and generating a history graph according to all the edges and the vertexes.
Further, the first processing module is further configured to:
before generating a history map according to the history account registration information, performing at least one of the following processes on the history account registration information:
deleting account registration information with abnormal mobile phone number digits in historical account registration information;
processing the mobile phone numbers in the historical account registration information into the same format; and
and deleting the account registration information of the local area network belonging to the network IP address when the historical account registration information is registered.
Further, the identification module is further configured to:
and if the current registered account is identified as the batch registered account, performing interception registration.
The apparatus provided in the embodiment of the present application may implement the method embodiment, and specific implementation principles and technical effects thereof may be referred to the method embodiment, which is not described herein again.
In the present application, the terminal device may be divided into functional modules according to the above method example, for example, each functional module may be divided corresponding to each function, or two or more functions may be integrated into one processing module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. It should be noted that the division of the modules in the embodiments of the present application is schematic, and is only one division of logic functions, and there may be another division manner in actual implementation.
Fig. 5 is a schematic diagram of a hardware structure of an electronic device provided in the present application. As shown in fig. 5, the electronic device 20 of the present embodiment may include: a memory 21 and a processor 22;
a memory 21 for storing a computer program;
a processor 22 for executing the computer program stored in the memory to implement the printing method in the above-described embodiments. Reference may be made in particular to the description relating to the method embodiments described above.
Alternatively, the memory 21 may be separate or integrated with the processor 22.
When the memory 21 is a device separate from the processor 22, the electronic device 20 may further include:
a bus 23 for connecting the memory 21 and the processor 22.
Optionally, this embodiment further includes: a communication interface 24, the communication interface 24 being connectable to the processor 22 via a bus 23. The processor 22 may control the communication interface 23 to implement the above-described receiving and transmitting functions of the electronic device 20.
The electronic device provided by this embodiment can be used to execute the above method, and its implementation manner and technical effect are similar, and this embodiment is not described herein again.
In the several embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the above-described embodiments of the apparatus are merely illustrative, and for example, the division of modules is only one logical division, and other divisions may be realized in practice, for example, a plurality of modules may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or modules, and may be in an electrical, mechanical or other form.
Modules described as separate parts may or may not be physically separate, and parts displayed as modules may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment.
In addition, functional modules in the embodiments of the present application may be integrated into one processing unit, or each module may exist alone physically, or two or more modules are integrated into one unit. The unit formed by the modules can be realized in a hardware form, and can also be realized in a form of hardware and a software functional unit.
The integrated module implemented in the form of a software functional module may be stored in a computer-readable storage medium. The software functional module is stored in a storage medium and includes several instructions to enable a computer device (which may be a personal computer, a server, or a network device) or a processor (processor) to execute some steps of the methods according to the embodiments of the present application.
It should be understood that the Processor may be a Central Processing Unit (CPU), other general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), etc. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. The steps of a method disclosed in connection with the present invention may be embodied directly in a hardware processor, or in a combination of the hardware and software modules within the processor.
The memory may comprise a high-speed RAM memory, and may further comprise a non-volatile storage NVM, such as at least one disk memory, and may also be a usb disk, a removable hard disk, a read-only memory, a magnetic or optical disk, etc.
The bus may be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnect (PCI) bus, an Extended ISA (EISA) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, the buses in the figures of the present application are not limited to only one bus or one type of bus.
The computer-readable storage medium may be implemented by any type or combination of volatile or non-volatile memory devices, such as Static Random Access Memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disks. A storage media may be any available media that can be accessed by a general purpose or special purpose computer.
Those of ordinary skill in the art will understand that: all or a portion of the steps of implementing the above-described method embodiments may be performed by hardware associated with program instructions. The program may be stored in a computer-readable storage medium. When executed, the program performs steps comprising the method embodiments described above; and the aforementioned storage medium includes: various media that can store program codes, such as ROM, RAM, magnetic or optical disks.
Finally, it should be noted that: the above embodiments are only used for illustrating the technical solutions of the present application, and not for limiting the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present application.

Claims (16)

1. A batch registration identification method is characterized by comprising the following steps:
acquiring registration information of a current registered account, wherein the registration information comprises a network IP address, a registered mobile phone number, a registered mailbox and an identifier of a device used in registration;
determining an account having a relationship with the current registered account according to the registration information and the historical account registration information, wherein the account having the relationship with the current registered account is an account which is the same as one or more of a network IP address, a registered mobile phone number, a registered mailbox and an identifier of a used device during registration when the registered information is registered;
determining a community where the current registered account is located according to a history graph and an account having a relationship with the current registered account, wherein the history graph is an unauthorized undirected graph generated according to the relationship between history registered accounts;
and identifying whether the current registered account is a batch registered account or not according to the batch registered risk value of the community and a preset threshold, wherein the preset threshold is set according to business requirements.
2. The method of claim 1, wherein the registration information further includes an account identifier, and wherein determining the community in which the current registered account is located according to the history map and the account having the relationship with the current registered account comprises:
adding the identifier of the current registered account and the identifier of the account having a relationship with the current registered account into the historical graph as edges of vertexes to obtain a new graph;
and finding out the community in which the current registered account is located from the new graph.
3. The method of claim 1, wherein identifying whether the current registered account is a bulk registered account based on a bulk registration risk value of the community comprises:
if the batch registration risk value of the community is larger than the preset threshold value, determining that the current registered account is a batch registered account;
and if the batch registration risk value of the community is smaller than the preset threshold value, and the current registered account and the account registered in the preset time period form a new community, identifying whether the current registered account is a batch registered account according to the account number of the new community and the preset threshold value.
4. The method according to any one of claims 1-3, further comprising:
generating the historical map according to the historical account registration information;
dividing the history graph into a plurality of communities by using a community discovery algorithm, wherein each community has no shared vertex with other communities;
and determining the batch registration risk value of each community according to the element number of each community.
5. The method of claim 4, wherein the generating the historical map from the historical account registration information comprises:
finding out accounts which have a relationship with each other in the historical account registration information, wherein the accounts which have the relationship with each other are accounts which are the same with one or more of the network IP address, the registered mobile phone number, the registered mailbox and the identifier of the used equipment during registration;
and generating the history graph according to all the edges and the vertexes by taking the account identifications as the vertexes, connecting the vertexes as edges and taking the accounts with the mutual relation as one edge.
6. The method of claim 4, wherein prior to generating the history map from the historical account registration information, the method further comprises:
performing at least one of the following processes on the historical account registration information:
deleting account registration information with abnormal mobile phone number digits in the historical account registration information;
processing the mobile phone numbers in the historical account registration information into the same format; and
and deleting the account registration information of the local area network to which the network IP address belongs when the historical account registration information is registered.
7. The method of claim 1, further comprising:
and if the current registered account is identified as the batch registered account, performing interception registration.
8. A batch enrollment recognition apparatus, comprising:
the acquisition module is used for acquiring the registration information of the current registered account, wherein the registration information comprises a network IP address, a registered mobile phone number, a registered mailbox and an identifier of a device used in registration;
the first processing module is used for determining an account which has a relationship with the current registered account according to the registration information and the historical account registration information, wherein the account which has the relationship with the current registered account is the account which is the same as one or more of a network IP address, a registered mobile phone number, a registered mailbox and an identifier of a used device during registration when the registered information is registered;
the second processing module is used for determining a community where the current registered account is located according to a historical graph and the account which has a relationship with the current registered account, wherein the historical graph is an unauthorized undirected graph generated according to the relationship between historical registered accounts;
and the identification module is used for identifying whether the current registered account is a batch registered account or not according to the batch registered risk value of the community and a preset threshold, and the preset threshold is set according to business requirements.
9. The apparatus of claim 8, wherein the registration information further comprises an identification of an account, and wherein the second processing module is configured to:
adding the identifier of the current registered account and the identifier of the account having a relationship with the current registered account into the historical graph as edges of vertexes to obtain a new graph;
and finding out the community in which the current registered account is located from the new graph.
10. The apparatus of claim 8, wherein the identification module is configured to:
if the batch registration risk value of the community is larger than the preset threshold value, determining that the current registered account is a batch registered account;
and if the batch registration risk value of the community is smaller than the preset threshold value, and the current registered account and the account registered in the preset time period form a new community, identifying whether the current registered account is a batch registered account according to the account number of the new community and the preset threshold value.
11. The apparatus of any of claims 8-10, wherein the first processing module is further configured to:
generating the historical map according to the historical account registration information;
dividing the history graph into a plurality of communities by using a community discovery algorithm, wherein each community has no shared vertex with other communities;
and determining the batch registration risk value of each community according to the element number of each community.
12. The apparatus of claim 11, wherein the first processing module is configured to:
finding out accounts which have a relationship with each other in the historical account registration information, wherein the accounts which have the relationship with each other are accounts which are the same with one or more of the network IP address, the registered mobile phone number, the registered mailbox and the identifier of the used equipment during registration;
and generating the history graph according to all the edges and the vertexes by taking the account identifications as the vertexes, connecting the vertexes as edges and taking the accounts with the mutual relation as one edge.
13. The apparatus of claim 11, wherein the first processing module is further configured to:
before generating the history map according to the history account registration information, performing at least one of the following processes on the history account registration information:
deleting account registration information with abnormal mobile phone number digits in the historical account registration information;
processing the mobile phone numbers in the historical account registration information into the same format; and
and deleting the account registration information of the local area network to which the network IP address belongs when the historical account registration information is registered.
14. The apparatus of claim 8, wherein the identification module is further configured to:
and if the current registered account is identified as the batch registered account, performing interception registration.
15. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the batch enrollment identification method according to any one of claims 1 to 7.
16. An electronic device, comprising:
a processor; and
a memory for storing executable instructions of the processor;
wherein the processor is configured to perform the batch enrollment identification method of any of claims 1-7 via execution of the executable instructions.
CN201911081028.1A 2019-11-07 2019-11-07 Batch registration identification method and device Active CN112785315B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911081028.1A CN112785315B (en) 2019-11-07 2019-11-07 Batch registration identification method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911081028.1A CN112785315B (en) 2019-11-07 2019-11-07 Batch registration identification method and device

Publications (2)

Publication Number Publication Date
CN112785315A true CN112785315A (en) 2021-05-11
CN112785315B CN112785315B (en) 2024-06-21

Family

ID=75748205

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911081028.1A Active CN112785315B (en) 2019-11-07 2019-11-07 Batch registration identification method and device

Country Status (1)

Country Link
CN (1) CN112785315B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114219439A (en) * 2021-12-15 2022-03-22 写逸网络科技(上海)有限公司 Batch social software registration method for preventing feature recognition
CN117934128A (en) * 2023-06-28 2024-04-26 南京光普信息技术有限公司 Method, system, medium and equipment for identifying suspicion of surrounding string in bidding

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106339615A (en) * 2016-08-29 2017-01-18 北京红马传媒文化发展有限公司 Abnormal registration behavior recognition method, system and equipment
CN107733883A (en) * 2017-10-09 2018-02-23 武汉斗鱼网络科技有限公司 A kind of method and device for detecting batch registration account
CN107800678A (en) * 2017-02-16 2018-03-13 平安科技(深圳)有限公司 The method and device that detection terminal is registered extremely
CN109561050A (en) * 2017-09-26 2019-04-02 武汉斗鱼网络科技有限公司 A kind of method and apparatus identifying batch account
US10333964B1 (en) * 2015-05-29 2019-06-25 Microsoft Technology Licensing, Llc Fake account identification
CN110032857A (en) * 2019-02-19 2019-07-19 阿里巴巴集团控股有限公司 The registration of account, the recognition methods of credible equipment and device
CN110324352A (en) * 2019-07-11 2019-10-11 武汉斗鱼网络科技有限公司 Identify the method and device of batch registration account group

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10333964B1 (en) * 2015-05-29 2019-06-25 Microsoft Technology Licensing, Llc Fake account identification
CN106339615A (en) * 2016-08-29 2017-01-18 北京红马传媒文化发展有限公司 Abnormal registration behavior recognition method, system and equipment
CN107800678A (en) * 2017-02-16 2018-03-13 平安科技(深圳)有限公司 The method and device that detection terminal is registered extremely
CN109561050A (en) * 2017-09-26 2019-04-02 武汉斗鱼网络科技有限公司 A kind of method and apparatus identifying batch account
CN107733883A (en) * 2017-10-09 2018-02-23 武汉斗鱼网络科技有限公司 A kind of method and device for detecting batch registration account
CN110032857A (en) * 2019-02-19 2019-07-19 阿里巴巴集团控股有限公司 The registration of account, the recognition methods of credible equipment and device
CN110324352A (en) * 2019-07-11 2019-10-11 武汉斗鱼网络科技有限公司 Identify the method and device of batch registration account group

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114219439A (en) * 2021-12-15 2022-03-22 写逸网络科技(上海)有限公司 Batch social software registration method for preventing feature recognition
CN117934128A (en) * 2023-06-28 2024-04-26 南京光普信息技术有限公司 Method, system, medium and equipment for identifying suspicion of surrounding string in bidding

Also Published As

Publication number Publication date
CN112785315B (en) 2024-06-21

Similar Documents

Publication Publication Date Title
CN106296344B (en) Malicious address identification method and device
CN111666346B (en) Information merging method, transaction inquiring method, device, computer and storage medium
WO2016015444A1 (en) Target user determination method, device and network server
US20210035126A1 (en) Data processing method, system and computer device based on electronic payment behaviors
CN111198976B (en) Cloud asset association analysis system, method, electronic equipment and medium
CN109086317B (en) Risk control method and related device
CN110689084B (en) Abnormal user identification method and device
CN105989106A (en) Recommendation method and device based on interest similarity
CN107612922A (en) User ID authentication method and device based on user operation habits and geographical position
CN113572752A (en) Abnormal flow detection method and device, electronic equipment and storage medium
CN112785315B (en) Batch registration identification method and device
CN108684044B (en) User behavior detection system, method and device
CN113327146A (en) Information tracking method and device
CN111242723B (en) User child and child condition judgment method, server and computer readable storage medium
CN111767481B (en) Access processing method, device, equipment and storage medium
CN110599278B (en) Method, apparatus, and computer storage medium for aggregating device identifiers
CN106933905B (en) Method and device for monitoring webpage access data
CN110943989B (en) Equipment identification method and device, electronic equipment and readable storage medium
CN109325348B (en) Application security analysis method and device, computing equipment and computer storage medium
CN114520830A (en) Message pushing method, device and system
CN106779899A (en) The recognition methods of malice order and device
CN112560961B (en) Target identification method and device based on graph clustering, electronic equipment and storage medium
CN108268545B (en) Method and device for establishing hierarchical user label library
CN111127089B (en) Bill data processing method and device and electronic equipment
CN108055661B (en) Telephone number blacklist establishing method and device based on communication network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant