Nothing Special   »   [go: up one dir, main page]

CN117177177B - Method and system for demographics of small-area occupancy based on signaling data - Google Patents

Method and system for demographics of small-area occupancy based on signaling data Download PDF

Info

Publication number
CN117177177B
CN117177177B CN202311452812.5A CN202311452812A CN117177177B CN 117177177 B CN117177177 B CN 117177177B CN 202311452812 A CN202311452812 A CN 202311452812A CN 117177177 B CN117177177 B CN 117177177B
Authority
CN
China
Prior art keywords
user
preset
boundary
base station
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202311452812.5A
Other languages
Chinese (zh)
Other versions
CN117177177A (en
Inventor
张广志
成立立
于笑博
李铭哲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beiling Rongxin Datalnfo Science and Technology Ltd
Original Assignee
Beiling Rongxin Datalnfo Science and Technology Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beiling Rongxin Datalnfo Science and Technology Ltd filed Critical Beiling Rongxin Datalnfo Science and Technology Ltd
Priority to CN202311452812.5A priority Critical patent/CN117177177B/en
Publication of CN117177177A publication Critical patent/CN117177177A/en
Application granted granted Critical
Publication of CN117177177B publication Critical patent/CN117177177B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Mobile Radio Communication Systems (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention discloses a small-area occupancy demographics method and a system based on signaling data, wherein the method comprises the steps of acquiring occupancy list information; based on a preset base station, extracting users belonging to the preset base station in residence or workplace in a staff table as a preselected user group; acquiring three base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of corresponding three base stations; obtaining the actual statistics of the position of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations; judging whether the user actually counts the position of the job to which the user belongs to be within the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information. The invention bypasses the existing method that the actual position of the user is replaced by the position of the base station, simulates the longitude and latitude coordinate point of the actual position of the user according to the jump condition of the base station during the static period of the user, and is convenient for the modification of the statistical caliber, and the like.

Description

Method and system for demographics of small-area occupancy based on signaling data
Technical Field
The invention relates to the technical field of data processing, in particular to a small-area occupancy demographics method and system based on signaling data.
Background
When the prior art performs occupancy population statistics, the occupancy population and the attribution of the workplace can be judged only according to the base station or the position of the base station, and the method cannot perform attribution statistics on small area ranges such as buildings, cells and the like.
Accordingly, there is a need for improvement in the art.
Disclosure of Invention
In view of the above problems, an object of the present invention is to provide a method and a system for small-area occupancy demographics based on signaling data, which can more conveniently and accurately count occupancy demographics in a small area, and separate occupancy positions of the occupancy demographics from base station positions, so as to facilitate subsequent statistics.
The first aspect of the present invention provides a method for small-area occupancy demographics based on signaling data, comprising:
acquiring staff information;
based on a preset base station, extracting users belonging to the preset base station in residence or workplace in a staff table as a preselected user group;
acquiring three base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of corresponding three base stations;
obtaining the actual statistics of the position of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations;
judging whether the user actually counts the position of the job to which the user belongs to be within the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information.
In this scheme, the step of acquiring the staff information specifically includes:
acquiring mobile phone signaling data information of a user;
obtaining behavior track information of a corresponding user according to the mobile phone signaling data information of the user;
obtaining a base station with the longest residence time of the corresponding user according to the obtained behavior track information of the corresponding user;
and setting the base station with the longest residence time of the corresponding user as the home base station, and sending the home base station to a preset table for storage to obtain the job-holding table.
In this scheme, the step of obtaining behavior track information of the corresponding user according to the mobile phone signaling data information of the user specifically includes:
obtaining a base station which the corresponding user passes through and time point information of the base station according to the mobile phone signaling data information of the user;
and connecting the positions of the users according to the time sequence of the users passing through the base stations by taking the base stations corresponding to the users as the positions of the users, so as to obtain the behavior track information of the users.
In this scheme, the formula for actually counting the position of the corresponding position of the user is obtained according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations, specifically:
setting the position of the position to which the user actually statistics belongsThe formula is
Wherein the method comprises the steps ofRespectively representing longitude and latitude of corresponding three base stations, < ->Respectively represent weight values corresponding to three base stations, and +.>
In this scheme, the step of acquiring the weight value of the corresponding base station specifically includes:
according to the information of three base stations with highest residence time of users in the preselected user group in a preset time period, obtaining residence time values of the users in the corresponding three base stations, and setting the residence time values as respectivelyThe method comprises the steps of carrying out a first treatment on the surface of the Then-> ,/>
In this scheme, if not, after deleting the corresponding user information, specifically including:
extracting the deleted user, and setting the user as a marked user;
acquiring the longitude and latitude of the position of the mark user to which the mark user actually belongs;
according to the boundary of the preset area and the longitude and latitude of the position of the job to which the marking user actually belongs, obtaining the shortest distance value from the marking user to the boundary of the preset area;
judging whether the shortest distance value from the marking user to the boundary of the preset area is smaller than a preset distance threshold value, if so, acquiring the position of the job to which the actual statistics of the marking user in a preset first time range belongs;
judging whether the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, if so, obtaining the probability that the actual statistics of the position of the mark user is within the boundary of the preset area;
judging whether the probability that the marking user actually counts the position of the corresponding job to be in the boundary of the preset area is larger than a preset probability threshold value, if so, setting the corresponding marking user as the job population of the preset area; if not, the corresponding marked user is not the occupancy population of the preset area.
In this scheme, if yes, the probability that the marking user actually counts the position of the affiliated job to be within the boundary of the preset area is obtained, which specifically includes:
when the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, recording that the actual statistics of the position of the mark user is once within the boundary of the preset area;
when the actual statistical position of the mark user in the preset first time range is not within the boundary of the preset area, recording that the actual statistical position of the mark user is not within the boundary of the preset area once;
acquiring the number of times that the marking user actually counts the position of the job to be in the boundary of the preset area and the number of times that the marking user is not in the boundary of the preset area;
the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area and the number of times that the marking user is not within the boundary of the preset area are accumulated, and the total number of times is obtained;
dividing the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area by the total number of times to obtain the probability that the marking user actually counts the position of the belonged position within the boundary of the preset area.
The second aspect of the present invention provides a signaling data-based small-area occupancy demographics system, comprising a memory and a processor, wherein the memory stores a signaling data-based small-area occupancy demographics method program, and the processor executes the signaling data-based small-area occupancy demographics method program to implement the following steps:
acquiring staff information;
based on a preset base station, extracting users belonging to the preset base station in residence or workplace in a staff table as a preselected user group;
acquiring three base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of corresponding three base stations;
obtaining the actual statistics of the position of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations;
judging whether the user actually counts the position of the job to which the user belongs to be within the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information.
In this scheme, the step of acquiring the staff information specifically includes:
acquiring mobile phone signaling data information of a user;
obtaining behavior track information of a corresponding user according to the mobile phone signaling data information of the user;
obtaining a base station with the longest residence time of the corresponding user according to the obtained behavior track information of the corresponding user;
and setting the base station with the longest residence time of the corresponding user as the home base station, and sending the home base station to a preset table for storage to obtain the job-holding table.
In this scheme, the step of obtaining behavior track information of the corresponding user according to the mobile phone signaling data information of the user specifically includes:
obtaining a base station which the corresponding user passes through and time point information of the base station according to the mobile phone signaling data information of the user;
and connecting the positions of the users according to the time sequence of the users passing through the base stations by taking the base stations corresponding to the users as the positions of the users, so as to obtain the behavior track information of the users.
In this scheme, the formula for actually counting the position of the corresponding position of the user is obtained according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations, specifically:
setting the position of the position to which the user actually statistics belongsThe formula is
Wherein the method comprises the steps ofRespectively representing longitude and latitude of corresponding three base stations, < ->Respectively represent weight values corresponding to three base stations, and +.>
In this scheme, the step of acquiring the weight value of the corresponding base station specifically includes:
according to the information of three base stations with highest residence time of users in the preselected user group in a preset time period, obtaining residence time values of the users in the corresponding three base stations, and setting the residence time values as respectivelyThe method comprises the steps of carrying out a first treatment on the surface of the Then-> ,/>
In this scheme, if not, after deleting the corresponding user information, specifically including:
extracting the deleted user, and setting the user as a marked user;
acquiring the longitude and latitude of the position of the mark user to which the mark user actually belongs;
according to the boundary of the preset area and the longitude and latitude of the position of the job to which the marking user actually belongs, obtaining the shortest distance value from the marking user to the boundary of the preset area;
judging whether the shortest distance value from the marking user to the boundary of the preset area is smaller than a preset distance threshold value, if so, acquiring the position of the job to which the actual statistics of the marking user in a preset first time range belongs;
judging whether the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, if so, obtaining the probability that the actual statistics of the position of the mark user is within the boundary of the preset area;
judging whether the probability that the marking user actually counts the position of the corresponding job to be in the boundary of the preset area is larger than a preset probability threshold value, if so, setting the corresponding marking user as the job population of the preset area; if not, the corresponding marked user is not the occupancy population of the preset area.
In this scheme, if yes, the probability that the marking user actually counts the position of the affiliated job to be within the boundary of the preset area is obtained, which specifically includes:
when the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, recording that the actual statistics of the position of the mark user is once within the boundary of the preset area;
when the actual statistical position of the mark user in the preset first time range is not within the boundary of the preset area, recording that the actual statistical position of the mark user is not within the boundary of the preset area once;
acquiring the number of times that the marking user actually counts the position of the job to be in the boundary of the preset area and the number of times that the marking user is not in the boundary of the preset area;
the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area and the number of times that the marking user is not within the boundary of the preset area are accumulated, and the total number of times is obtained;
dividing the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area by the total number of times to obtain the probability that the marking user actually counts the position of the belonged position within the boundary of the preset area.
The invention discloses a small-area occupancy demographics method and a system based on signaling data, which simulate longitude and latitude coordinate points of the actual position of a user according to the jump condition of a base station during the rest period of the user by bypassing the existing method and system for replacing the actual position of the user by the position of the base station, thereby flexibly judging the small area of the user, such as buildings, cells and the like, and facilitating the modification of the statistical caliber and the like.
Drawings
FIG. 1 illustrates a flow chart of a small area occupancy demographic method based on signaling data in accordance with the present invention;
fig. 2 shows a block diagram of a small area occupancy demographic system based on signaling data in accordance with the present invention.
Detailed Description
In order that the above-recited objects, features and advantages of the present invention will be more clearly understood, a more particular description of the invention will be rendered by reference to the appended drawings and appended detailed description. It should be noted that, in the case of no conflict, the embodiments of the present application and the features in the embodiments may be combined with each other.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, however, the present invention may be practiced in other ways than those described herein, and therefore the scope of the present invention is not limited to the specific embodiments disclosed below.
Fig. 1 shows a flow chart of a small area occupancy demographics method based on signaling data of the present invention.
As shown in fig. 1, the small-area occupancy demographics method based on signaling data of the present invention includes:
s101, acquiring staff information;
s102, based on a preset base station, extracting users belonging to the preset base station in a living or working area in a staff table as a preselected user group;
s103, acquiring three pieces of base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of the corresponding three base stations;
s104, obtaining the actual statistics of the position of the job of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations;
s105, judging whether the user actually counts the position of the corresponding job in the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information.
According to the embodiment of the invention, the staff table stores user information and information that the user living or working place belongs to the base station, the preset base station is a boundary of a preset area and a base station in a buffer area range, the buffer area ranges set in different scenes are different, wherein the buffer area ranges set in areas with more population and denser base stations are smaller, the buffer area ranges set in areas with more remote positions and less base stations are larger, for example, the boundary of the preset area is outwards expanded by 20 meters to be set as the buffer area range corresponding to the preset area, and the buffer area ranges are set by a person skilled in the art according to actual requirements.
According to an embodiment of the present invention, the step of obtaining staff information specifically includes:
acquiring mobile phone signaling data information of a user;
obtaining behavior track information of a corresponding user according to the mobile phone signaling data information of the user;
obtaining a base station with the longest residence time of the corresponding user according to the obtained behavior track information of the corresponding user;
and setting the base station with the longest residence time of the corresponding user as the home base station, and sending the home base station to a preset table for storage to obtain the job-holding table.
It should be noted that the behavior track information of the corresponding user includes the duration of residence of the corresponding user on the behavior track and the base station to which the corresponding user belongs, the duration of residence of the user on the behavior track is arranged in order from small to large, the base station with the longest residence time and on the behavior track is extracted, the base station with the longest residence time of the user is set as the home base station, and the preset table stores the user information.
According to the embodiment of the invention, the step of obtaining the behavior track information of the corresponding user according to the mobile phone signaling data information of the user specifically comprises the following steps:
obtaining a base station which the corresponding user passes through and time point information of the base station according to the mobile phone signaling data information of the user;
and connecting the positions of the users according to the time sequence of the users passing through the base stations by taking the base stations corresponding to the users as the positions of the users, so as to obtain the behavior track information of the users.
It should be noted that, the behavior track of the user is formed by drawing a connection line of the base station according to the time sequence of the user, and the behavior track information of the user includes the information of the base station of the user and the time length of the corresponding user in the base station of the user.
According to the embodiment of the invention, the formula for actually counting the position of the corresponding job by the user is obtained according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations, specifically:
setting the position of the position to which the user actually statistics belongsThe formula is
Wherein the method comprises the steps ofRespectively representing longitude and latitude of corresponding three base stations, < ->Respectively represent weight values corresponding to three base stations, and +.>
It should be noted that, according to the information of three base stations with longest residence time of the user in the pre-selected user group in the preset time period, the corresponding user actual statistics of the position of the corresponding user is obtained through calculation; weighting the base station position where the user resides according to the weight value of the base station, whereinRepresenting the longitude and latitude of the base station with the number 1; />Representing the longitude and latitude of the base station with the number of 2; />Representing the longitude and latitude of the base station with the number of 2; wherein->Representing longitude, & gt of base station>Representing the latitude of the base station; wherein->A weight value representing number 1; />A weight value representing the base station of number 2; />The weight value of the base station of number 3 is indicated.
According to the embodiment of the invention, the step of acquiring the weight value of the corresponding base station specifically comprises the following steps:
according to the information of three base stations with highest residence time of users in the preselected user group in a preset time period, obtaining residence time values of the users in the corresponding three base stations, and setting the residence time values as respectivelyThe method comprises the steps of carrying out a first treatment on the surface of the Then->,/>
The longer the user resides in the base station location, the higher the base station location weight value.
According to an embodiment of the present invention, if not, deleting the corresponding user information includes:
extracting the deleted user, and setting the user as a marked user;
acquiring the longitude and latitude of the position of the mark user to which the mark user actually belongs;
according to the boundary of the preset area and the longitude and latitude of the position of the job to which the marking user actually belongs, obtaining the shortest distance value from the marking user to the boundary of the preset area;
judging whether the shortest distance value from the marking user to the boundary of the preset area is smaller than a preset distance threshold value, if so, acquiring the position of the job to which the actual statistics of the marking user in a preset first time range belongs;
judging whether the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, if so, obtaining the probability that the actual statistics of the position of the mark user is within the boundary of the preset area;
judging whether the probability that the marking user actually counts the position of the corresponding job to be in the boundary of the preset area is larger than a preset probability threshold value, if so, setting the corresponding marking user as the job population of the preset area; if not, the corresponding marked user is not the occupancy population of the preset area.
It should be noted that, in order to prevent deviation of some user signaling data, after user information in the staff table is deleted, the deleted user is marked and set as a marked user, if the preset first time range is set to 15 days, the staff position of the marked user in the actual statistics of 15 days is recorded, according to whether the staff position of the marked user in the actual statistics of the preset first time range is judged in the boundary of the preset area, the probability that the staff position of the marked user in the actual statistics is in the boundary of the preset area is determined, if the corresponding probability is greater than the preset probability threshold, the staff population of the corresponding marked user as the preset area is indicated, and the preset distance threshold and the preset probability threshold are set by a person skilled in the art.
According to the embodiment of the present invention, if yes, the probability that the marking user actually counts the position of the job to which the marking user belongs is within the boundary of the preset area is obtained, which specifically includes:
when the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, recording that the actual statistics of the position of the mark user is once within the boundary of the preset area;
when the actual statistical position of the mark user in the preset first time range is not within the boundary of the preset area, recording that the actual statistical position of the mark user is not within the boundary of the preset area once;
acquiring the number of times that the marking user actually counts the position of the job to be in the boundary of the preset area and the number of times that the marking user is not in the boundary of the preset area;
the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area and the number of times that the marking user is not within the boundary of the preset area are accumulated, and the total number of times is obtained;
dividing the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area by the total number of times to obtain the probability that the marking user actually counts the position of the belonged position within the boundary of the preset area.
It should be noted that, the probability that the marking user actually counts the position of the home position within the boundary of the preset area is equal to the number of times that the marking user actually counts the position of the home position within the boundary of the preset area divided by the total number of times.
According to an embodiment of the present invention, further comprising:
acquiring an area value in a preset area and information of the number of the living population;
dividing the number of the occupancy population by the area value in the preset area to obtain population density of the corresponding preset area;
judging whether the population density of the preset area is larger than a preset population density threshold value, if so, triggering population prompt information;
and sending the population prompt information to a preset management end for display.
It should be noted that, when the population density in the preset area is greater than the preset population density threshold, the population in the corresponding preset area is too compact, so that the population prompt information is triggered, and the preset proportion threshold is set by a person skilled in the art.
According to an embodiment of the present invention, further comprising:
extracting the residence time values of the users in the preselected user group in the corresponding three base stations;
obtaining a maximum time value according to the time values of the users in the preselected user group in the corresponding three base stations;
judging whether the maximum time value is larger than a preset time threshold value, if so, the users in the corresponding preselected user group are reasonable users; if not, deleting the users in the corresponding preselected user group.
It should be noted that, the time value of the user in the pre-selected user group corresponding to the three base stations is the sum of the residence times of the user in the pre-selected user group corresponding to the three base stations, for example, the preset time threshold is 8 hours, when the sum of the residence times of the user in the pre-selected user group corresponding to the three base stations exceeds 8 hours, the user in the pre-selected user group is a reasonable user, otherwise, the user in the pre-selected user group is deleted.
Fig. 2 shows a block diagram of a small area occupancy demographic system based on signaling data in accordance with the present invention.
As shown in fig. 2, a second aspect of the present invention provides a signaling data based small area occupancy demographics system 2, comprising a memory 21 and a processor 22, wherein the memory stores a signaling data based small area occupancy demographics method program, which when executed by the processor, implements the steps of:
acquiring staff information;
based on a preset base station, extracting users belonging to the preset base station in residence or workplace in a staff table as a preselected user group;
acquiring three base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of corresponding three base stations;
obtaining the actual statistics of the position of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations;
judging whether the user actually counts the position of the job to which the user belongs to be within the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information.
According to the embodiment of the invention, the staff table stores user information and information that the user living or working place belongs to the base station, the preset base station is a boundary of a preset area and a base station in a buffer area range, the buffer area ranges set in different scenes are different, wherein the buffer area ranges set in areas with more population and denser base stations are smaller, the buffer area ranges set in areas with more remote positions and less base stations are larger, for example, the boundary of the preset area is outwards expanded by 20 meters to be set as the buffer area range corresponding to the preset area, and the buffer area ranges are set by a person skilled in the art according to actual requirements.
According to an embodiment of the present invention, the step of obtaining staff information specifically includes:
acquiring mobile phone signaling data information of a user;
obtaining behavior track information of a corresponding user according to the mobile phone signaling data information of the user;
obtaining a base station with the longest residence time of the corresponding user according to the obtained behavior track information of the corresponding user;
and setting the base station with the longest residence time of the corresponding user as the home base station, and sending the home base station to a preset table for storage to obtain the job-holding table.
It should be noted that the behavior track information of the corresponding user includes the duration of residence of the corresponding user on the behavior track and the base station to which the corresponding user belongs, the duration of residence of the user on the behavior track is arranged in order from small to large, the base station with the longest residence time and on the behavior track is extracted, the base station with the longest residence time of the user is set as the home base station, and the preset table stores the user information.
According to the embodiment of the invention, the step of obtaining the behavior track information of the corresponding user according to the mobile phone signaling data information of the user specifically comprises the following steps:
obtaining a base station which the corresponding user passes through and time point information of the base station according to the mobile phone signaling data information of the user;
and connecting the positions of the users according to the time sequence of the users passing through the base stations by taking the base stations corresponding to the users as the positions of the users, so as to obtain the behavior track information of the users.
It should be noted that, the behavior track of the user is formed by drawing a connection line of the base station according to the time sequence of the user, and the behavior track information of the user includes the information of the base station of the user and the time length of the corresponding user in the base station of the user.
According to the embodiment of the invention, the formula for actually counting the position of the corresponding job by the user is obtained according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations, specifically:
setting the position of the position to which the user actually statistics belongsThe formula is
Wherein the method comprises the steps ofRespectively representing longitude and latitude of corresponding three base stations, < ->Respectively represent weight values corresponding to three base stations, and +.>
It should be noted that, according to the information of three base stations with longest residence time of the user in the pre-selected user group in the preset time period, the corresponding user actual statistics of the position of the corresponding user is obtained through calculation; weighting the base station position where the user resides according to the weight value of the base station, whereinRepresenting the longitude and latitude of the base station with the number 1; />Representing the longitude and latitude of the base station with the number of 2; />Representing the longitude and latitude of the base station with the number of 2; wherein->Representing longitude, & gt of base station>Representing the latitude of the base station; wherein->A weight value representing number 1; />A weight value representing the base station of number 2; />The weight value of the base station of number 3 is indicated.
According to the embodiment of the invention, the step of acquiring the weight value of the corresponding base station specifically comprises the following steps:
according to the information of three base stations with highest residence time of users in the preselected user group in a preset time period, obtaining residence time values of the users in the corresponding three base stations, and setting the residence time values as respectivelyThe method comprises the steps of carrying out a first treatment on the surface of the Then->,/>
The longer the user resides in the base station location, the higher the base station location weight value.
According to an embodiment of the present invention, if not, deleting the corresponding user information includes:
extracting the deleted user, and setting the user as a marked user;
acquiring the longitude and latitude of the position of the mark user to which the mark user actually belongs;
according to the boundary of the preset area and the longitude and latitude of the position of the job to which the marking user actually belongs, obtaining the shortest distance value from the marking user to the boundary of the preset area;
judging whether the shortest distance value from the marking user to the boundary of the preset area is smaller than a preset distance threshold value, if so, acquiring the position of the job to which the actual statistics of the marking user in a preset first time range belongs;
judging whether the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, if so, obtaining the probability that the actual statistics of the position of the mark user is within the boundary of the preset area;
judging whether the probability that the marking user actually counts the position of the corresponding job to be in the boundary of the preset area is larger than a preset probability threshold value, if so, setting the corresponding marking user as the job population of the preset area; if not, the corresponding marked user is not the occupancy population of the preset area.
It should be noted that, in order to prevent deviation of some user signaling data, after user information in the staff table is deleted, the deleted user is marked and set as a marked user, if the preset first time range is set to 15 days, the staff position of the marked user in the actual statistics of 15 days is recorded, according to whether the staff position of the marked user in the actual statistics of the preset first time range is judged in the boundary of the preset area, the probability that the staff position of the marked user in the actual statistics is in the boundary of the preset area is determined, if the corresponding probability is greater than the preset probability threshold, the staff population of the corresponding marked user as the preset area is indicated, and the preset distance threshold and the preset probability threshold are set by a person skilled in the art.
According to the embodiment of the present invention, if yes, the probability that the marking user actually counts the position of the job to which the marking user belongs is within the boundary of the preset area is obtained, which specifically includes:
when the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, recording that the actual statistics of the position of the mark user is once within the boundary of the preset area;
when the actual statistical position of the mark user in the preset first time range is not within the boundary of the preset area, recording that the actual statistical position of the mark user is not within the boundary of the preset area once;
acquiring the number of times that the marking user actually counts the position of the job to be in the boundary of the preset area and the number of times that the marking user is not in the boundary of the preset area;
the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area and the number of times that the marking user is not within the boundary of the preset area are accumulated, and the total number of times is obtained;
dividing the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area by the total number of times to obtain the probability that the marking user actually counts the position of the belonged position within the boundary of the preset area.
It should be noted that, the probability that the marking user actually counts the position of the home position within the boundary of the preset area is equal to the number of times that the marking user actually counts the position of the home position within the boundary of the preset area divided by the total number of times.
According to an embodiment of the present invention, further comprising:
acquiring an area value in a preset area and information of the number of the living population;
dividing the number of the occupancy population by the area value in the preset area to obtain population density of the corresponding preset area;
judging whether the population density of the preset area is larger than a preset population density threshold value, if so, triggering population prompt information;
and sending the population prompt information to a preset management end for display.
It should be noted that, when the population density in the preset area is greater than the preset population density threshold, the population in the corresponding preset area is too compact, so that the population prompt information is triggered, and the preset proportion threshold is set by a person skilled in the art.
According to an embodiment of the present invention, further comprising:
extracting the residence time values of the users in the preselected user group in the corresponding three base stations;
obtaining a maximum time value according to the time values of the users in the preselected user group in the corresponding three base stations;
judging whether the maximum time value is larger than a preset time threshold value, if so, the users in the corresponding preselected user group are reasonable users; if not, deleting the users in the corresponding preselected user group.
It should be noted that, the time value of the user in the pre-selected user group corresponding to the three base stations is the sum of the residence times of the user in the pre-selected user group corresponding to the three base stations, for example, the preset time threshold is 8 hours, when the sum of the residence times of the user in the pre-selected user group corresponding to the three base stations exceeds 8 hours, the user in the pre-selected user group is a reasonable user, otherwise, the user in the pre-selected user group is deleted.
The invention discloses a small-area occupancy demographics method and a system based on signaling data, wherein the method comprises the steps of acquiring occupancy list information; based on a preset base station, extracting users belonging to the preset base station in residence or workplace in a staff table as a preselected user group; acquiring three base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of corresponding three base stations; obtaining the actual statistics of the position of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations; judging whether the user actually counts the position of the job to which the user belongs to be within the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information. The invention simulates the longitude and latitude coordinate point of the actual position of the user according to the jump condition of the base station during the static period of the user by bypassing the existing base station position to replace the actual position of the user, thereby flexibly judging the small area of the user, such as buildings, cells and the like, and being convenient for the modification of the statistical caliber and the like.
In the several embodiments provided in this application, it should be understood that the disclosed apparatus and method may be implemented in other ways. The above described device embodiments are only illustrative, e.g. the division of the units is only one logical function division, and there may be other divisions in practice, such as: multiple units or components may be combined or may be integrated into another system, or some features may be omitted, or not performed. In addition, the various components shown or discussed may be coupled or directly coupled or communicatively coupled to each other via some interface, whether indirectly coupled or communicatively coupled to devices or units, whether electrically, mechanically, or otherwise.
The units described above as separate components may or may not be physically separate, and components shown as units may or may not be physical units; can be located in one place or distributed to a plurality of network units; some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in each embodiment of the present invention may be integrated in one processing unit, or each unit may be separately used as one unit, or two or more units may be integrated in one unit; the integrated units may be implemented in hardware or in hardware plus software functional units.
Those of ordinary skill in the art will appreciate that: all or part of the steps for implementing the above method embodiments may be implemented by hardware related to program instructions, and the foregoing program may be stored in a computer readable storage medium, where the program, when executed, performs steps including the above method embodiments; and the aforementioned storage medium includes: a mobile storage device, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a magnetic disk or an optical disk, or the like, which can store program codes.
Alternatively, the above-described integrated units of the present invention may be stored in a computer-readable storage medium if implemented in the form of software functional modules and sold or used as separate products. Based on such understanding, the technical solutions of the embodiments of the present invention may be embodied in essence or a part contributing to the prior art in the form of a software product stored in a storage medium, including several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the methods described in the embodiments of the present invention. And the aforementioned storage medium includes: a removable storage device, ROM, RAM, magnetic or optical disk, or other medium capable of storing program code.

Claims (7)

1. A method of small area occupancy demographics based on signaling data, comprising:
acquiring staff information;
based on a preset base station, extracting users belonging to the preset base station in residence or workplace in a staff table as a preselected user group;
acquiring three base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of corresponding three base stations;
obtaining the actual statistics of the position of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations;
judging whether the user actually counts the position of the job to which the user belongs to be within the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information;
the formula for actually counting the position of the corresponding job by the user is obtained according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations, and specifically comprises the following steps:
setting the position of the user to which the actual statistics belongsThe formula is
Wherein the method comprises the steps ofRespectively representing the longitude and latitude of the corresponding three base stations,respectively represent weight values corresponding to three base stations, and +.>
The step of obtaining the weight value of the corresponding base station specifically comprises the following steps:
according to the information of three base stations with highest residence time of users in the preselected user group in a preset time period, obtaining residence time values of the users in the corresponding three base stations, and setting the residence time values as respectivelyThe method comprises the steps of carrying out a first treatment on the surface of the Then->,/>
If not, deleting the corresponding user information, which specifically includes:
extracting the deleted user, and setting the user as a marked user;
acquiring the longitude and latitude of the position of the mark user to which the mark user actually belongs;
according to the boundary of the preset area and the longitude and latitude of the position of the job to which the marking user actually belongs, obtaining the shortest distance value from the marking user to the boundary of the preset area;
judging whether the shortest distance value from the marking user to the boundary of the preset area is smaller than a preset distance threshold value, if so, acquiring the position of the job to which the actual statistics of the marking user in a preset first time range belongs;
judging whether the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, if so, obtaining the probability that the actual statistics of the position of the mark user is within the boundary of the preset area;
judging whether the probability that the marking user actually counts the position of the corresponding job to be in the boundary of the preset area is larger than a preset probability threshold value, if so, setting the corresponding marking user as the job population of the preset area; if not, the corresponding marked user is not the occupancy population of the preset area.
2. A method of small area occupancy demographics based on signaling data as claimed in claim 1, wherein said step of obtaining occupancy table information comprises:
acquiring mobile phone signaling data information of a user;
obtaining behavior track information of a corresponding user according to the mobile phone signaling data information of the user;
obtaining a base station with the longest residence time of the corresponding user according to the obtained behavior track information of the corresponding user;
and setting the base station with the longest residence time of the corresponding user as the home base station, and sending the home base station to a preset table for storage to obtain the job-holding table.
3. The method for small-area occupancy demographics based on signaling data according to claim 2, wherein the step of obtaining the behavior trace information of the corresponding user according to the signaling data information of the mobile phone of the user specifically comprises:
obtaining a base station which the corresponding user passes through and time point information of the base station according to the mobile phone signaling data information of the user;
and connecting the positions of the users according to the time sequence of the users passing through the base stations by taking the base stations corresponding to the users as the positions of the users, so as to obtain the behavior track information of the users.
4. The method of claim 1, wherein if so, obtaining a probability that the marking user actually counts the occupancy location within the boundary of the preset area comprises:
when the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, recording that the actual statistics of the position of the mark user is once within the boundary of the preset area;
when the actual statistical position of the mark user in the preset first time range is not within the boundary of the preset area, recording that the actual statistical position of the mark user is not within the boundary of the preset area once;
acquiring the number of times that the marking user actually counts the position of the job to be in the boundary of the preset area and the number of times that the marking user is not in the boundary of the preset area;
the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area and the number of times that the marking user is not within the boundary of the preset area are accumulated, and the total number of times is obtained;
dividing the number of times that the marking user actually counts the position of the belonged position within the boundary of the preset area by the total number of times to obtain the probability that the marking user actually counts the position of the belonged position within the boundary of the preset area.
5. A signaling data based small area occupancy demographics system comprising a memory and a processor, wherein the memory stores a signaling data based small area occupancy demographics method program, which when executed by the processor performs the steps of:
acquiring staff information;
based on a preset base station, extracting users belonging to the preset base station in residence or workplace in a staff table as a preselected user group;
acquiring three base station information with highest residence time of users in a preselected user group in a preset time period, and extracting longitude and latitude information of corresponding three base stations;
obtaining the actual statistics of the position of the user according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations;
judging whether the user actually counts the position of the job to which the user belongs to be within the boundary of the preset area, if so, the corresponding user is the job population of the preset area; if not, deleting the corresponding user information;
the formula for actually counting the position of the corresponding job by the user is obtained according to the longitude and latitude of the corresponding three base stations and the weight values of the corresponding base stations, and specifically comprises the following steps:
setting the position of the user to which the actual statistics belongsThe formula is
Wherein the method comprises the steps ofRespectively representing the longitude and latitude of the corresponding three base stations,respectively represent weight values corresponding to three base stations, and +.>
The step of obtaining the weight value of the corresponding base station specifically comprises the following steps:
according to the information of three base stations with highest residence time of users in the preselected user group in a preset time period, obtaining residence time values of the users in the corresponding three base stations, and setting the residence time values as respectivelyThe method comprises the steps of carrying out a first treatment on the surface of the Then->,/>
If not, deleting the corresponding user information, which specifically includes:
extracting the deleted user, and setting the user as a marked user;
acquiring the longitude and latitude of the position of the mark user to which the mark user actually belongs;
according to the boundary of the preset area and the longitude and latitude of the position of the job to which the marking user actually belongs, obtaining the shortest distance value from the marking user to the boundary of the preset area;
judging whether the shortest distance value from the marking user to the boundary of the preset area is smaller than a preset distance threshold value, if so, acquiring the position of the job to which the actual statistics of the marking user in a preset first time range belongs;
judging whether the actual statistics of the position of the mark user in the preset first time range is within the boundary of the preset area, if so, obtaining the probability that the actual statistics of the position of the mark user is within the boundary of the preset area;
judging whether the probability that the marking user actually counts the position of the corresponding job to be in the boundary of the preset area is larger than a preset probability threshold value, if so, setting the corresponding marking user as the job population of the preset area; if not, the corresponding marked user is not the occupancy population of the preset area.
6. A small area occupancy demographic system based on signaling data in accordance with claim 5 wherein said step of obtaining occupancy table information comprises:
acquiring mobile phone signaling data information of a user;
obtaining behavior track information of a corresponding user according to the mobile phone signaling data information of the user;
obtaining a base station with the longest residence time of the corresponding user according to the obtained behavior track information of the corresponding user;
and setting the base station with the longest residence time of the corresponding user as the home base station, and sending the home base station to a preset table for storage to obtain the job-holding table.
7. The small area occupancy demographics system based on signaling data of claim 6, wherein the step of obtaining behavior trace information of the corresponding user according to the signaling data information of the mobile phone of the user specifically comprises:
obtaining a base station which the corresponding user passes through and time point information of the base station according to the mobile phone signaling data information of the user;
and connecting the positions of the users according to the time sequence of the users passing through the base stations by taking the base stations corresponding to the users as the positions of the users, so as to obtain the behavior track information of the users.
CN202311452812.5A 2023-11-03 2023-11-03 Method and system for demographics of small-area occupancy based on signaling data Active CN117177177B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311452812.5A CN117177177B (en) 2023-11-03 2023-11-03 Method and system for demographics of small-area occupancy based on signaling data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311452812.5A CN117177177B (en) 2023-11-03 2023-11-03 Method and system for demographics of small-area occupancy based on signaling data

Publications (2)

Publication Number Publication Date
CN117177177A CN117177177A (en) 2023-12-05
CN117177177B true CN117177177B (en) 2024-02-27

Family

ID=88943587

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311452812.5A Active CN117177177B (en) 2023-11-03 2023-11-03 Method and system for demographics of small-area occupancy based on signaling data

Country Status (1)

Country Link
CN (1) CN117177177B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011021606A1 (en) * 2009-08-17 2011-02-24 株式会社エヌ・ティ・ティ・ドコモ Population fluidity information generation system and population fluidity information generation method
KR20140056461A (en) * 2012-10-25 2014-05-12 에스케이텔레콤 주식회사 Supporting method for forecasting population density and apparatus supporting the same
CN111615054A (en) * 2020-05-25 2020-09-01 和智信(山东)大数据科技有限公司 Population analysis method and device
CN115665677A (en) * 2022-10-14 2023-01-31 深圳市规划国土发展研究中心 Method and system for acquiring regional population based on mobile phone signaling data
CN115866547A (en) * 2023-03-01 2023-03-28 北京融信数联科技有限公司 Fixed area tourist counting method, system and storage medium based on signaling data

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011021606A1 (en) * 2009-08-17 2011-02-24 株式会社エヌ・ティ・ティ・ドコモ Population fluidity information generation system and population fluidity information generation method
KR20140056461A (en) * 2012-10-25 2014-05-12 에스케이텔레콤 주식회사 Supporting method for forecasting population density and apparatus supporting the same
CN111615054A (en) * 2020-05-25 2020-09-01 和智信(山东)大数据科技有限公司 Population analysis method and device
CN115665677A (en) * 2022-10-14 2023-01-31 深圳市规划国土发展研究中心 Method and system for acquiring regional population based on mobile phone signaling data
CN115866547A (en) * 2023-03-01 2023-03-28 北京融信数联科技有限公司 Fixed area tourist counting method, system and storage medium based on signaling data

Also Published As

Publication number Publication date
CN117177177A (en) 2023-12-05

Similar Documents

Publication Publication Date Title
CN110401779A (en) A kind of method, apparatus and computer readable storage medium identifying telephone number
CN111524609B (en) Method and system for generating screening model and screening infectious disease high-risk infected people
CN111626754B (en) Card-keeping user identification method and device
CN104702804A (en) Method and device for marking number
CN112954626A (en) Mobile phone signaling data analysis method and device, electronic equipment and storage medium
CN111148018A (en) Method and device for identifying and positioning regional value based on communication data
CN110909263B (en) Method and device for determining companion relationship of identity characteristics
CN108985048A (en) Simulator recognition methods and relevant apparatus
CN117177177B (en) Method and system for demographics of small-area occupancy based on signaling data
CN114024737B (en) Method, apparatus and computer readable storage medium for determining live room volume
CN114662772A (en) Traffic noise early warning method, model training method, device, equipment and medium
CN108600961A (en) Preparation method and device, equipment, the storage medium of user&#39;s similarity
CN111669710B (en) Demographic deduplication method
CN111538652B (en) Application control testing method and related equipment
CN109413459B (en) User recommendation method and related equipment in live broadcast platform
CN115967906A (en) User resident position identification method, terminal, electronic device and storage medium
CN116561508B (en) Outlier detection method, system and medium for population data based on big data
CN115150749B (en) High-risk roaming user positioning method, equipment, device and storage medium
CN117473428A (en) Service promotion method, device, equipment and storage medium
CN116957520A (en) Big data-based loss of business rate monitoring method, system and storage medium
CN114169458B (en) Fraudster identification method and device, storage medium and computer equipment
CN110933605B (en) Excavation method and device for moving target
CN113742571A (en) Message pushing method and device based on big data and storage medium
CN111143333B (en) Labeling data processing method, device, equipment and computer readable storage medium
CN107770129A (en) Method and apparatus for detecting user behavior

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant