CN102945290A - Hot microblog topic digging device and method - Google Patents
Hot microblog topic digging device and method Download PDFInfo
- Publication number
- CN102945290A CN102945290A CN2012105078624A CN201210507862A CN102945290A CN 102945290 A CN102945290 A CN 102945290A CN 2012105078624 A CN2012105078624 A CN 2012105078624A CN 201210507862 A CN201210507862 A CN 201210507862A CN 102945290 A CN102945290 A CN 102945290A
- Authority
- CN
- China
- Prior art keywords
- microblogging
- keyword sets
- popular keyword
- classification
- topic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a hot microblog topic digging device and method. The device comprises an acquiring module, an extracting module, a computing module and an ordering module, wherein the acquiring module is applicable to acquiring microblog information through an open interface, wherein the microblog information includes microblog contents and microblog parameters; the extracting module is applicable to carrying out words segmentation on the acquired microblog contents, and extracting hot key words; the computing module is applicable to counting the number of microblogs relating to the hot key words, and carrying out weighting computation according to the number of the microblogs and the microblog parameters of the corresponding microblogs, so as to acquire hot values of the hot key words; and the ordering module is applicable to ordering the hot values of the hot key words, and acquiring a hot microblog topic rank. By utilizing the technical scheme disclosed by the invention, hot topics of the microblogs can be accurately judged, so that the objective fact of the Internet public opinion can be relatively reflected through a digging result.
Description
Technical field
The present invention relates to field of Internet communication, particularly relate to a kind of microblogging much-talked-about topic excavating gear and method.
Background technology
In the prior art, development along with the internet, microblogging becomes the important channel of people's obtaining information, exchange of information, a large amount of netizens deliver the suggestion of oneself and disclose all kinds of news in microblogging, there is every day thousands of topic to produce from microblogging, how from the microblogging magnanimity information, obtains faster netizen's focus and will dynamically play the directiveness effect to understanding social development situation, grasp public opinion.
The microblogging focus method for digging that generally adopts at present is by the microblogging quantity under the microblog topic in the special time period being compared, obtain the hottest microblog topic by the quantity ordering, and the microblogging quantity unencryped word topic of more speaking more is more active.But there is following problem in technique scheme: because technique scheme is only added up the microblogging quantity of single topic, the topic of therefore easily waterborne troops's violence being issued is mistaken for much-talked-about topic; And, technique scheme is not thought of as microblogging and transmits number and the several factors to microblog topic of microblogging comment, cause the ardent microblog topic of some comment to be left in the basket, in addition, technique scheme is not considered the microblogging authenticated factor of (that is, adding V user) yet, and authenticated participates in hot issue of more events, to sum up, technique scheme of the prior art can not comprehensive and accurately be excavated the microblogging much-talked-about topic.
Summary of the invention
In view of the above problems, the present invention has been proposed in order to a kind of microblogging much-talked-about topic excavating gear and method that overcomes the problems referred to above or address the above problem at least in part is provided.
The invention provides a kind of microblogging much-talked-about topic excavating gear, comprising: acquisition module, be suitable for gathering micro-blog information by open interface, wherein, micro-blog information comprises: microblogging content and microblogging parameter; Abstraction module is suitable for the microblogging content that gathers is carried out participle, and extracts popular keyword sets; Computing module is suitable for the microblogging quantity that relates to popular keyword sets is added up, and is weighted calculating according to the microblogging parameter of microblogging quantity and corresponding microblogging, obtains the temperature value of popular keyword sets; Order module is suitable for the temperature value of popular keyword sets is sorted, and obtains microblogging much-talked-about topic seniority among brothers and sisters.
Alternatively, acquisition module is further adapted for: the micro-blog information that gathers this door microblogging by the open interface of a door microblogging appointment.
Alternatively, said apparatus also comprises: sort module, be suitable for adopting the method for automatic cluster that microblogging is classified according to the microblogging content that gathers, and obtain different microblogging classifications.
Alternatively, above-mentioned abstraction module is further adapted for: the microblogging content under each the microblogging classification that gathers is carried out respectively participle, and extract respectively the popular keyword sets under each microblogging classification.
Alternatively, above-mentioned abstraction module is further adapted for: extract one or more centre words the microblogging content under each the microblogging classification that gathers; The centre word that extracts from same microblogging content is sorted, and the centre word after will sorting makes up, obtain the center phrase; Add up the related microblogging quantity of each center phrase under each microblogging classification, and from the phrase of center, extract popular keyword sets under each microblogging classification according to microblogging quantity.
Alternatively, above-mentioned abstraction module further comprises: filter submodule, be suitable for filtering the rubbish phrase according to rubbish phrase database from the phrase of center.
Alternatively, above-mentioned computing module is further adapted for: the microblogging quantity that relates to popular keyword sets under the same microblogging classification is added up, and be weighted calculating according to the microblogging parameter of microblogging quantity and corresponding microblogging, obtain the temperature value of popular keyword sets under each microblogging classification.
Alternatively, above-mentioned microblogging parameter further comprises following one or more combination: microblogging is always transmitted number, microblogging general comment number, the microblogging authenticated transmits number and the microblogging authenticated is commented on number.
Alternatively, above-mentioned computing module is further adapted for: the temperature value of obtaining respectively popular keyword sets under each microblogging classification according to following formula: the microblogging quantity * microblogging quantity weight coefficient+microblogging of the popular keyword sets of the temperature value of popular keyword sets=relate to is always transmitted number * and is always transmitted number weight coefficient+microblogging general comment and count the * general comment and count weight coefficient+microblogging authenticated and transmit number * authenticated and transmit number weight coefficient+microblogging authenticated comment number * authenticated comment number weight coefficient.
Alternatively, above-mentioned order module is further adapted for: the temperature value to popular keyword sets under each microblogging classification is carried out descending sort, and the microblogging much-talked-about topic of obtaining respectively under each microblogging classification is ranked and total microblogging much-talked-about topic seniority among brothers and sisters.
Alternatively, said apparatus also comprises: acquisition module is suitable for obtaining the related microblogging content of each microblogging much-talked-about topic in the microblogging much-talked-about topic seniority among brothers and sisters; Display module is suitable for showing the microblogging content that corresponding microblogging much-talked-about topic is related according to user's request or active to the user.
The present invention also provides a kind of microblogging much-talked-about topic method for digging, comprising: gather micro-blog information by open interface, wherein, micro-blog information comprises: microblogging content and microblogging parameter; The microblogging content that gathers is carried out participle, and extract popular keyword sets; The microblogging quantity that relates to popular keyword sets is added up, and be weighted calculating according to the microblogging parameter of microblogging quantity and corresponding microblogging, obtain the temperature value of popular keyword sets; Temperature value to popular keyword sets sorts, and obtains microblogging much-talked-about topic seniority among brothers and sisters.
Alternatively, above-mentioned collection micro-blog information further comprises: the micro-blog information that gathers this door microblogging by the open interface of a door microblogging appointment.
Alternatively, gather after the micro-blog information, said method also comprises: adopt the method for automatic cluster that microblogging is classified according to the microblogging content that gathers, obtain different microblogging classifications.
Alternatively, above-mentioned the microblogging content that gathers is carried out participle, and extract popular keyword sets and further comprise: the microblogging content under each the microblogging classification that gathers is carried out respectively participle, and extract respectively the popular keyword sets under each microblogging classification.
Alternatively, above-mentioned microblogging content under each the microblogging classification that gathers is carried out respectively participle, and the popular keyword sets that extracts respectively under each microblogging classification comprises further: extract one or more centre words the microblogging content under each the microblogging classification that gathers; The centre word that extracts from same microblogging content is sorted, and the centre word after will sorting makes up, obtain the center phrase; Add up the related microblogging quantity of each center phrase under each microblogging classification, and from the phrase of center, extract popular keyword sets under each microblogging classification according to microblogging quantity.
Alternatively, the centre word after the ordering is made up, obtain after the phrase of center, said method also comprises: filter the rubbish phrase according to rubbish phrase database from the phrase of center.
Alternatively, above-mentioned the microblogging quantity that relates to popular keyword sets is added up, and be weighted calculating according to the microblogging parameter of microblogging quantity and corresponding microblogging, the temperature value of obtaining popular keyword sets further comprises: the microblogging quantity that relates to popular keyword sets under the same microblogging classification is added up, and be weighted calculating according to the microblogging parameter of microblogging quantity and corresponding microblogging, obtain the temperature value of popular keyword sets under each microblogging classification.
Alternatively, above-mentioned microblogging parameter further comprises following one or more combination: microblogging is always transmitted number, microblogging general comment number, the microblogging authenticated transmits number and the microblogging authenticated is commented on number.
Alternatively, above-mentioned according to microblogging quantity, and the microblogging parameter of corresponding microblogging is weighted calculating, and the temperature value of obtaining popular keyword sets under each microblogging classification further comprises: the temperature value of obtaining respectively popular keyword sets under each microblogging classification according to following formula: the microblogging quantity * microblogging quantity weight coefficient+microblogging of the popular keyword sets of the temperature value of popular keyword sets=relate to is always transmitted number * and is always transmitted number weight coefficient+microblogging general comment and count the * general comment and count weight coefficient+microblogging authenticated and transmit number * authenticated and transmit number weight coefficient+microblogging authenticated comment number * authenticated comment number weight coefficient.
Alternatively, above-mentioned temperature value to popular keyword sets sorts, obtaining microblogging much-talked-about topic seniority among brothers and sisters further comprises: the temperature value to popular keyword sets under each microblogging classification is carried out descending sort, and the microblogging much-talked-about topic of obtaining respectively under each microblogging classification is ranked and total microblogging much-talked-about topic seniority among brothers and sisters.
Alternatively, obtain after the microblogging much-talked-about topic seniority among brothers and sisters, said method also comprises: obtain the related microblogging content of each microblogging much-talked-about topic in the microblogging much-talked-about topic seniority among brothers and sisters; Show the microblogging content that corresponding microblogging much-talked-about topic is related according to user's request or active to the user.
Beneficial effect of the present invention is as follows:
Calculate by carrying out hot word according to the microblogging content that gathers, and according to the microblogging parameter of obtaining the hot word that calculates is carried out temperature calculating, thereby can judge exactly the hot issue of microblogging, make Result more can reflect the objective fact of internet public opinion.
Above-mentioned explanation only is the general introduction of technical solution of the present invention, for can clearer understanding technological means of the present invention, and can be implemented according to the content of instructions, and for above and other objects of the present invention, feature and advantage can be become apparent, below especially exemplified by the specific embodiment of the present invention.
Description of drawings
By reading hereinafter detailed description of the preferred embodiment, various other advantage and benefits will become cheer and bright for those of ordinary skills.Accompanying drawing only is used for the purpose of preferred implementation is shown, and does not think limitation of the present invention.And in whole accompanying drawing, represent identical parts with identical reference symbol.In the accompanying drawings:
Fig. 1 is the structural representation of the microblogging much-talked-about topic excavating gear of one embodiment of the invention;
Fig. 2 is the synoptic diagram for the treatment of scheme of the abstraction module of one embodiment of the invention;
Fig. 3 is the microblogging parameter of one embodiment of the invention and the synoptic diagram of weight coefficient corresponding relation;
Fig. 4 is the process flow diagram of the microblogging much-talked-about topic method for digging of one embodiment of the invention.
Embodiment
Exemplary embodiment of the present disclosure is described below with reference to accompanying drawings in more detail.Although shown exemplary embodiment of the present disclosure in the accompanying drawing, yet should be appreciated that and to realize the disclosure and the embodiment that should do not set forth limits here with various forms.On the contrary, it is in order to understand the disclosure more thoroughly that these embodiment are provided, and can with the scope of the present disclosure complete convey to those skilled in the art.
In order to excavate fast the much-talked-about topic that occurs in the recent period on the microblogging, the difficult problem of microblogging focus is excavated in solution from magnanimity microblogging data, the invention provides a kind of microblogging much-talked-about topic excavating gear and method, the embodiment of the invention utilizes Technologies of Automated Text Classification, hot word computing technique and temperature computing technique to carry out the excavation of microblogging much-talked-about topic.Wherein, text automatic classification refers to: utilize the principle of machine learning to rely on the model parameter behind the small-sample learning that text set (or other entities or object) is carried out the automatic classification mark according to certain taxonomic hierarchies or standard; Hot word computing technique refers to: automatically the web page text of Real-time Collection carried out participle, grouping merger, calculate high frequency focus keyword, and filter according to predefined dictionary and preset rules, export real-time internet hot spots vocabulary.The temperature computing technique refers to: automatically to the forwarding number of microblogging, comment on number, add the parameter such as V participation number and carry out statistical computation, and according to the predefine rule, the temperature value of output topic.
Below in conjunction with accompanying drawing and embodiment, the present invention is further elaborated.Should be appreciated that specific embodiment described herein only in order to explain the present invention, does not limit the present invention.
According to embodiments of the invention, a kind of microblogging much-talked-about topic excavating gear is provided, Fig. 1 is the structural representation of the microblogging much-talked-about topic excavating gear of one embodiment of the invention, as shown in Figure 1, microblogging much-talked-about topic excavating gear according to the embodiment of the invention comprises: acquisition module 10, abstraction module 12, computing module 14 and order module 16 below are described in detail the modules of the embodiment of the invention.
Particularly, acquisition module 10 can gather by the open interface of a door microblogging appointment micro-blog information of this door microblogging.
In actual applications, different microblogging classification have different hot issues, and the topic temperature of different classification is also different, and for example, the hot issue temperature of field of finance and economics microblogging is more much lower than the hot issue temperature of amusement Eight Diagrams class microblogging.This just need to classify to microblog topic, makes the user check the microblogging focus according to different microblogging classification.
Preferably, in embodiments of the present invention, (for example reflect more targetedly a certain field for enough, military affairs, politics, the people's livelihood, society, the world, amusement etc.) the microblogging much-talked-about topic, microblogging much-talked-about topic excavating gear according to the embodiment of the invention also comprises: sort module, be suitable for adopting the method for automatic cluster that microblogging is classified according to the microblogging content that gathers, obtain different microblogging classifications.So that other modules when carrying out subsequent treatment, can be carried out respectively for dissimilar microbloggings the excavation of much-talked-about topic.
As mentioned above, the embodiment of the invention adopts the method for automatic cluster to come the microblogging classification, wherein, automatic cluster refers to: inside or the surface of being investigated object by computing machine according to quilt, according to certain requirement (for example, the restricted number of classification, the degree etc. of getting close to of homogeneous object), the process that the object of close, similar or same characteristic features is condensed together.The microblogging content is carried out automatic classification can be divided into automotive-type microblogging, amusement class microblogging, finance and economic microblogging etc.
Classification based on sort module is processed, and abstraction module 12 need to carry out respectively participle to the microblogging content under each the microblogging classification that gathers, and extracts respectively the popular keyword sets under each microblogging classification.
Particularly, abstraction module 12 need to be handled as follows: at first extract one or more centre words the microblogging content under each the microblogging classification that gathers, that is to say that a microblogging may have a plurality of centre words; Subsequently, the centre word that extracts from same microblogging content is sorted, for example, the centre word of a microblogging extraction is bca, becomes abc after the ordering; After ordering, centre word is made up, obtain the center phrase; Wherein, carrying out the centre word combination refers to: according to
The centre word that will belong to after the ordering of same microblogging content makes up, and wherein, n is the total number that belongs to the centre word of same text header, r≤n and 2≤r≤5, and for example, combinatorial formula is:
Can only keep 2-5 center phrase; At last, abstraction module 12 needs the related microblogging quantity of each center phrase under each microblogging classification of statistics, and extracts popular keyword sets under each microblogging classification according to microblogging quantity from the phrase of center.For example, when abstraction module 12 was analyzed all centers phrase in tabulate statistics, the appearance quantity that can add up by the hour the center phrase was found out popular keyword sets, and these popular keyword sets are exactly the hot issue of microblogging behind.When abstraction module 12 is analyzed all keyword sets in tabulate statistics, can form a popular keyword sets ranking list, add up each popular keyword sets behind microblogging quantity and by the descending sort of microblogging quantity.
In embodiments of the present invention, abstraction module 12 can further include: filter submodule, be suitable for filtering the rubbish phrase according to rubbish phrase database from the phrase of center.For example, remove as getting the winning number in a bond, seek advice from the rubbish phrase of class, wherein, above-mentioned rubbish phrase database is being managed background maintenance by the O﹠M personnel.
Below in conjunction with accompanying drawing, the processing of above-mentioned abstraction module 12 is illustrated.
Fig. 2 is the synoptic diagram for the treatment of scheme of the abstraction module of one embodiment of the invention, as shown in Figure 2:
Microblogging one: extract centre word b, a, c out, a, b, c after the ordering form phrase ab, bc, ac, abc;
Microblogging two: extract centre word c, b, d out, b, c, d after the ordering form phrase bc, cd, bd, bcd;
Microblogging three: extract centre word b, c out and form phrase bc;
The phrase seniority among brothers and sisters that forms of these three microbloggings is exactly so: bc(3), ab(1), ac(1), cd(1), bd(1), abc(1), bcd(1), thereby definite popular keyword sets is b+c.
Particularly, computing module 14 need to be added up the microblogging quantity that relates to popular keyword sets under the same microblogging classification, and be weighted calculating according to the microblogging parameter of microblogging quantity and corresponding microblogging, obtain the temperature value of popular keyword sets under each microblogging classification.
That is to say, after calculating popular keyword sets by hot word, computing module 14 needs to calculate these popular keyword sets microblogging parameter behind, the forwarding number of comprehensive microblogging, comment on number, add the microblogging parameter such as V participation number and carry out statistical computation, and according to the predefine rule, the temperature value of output topic.
Particularly, comprise that in the microblogging parameter microblogging is always transmitted number, microblogging general comment number, microblogging authenticated (namely adding V user) transmits number and the microblogging authenticated is commented in the situation of number, computing module 14 obtains respectively the temperature value of popular keyword sets under each microblogging classification according to following formula:
The microblogging quantity * microblogging quantity weight coefficient+microblogging of the popular keyword sets of the temperature value of popular keyword sets=relate to is always transmitted number * and is always transmitted number weight coefficient+microblogging general comment and count the * general comment and count weight coefficient+microblogging authenticated and transmit number * authenticated and transmit number weight coefficient+microblogging authenticated comment number * authenticated comment number weight coefficient.
Below in conjunction with accompanying drawing, the processing procedure that computing module 14 is calculated the temperature value of popular keyword sets is illustrated.
Fig. 3 is the microblogging parameter of one embodiment of the invention and the synoptic diagram of weight coefficient corresponding relation, and as shown in Figure 3, the temperature value computing formula of the popular keyword sets of computing module 14 is as follows:
Microblogging quantity+the microblogging of the popular keyword sets of topic temperature=relate to is always transmitted number+microblogging general comment number * 2+ microblogging authenticated and is transmitted number * 10+ microblogging authenticated comment number * 20.
For example: Diaoyu Island anthelion parade event, the center phrase that is drawn into are " Diaoyu Island+anthelion parade ", have 10000 pieces of microbloggings behind, these microblogging revolution accumulative totals are 300000, and comment number accumulative total is 200000, and wherein adding V forwarding number is 2000, adding V comment number is 1000, then:
Diaoyu Island topic temperature=10000+300000+200000 * 2+2000 * 10+1000 * 20;
Need to prove that the topic of different classification also is same computing method, that is, the popular keyword sets microblogging parameter behind of affiliated classification is added up.
Particularly, order module 16 need to be carried out descending sort to the temperature value of popular keyword sets under each microblogging classification, and the microblogging much-talked-about topic of obtaining respectively under each microblogging classification is ranked and total microblogging much-talked-about topic seniority among brothers and sisters.
Preferably, check behind microblogging content of each hot issue, see each microblogging that this microblog topic is discussed and check the microblogging that adds V user's issue that the microblogging much-talked-about topic excavating gear of the embodiment of the invention can also comprise for the ease of the user:
Acquisition module is suitable for obtaining the related microblogging content of each microblogging much-talked-about topic in the microblogging much-talked-about topic seniority among brothers and sisters;
Display module is suitable for showing the microblogging content that corresponding microblogging much-talked-about topic is related according to user's request or active to the user.
In sum, technical scheme by means of the embodiment of the invention, calculate by carrying out hot word according to the microblogging content that gathers, and according to the microblogging parameter of obtaining the hot word that calculates is carried out temperature and calculate, thereby can judge exactly the hot issue of microblogging, make Result more can reflect the objective fact of internet public opinion, in addition, by automatic classification technology microblogging is classified, can reflect more targetedly the microblogging much-talked-about topic of a certain field (such as military affairs, politics, the people's livelihood, society, the world, amusement etc.).
According to embodiments of the invention, a kind of microblogging much-talked-about topic method for digging is provided, Fig. 4 is the process flow diagram of the microblogging much-talked-about topic method for digging of one embodiment of the invention, as shown in Figure 4, comprises following processing according to the microblogging much-talked-about topic method for digging of the embodiment of the invention:
Step 401 gathers micro-blog information by open interface, and wherein, described micro-blog information comprises: microblogging content and microblogging parameter; Above-mentioned microblogging parameter can comprise following one or more combination: microblogging is always transmitted number, microblogging general comment number, microblogging authenticated (namely adding V user) transmits number and the microblogging authenticated is commented on number.In actual applications, the microblogging parameter can also comprise: microblogging bloger information, microblogging issuing time information etc.
Particularly, in step 401, can gather by the open interface of a door microblogging appointment micro-blog information of this door microblogging.
In actual applications, different microblogging classification have different hot issues, and the topic temperature of different classification is also different, and for example, the hot issue temperature of field of finance and economics microblogging is more much lower than the hot issue temperature of amusement Eight Diagrams class microblogging.This just need to classify to microblog topic, makes the user check the microblogging focus according to different microblogging classification.
Preferably, in embodiments of the present invention, (for example reflect more targetedly a certain field for enough, military affairs, politics, the people's livelihood, society, the world, amusement etc.) the microblogging much-talked-about topic, gather after the micro-blog information, can adopt the method for automatic cluster that microblogging is classified according to the described microblogging content that gathers, obtain different microblogging classifications.So that when carrying out subsequent treatment, can carry out respectively for dissimilar microbloggings the excavation of much-talked-about topic.
As mentioned above, the embodiment of the invention adopts the method for automatic cluster to come the microblogging classification, wherein, automatic cluster refers to: inside or the surface of being investigated object by computing machine according to quilt, according to certain requirement (for example, the restricted number of classification, the degree etc. of getting close to of homogeneous object), the process that the object of close, similar or same characteristic features is condensed together.The microblogging content is carried out automatic classification can be divided into automotive-type microblogging, amusement class microblogging, finance and economic microblogging etc.
Step 402 is carried out participle to the described microblogging content that gathers, and is extracted popular keyword sets;
Process based on above-mentioned microblogging classification, in step 402, need to carry out respectively participle to the microblogging content under each the microblogging classification that gathers, and extract respectively the popular keyword sets under each microblogging classification.
Particularly, step 402 need to be handled as follows: at first extract one or more centre words the microblogging content under each the microblogging classification that gathers, that is to say that a microblogging may have a plurality of centre words; Subsequently, the centre word that extracts from same microblogging content is sorted, for example, the centre word of a microblogging extraction is bca, becomes abc after the ordering; After ordering, centre word is made up, obtain the center phrase; Wherein, carrying out the centre word combination refers to: according to
The centre word that will belong to after the ordering of same microblogging content makes up, and wherein, n is the total number that belongs to the centre word of same text header, r≤n and 2≤r≤5, and for example, combinatorial formula is:
Can only keep 2-5 center phrase; At last, need the related microblogging quantity of each center phrase under each microblogging classification of statistics, and from the phrase of center, extract popular keyword sets under each microblogging classification according to microblogging quantity.For example, when all centers phrase was analyzed in tabulate statistics, the appearance quantity that can add up by the hour the center phrase was found out popular keyword sets, and these popular keyword sets are exactly the hot issue of microblogging behind.In the step 402, when all keyword sets are analyzed in tabulate statistics, can form a popular keyword sets ranking list, add up each popular keyword sets behind microblogging quantity and by the descending sort of microblogging quantity.
In embodiments of the present invention, the described centre word after the ordering is made up, obtain after the phrase of center, can also from the phrase of described center, filter the rubbish phrase according to rubbish phrase database.For example, remove as getting the winning number in a bond, seek advice from the rubbish phrase of class, wherein, above-mentioned rubbish phrase database is being managed background maintenance by the O﹠M personnel.
Below in conjunction with accompanying drawing, the processing of above-mentioned steps 402 is illustrated.As shown in Figure 2:
Microblogging one: extract centre word b, a, c out, a, b, c after the ordering form phrase ab, bc, ac, abc;
Microblogging two: extract centre word c, b, d out, b, c, d after the ordering form phrase bc, cd, bd, bcd;
Microblogging three: extract centre word b, c out and form phrase bc;
The phrase seniority among brothers and sisters that forms of these three microbloggings is exactly so: bc(3), ab(1), ac(1), cd(1), bd(1), abc(1), bcd(1), thereby definite popular keyword sets is b+c.
Step 403 is added up the microblogging quantity that relates to described popular keyword sets, and is weighted calculating according to the microblogging parameter of described microblogging quantity and corresponding microblogging, obtains the temperature value of described popular keyword sets;
Particularly, in step 403, need to add up the microblogging quantity that relates to popular keyword sets under the same microblogging classification, and be weighted calculating according to the microblogging parameter of microblogging quantity and corresponding microblogging, obtain the temperature value of popular keyword sets under each microblogging classification.
That is to say, after calculating popular keyword sets by hot word, need to calculate these popular keyword sets microblogging parameter behind, the forwarding number of comprehensive microblogging, comment on number, add the microblogging parameter such as V participation number and carry out statistical computation, and according to the predefine rule, the temperature value of output topic.
Particularly, comprise that in the microblogging parameter microblogging is always transmitted number, microblogging general comment number, microblogging authenticated (namely adding V user) transmits number and the microblogging authenticated is commented in the situation of number, can obtain respectively according to following formula the temperature value of popular keyword sets under each microblogging classification:
The microblogging quantity * microblogging quantity weight coefficient+microblogging of the popular keyword sets of the temperature value of popular keyword sets=relate to is always transmitted number * and is always transmitted number weight coefficient+microblogging general comment and count the * general comment and count weight coefficient+microblogging authenticated and transmit number * authenticated and transmit number weight coefficient+microblogging authenticated comment number * authenticated comment number weight coefficient.
Below in conjunction with accompanying drawing, the processing procedure of calculating the temperature value of popular keyword sets in the step 403 is illustrated.
As shown in Figure 3, the temperature value computing formula of popular keyword sets is as follows:
Microblogging quantity+the microblogging of the popular keyword sets of topic temperature=relate to is always transmitted number+microblogging general comment number * 2+ microblogging authenticated and is transmitted number * 10+ microblogging authenticated comment number * 20.
For example: Diaoyu Island anthelion parade event, the center phrase that is drawn into are " Diaoyu Island+anthelion parade ", have 10000 pieces of microbloggings behind, these microblogging revolution accumulative totals are 300000, and comment number accumulative total is 200000, and wherein adding V forwarding number is 2000, adding V comment number is 1000, then:
Diaoyu Island topic temperature=10000+300000+200000 * 2+2000 * 10+1000 * 20;
Need to prove that the topic of different classification also is same computing method, that is, the popular keyword sets microblogging parameter behind of affiliated classification is added up.
Step 404 sorts to the temperature value of described popular keyword sets, obtains microblogging much-talked-about topic seniority among brothers and sisters.
Particularly, in step 404, need to carry out descending sort to the temperature value of popular keyword sets under each microblogging classification, the microblogging much-talked-about topic of obtaining respectively under each microblogging classification is ranked and total microblogging much-talked-about topic seniority among brothers and sisters.
Preferably, check behind microblogging content of each hot issue, see each microblogging that this microblog topic is discussed and check the microblogging that adds V user's issue for the ease of the user, after obtaining microblogging much-talked-about topic seniority among brothers and sisters, also comprise according to the microblogging much-talked-about topic method for digging of the embodiment of the invention:
Obtain the related microblogging content of each microblogging much-talked-about topic in the described microblogging much-talked-about topic seniority among brothers and sisters;
Show the microblogging content that corresponding microblogging much-talked-about topic is related according to user's request or active to the user.
In sum, technical scheme by means of the embodiment of the invention, calculate by carrying out hot word according to the microblogging content that gathers, and according to the microblogging parameter of obtaining the hot word that calculates is carried out temperature and calculate, thereby can judge exactly the hot issue of microblogging, make Result more can reflect the objective fact of internet public opinion, in addition, by automatic classification technology microblogging is classified, can reflect more targetedly the microblogging much-talked-about topic of a certain field (such as military affairs, politics, the people's livelihood, society, the world, amusement etc.).
Intrinsic not relevant with any certain computer, virtual system or miscellaneous equipment with demonstration at this algorithm that provides.Various general-purpose systems also can be with using based on the teaching at this.According to top description, it is apparent constructing the desired structure of this type systematic.In addition, the present invention is not also for any certain programmed language.Should be understood that and to utilize various programming languages to realize content of the present invention described here, and the top description that language-specific is done is in order to disclose preferred forms of the present invention.
In the instructions that provides herein, a large amount of details have been described.Yet, can understand, embodiments of the invention can be put into practice in the situation of these details not having.In some instances, be not shown specifically known method, structure and technology, so that not fuzzy understanding of this description.
Similarly, be to be understood that, in order to simplify the disclosure and to help to understand one or more in each inventive aspect, in the description to exemplary embodiment of the present invention, each feature of the present invention is grouped together in single embodiment, figure or the description to it sometimes in the above.Yet the method for the disclosure should be construed to the following intention of reflection: namely the present invention for required protection requires the more feature of feature clearly put down in writing than institute in each claim.Or rather, as following claims reflected, inventive aspect was to be less than all features of the disclosed single embodiment in front.Therefore, follow claims of embodiment and incorporate clearly thus this embodiment into, wherein each claim itself is as independent embodiment of the present invention.
Those skilled in the art are appreciated that and can adaptively change and they are arranged in one or more equipment different from this embodiment the module in the equipment among the embodiment.Can be combined into a module or unit or assembly to the module among the embodiment or unit or assembly, and can be divided into a plurality of submodules or subelement or sub-component to them in addition.In such feature and/or process or unit at least some are mutually repelling, and can adopt any combination to disclosed all features in this instructions (comprising claim, summary and the accompanying drawing followed) and so all processes or the unit of disclosed any method or equipment make up.Unless in addition clearly statement, disclosed each feature can be by providing identical, being equal to or the alternative features of similar purpose replaces in this instructions (comprising claim, summary and the accompanying drawing followed).
In addition, those skilled in the art can understand, although embodiment more described herein comprise some feature rather than further feature included among other embodiment, the combination of the feature of different embodiment means and is within the scope of the present invention and forms different embodiment.For example, in the following claims, the one of any of embodiment required for protection can be used with array mode arbitrarily.
All parts embodiment of the present invention can realize with hardware, perhaps realizes with the software module of moving at one or more processor, and perhaps the combination with them realizes.It will be understood by those of skill in the art that and to use in practice microprocessor or digital signal processor (DSP) to realize according to some or all some or repertoire of parts in the microblogging much-talked-about topic excavating gear of the embodiment of the invention.The present invention can also be embodied as be used to part or all equipment or the device program (for example, computer program and computer program) of carrying out method as described herein.Such realization program of the present invention can be stored on the computer-readable medium, perhaps can have the form of one or more signal.Such signal can be downloaded from internet website and obtain, and perhaps provides at carrier signal, perhaps provides with any other form.
It should be noted above-described embodiment the present invention will be described rather than limit the invention, and those skilled in the art can design alternative embodiment in the situation of the scope that does not break away from claims.In the claims, any reference symbol between bracket should be configured to limitations on claims.Word " comprises " not to be got rid of existence and is not listed in element or step in the claim.Being positioned at word " " before the element or " one " does not get rid of and has a plurality of such elements.The present invention can realize by means of the hardware that includes some different elements and by means of the computing machine of suitably programming.In having enumerated the unit claim of some devices, several in these devices can be to come imbody by same hardware branch.The use of word first, second and C grade does not represent any order.Can be title with these word explanations.
Herein disclosed is A1, a kind of microblogging much-talked-about topic excavating gear, it is characterized in that, comprising: acquisition module, be suitable for gathering micro-blog information by open interface, wherein, described micro-blog information comprises: microblogging content and microblogging parameter; Abstraction module is suitable for the described microblogging content that gathers is carried out participle, and extracts popular keyword sets; Computing module is suitable for the microblogging quantity that relates to described popular keyword sets is added up, and is weighted calculating according to the microblogging parameter of described microblogging quantity and corresponding microblogging, obtains the temperature value of described popular keyword sets; Order module is suitable for the temperature value of described popular keyword sets is sorted, and obtains microblogging much-talked-about topic seniority among brothers and sisters.A2, such as the described device of A1, it is characterized in that described acquisition module is further adapted for: the micro-blog information that gathers this door microblogging by the open interface of a door microblogging appointment.A3, such as the described device of A1, it is characterized in that described device also comprises: sort module, be suitable for adopting the method for automatic cluster that microblogging is classified according to the described microblogging content that gathers, obtain different microblogging classifications.A4, such as the described device of A3, it is characterized in that described abstraction module is further adapted for: the microblogging content under each the microblogging classification that gathers is carried out respectively participle, and extract respectively the popular keyword sets under each microblogging classification.A5, such as the described device of A4, it is characterized in that described abstraction module is further adapted for: extract one or more centre words the described microblogging content under each the microblogging classification that gathers; The described centre word that extracts from same microblogging content is sorted, and the described centre word after will sorting makes up, obtain the center phrase; Add up the related microblogging quantity of each center phrase under each microblogging classification, and from the phrase of described center, extract popular keyword sets under each microblogging classification according to described microblogging quantity.A6, such as the described device of A5, it is characterized in that described abstraction module further comprises: filter submodule, be suitable for from the phrase of described center, filtering the rubbish phrase according to rubbish phrase database.A7, such as the described device of A4, it is characterized in that, described computing module is further adapted for: the microblogging quantity that relates to described popular keyword sets under the same microblogging classification is added up, and be weighted calculating according to the microblogging parameter of described microblogging quantity and corresponding microblogging, obtain the temperature value of described popular keyword sets under each microblogging classification.A8, such as the described device of A7, it is characterized in that described microblogging parameter further comprises following one or more combination: microblogging is always transmitted number, microblogging general comment number, microblogging authenticated and is transmitted number and microblogging authenticated comment number.A9, such as the described device of A8, it is characterized in that described computing module is further adapted for: the temperature value of obtaining respectively described popular keyword sets under each microblogging classification according to following formula: the microblogging quantity * microblogging quantity weight coefficient+microblogging of the described popular keyword sets of the temperature value of popular keyword sets=relate to is always transmitted number * and is always transmitted number weight coefficient+microblogging general comment and count the * general comment and count weight coefficient+microblogging authenticated and transmit number * authenticated and transmit number weight coefficient+microblogging authenticated comment number * authenticated comment number weight coefficient.A10, such as the described device of A7, it is characterized in that, described order module is further adapted for: the temperature value to described popular keyword sets under each microblogging classification is carried out descending sort, and the microblogging much-talked-about topic of obtaining respectively under each microblogging classification is ranked and total microblogging much-talked-about topic seniority among brothers and sisters.A11, such as the described device of A1, it is characterized in that described device also comprises: acquisition module is suitable for obtaining the related microblogging content of each microblogging much-talked-about topic in the described microblogging much-talked-about topic seniority among brothers and sisters; Display module is suitable for showing the microblogging content that corresponding microblogging much-talked-about topic is related according to user's request or active to the user.
Herein disclosed is B12, a kind of microblogging much-talked-about topic method for digging, it is characterized in that, comprising: gather micro-blog information by open interface, wherein, described micro-blog information comprises: microblogging content and microblogging parameter; The described microblogging content that gathers is carried out participle, and extract popular keyword sets; The microblogging quantity that relates to described popular keyword sets is added up, and be weighted calculating according to the microblogging parameter of described microblogging quantity and corresponding microblogging, obtain the temperature value of described popular keyword sets; Temperature value to described popular keyword sets sorts, and obtains microblogging much-talked-about topic seniority among brothers and sisters.B13, such as the described method of B12, it is characterized in that described collection micro-blog information further comprises: the micro-blog information that gathers this door microblogging by the open interface of a door microblogging appointment.B14, such as the described method of B12, it is characterized in that after the described collection micro-blog information, described method also comprises: adopt the method for automatic cluster that microblogging is classified according to the described microblogging content that gathers, obtain different microblogging classifications.B15, such as the described method of B14, it is characterized in that, the described microblogging content that gathers is carried out participle, and extract popular keyword sets and further comprise: the microblogging content under each the microblogging classification that gathers is carried out respectively participle, and extract respectively the popular keyword sets under each microblogging classification.B16, such as the described method of B15, it is characterized in that, microblogging content under each the microblogging classification that gathers is carried out respectively participle, and the popular keyword sets that extracts respectively under each microblogging classification comprises further: extract one or more centre words the described microblogging content under each the microblogging classification that gathers; The described centre word that extracts from same microblogging content is sorted, and the described centre word after will sorting makes up, obtain the center phrase; Add up the related microblogging quantity of each center phrase under each microblogging classification, and from the phrase of described center, extract popular keyword sets under each microblogging classification according to described microblogging quantity.B17, such as the described method of B16, it is characterized in that, with the ordering after described centre word make up, obtain after the phrase of center, described method also comprises: filter the rubbish phrase according to rubbish phrase database from the phrase of described center.B18, such as the described method of B15, it is characterized in that, the microblogging quantity that relates to described popular keyword sets is added up, and be weighted calculating according to the microblogging parameter of described microblogging quantity and corresponding microblogging, the temperature value of obtaining described popular keyword sets further comprises: the microblogging quantity that relates to described popular keyword sets under the same microblogging classification is added up, and be weighted calculating according to the microblogging parameter of described microblogging quantity and corresponding microblogging, obtain the temperature value of described popular keyword sets under each microblogging classification.B19, such as the described method of B18, it is characterized in that described microblogging parameter further comprises following one or more combination: microblogging is always transmitted number, microblogging general comment number, microblogging authenticated and is transmitted number and microblogging authenticated comment number.B20, such as the described method of B19, it is characterized in that, according to described microblogging quantity, and the microblogging parameter of corresponding microblogging is weighted calculating, and the temperature value of obtaining described popular keyword sets under each microblogging classification further comprises: the temperature value of obtaining respectively described popular keyword sets under each microblogging classification according to following formula: the microblogging quantity * microblogging quantity weight coefficient+microblogging of the described popular keyword sets of the temperature value of popular keyword sets=relate to is always transmitted number * and is always transmitted number weight coefficient+microblogging general comment and count the * general comment and count weight coefficient+microblogging authenticated and transmit number * authenticated and transmit number weight coefficient+microblogging authenticated comment number * authenticated comment number weight coefficient.B21, such as the described method of B18, it is characterized in that, temperature value to described popular keyword sets sorts, obtaining microblogging much-talked-about topic seniority among brothers and sisters further comprises: the temperature value to described popular keyword sets under each microblogging classification is carried out descending sort, and the microblogging much-talked-about topic of obtaining respectively under each microblogging classification is ranked and total microblogging much-talked-about topic seniority among brothers and sisters.B22, such as the described method of B12, it is characterized in that obtain after the microblogging much-talked-about topic seniority among brothers and sisters, described method also comprises: obtain the related microblogging content of each microblogging much-talked-about topic in the described microblogging much-talked-about topic seniority among brothers and sisters; Show the microblogging content that corresponding microblogging much-talked-about topic is related according to user's request or active to the user.
Claims (20)
1. a microblogging much-talked-about topic excavating gear is characterized in that, comprising:
Acquisition module is suitable for gathering micro-blog information by open interface, and wherein, described micro-blog information comprises: microblogging content and microblogging parameter;
Abstraction module is suitable for the described microblogging content that gathers is carried out participle, and extracts popular keyword sets;
Computing module is suitable for the microblogging quantity that relates to described popular keyword sets is added up, and is weighted calculating according to the microblogging parameter of described microblogging quantity and corresponding microblogging, obtains the temperature value of described popular keyword sets;
Order module is suitable for the temperature value of described popular keyword sets is sorted, and obtains microblogging much-talked-about topic seniority among brothers and sisters.
2. device as claimed in claim 1 is characterized in that, described acquisition module is further adapted for: the micro-blog information that gathers this door microblogging by the open interface of a door microblogging appointment.
3. device as claimed in claim 1 is characterized in that, described device also comprises:
Sort module is suitable for adopting the method for automatic cluster that microblogging is classified according to the described microblogging content that gathers, and obtains different microblogging classifications.
4. device as claimed in claim 3 is characterized in that, described abstraction module is further adapted for:
Microblogging content under each the microblogging classification that gathers is carried out respectively participle, and extract respectively the popular keyword sets under each microblogging classification.
5. device as claimed in claim 4 is characterized in that, described abstraction module is further adapted for:
Extract one or more centre words the described microblogging content under each the microblogging classification that gathers;
The described centre word that extracts from same microblogging content is sorted, and the described centre word after will sorting makes up, obtain the center phrase;
Add up the related microblogging quantity of each center phrase under each microblogging classification, and from the phrase of described center, extract popular keyword sets under each microblogging classification according to described microblogging quantity.
6. device as claimed in claim 5 is characterized in that, described abstraction module further comprises:
Filter submodule, be suitable for from the phrase of described center, filtering the rubbish phrase according to rubbish phrase database.
7. device as claimed in claim 4 is characterized in that, described computing module is further adapted for:
The microblogging quantity that relates to described popular keyword sets under the same microblogging classification is added up, and be weighted calculating according to the microblogging parameter of described microblogging quantity and corresponding microblogging, obtain the temperature value of described popular keyword sets under each microblogging classification.
8. device as claimed in claim 7 is characterized in that, described microblogging parameter further comprises following one or more combination: microblogging is always transmitted number, microblogging general comment number, the microblogging authenticated transmits number and the microblogging authenticated is commented on number.
9. device as claimed in claim 8 is characterized in that, described computing module is further adapted for:
Obtain respectively the temperature value of described popular keyword sets under each microblogging classification according to following formula:
The microblogging quantity * microblogging quantity weight coefficient+microblogging of the described popular keyword sets of the temperature value of popular keyword sets=relate to is always transmitted number * and is always transmitted number weight coefficient+microblogging general comment and count the * general comment and count weight coefficient+microblogging authenticated and transmit number * authenticated and transmit number weight coefficient+microblogging authenticated comment number * authenticated comment number weight coefficient.
10. device as claimed in claim 7 is characterized in that, described order module is further adapted for:
Temperature value to described popular keyword sets under each microblogging classification is carried out descending sort, and the microblogging much-talked-about topic of obtaining respectively under each microblogging classification is ranked and total microblogging much-talked-about topic seniority among brothers and sisters.
11. device as claimed in claim 1 is characterized in that, described device also comprises:
Acquisition module is suitable for obtaining the related microblogging content of each microblogging much-talked-about topic in the described microblogging much-talked-about topic seniority among brothers and sisters;
Display module is suitable for showing the microblogging content that corresponding microblogging much-talked-about topic is related according to user's request or active to the user.
12. a microblogging much-talked-about topic method for digging is characterized in that, comprising:
Gather micro-blog information by open interface, wherein, described micro-blog information comprises: microblogging content and microblogging parameter;
The described microblogging content that gathers is carried out participle, and extract popular keyword sets;
The microblogging quantity that relates to described popular keyword sets is added up, and be weighted calculating according to the microblogging parameter of described microblogging quantity and corresponding microblogging, obtain the temperature value of described popular keyword sets;
Temperature value to described popular keyword sets sorts, and obtains microblogging much-talked-about topic seniority among brothers and sisters.
13. method as claimed in claim 12 is characterized in that, described collection micro-blog information further comprises: the micro-blog information that gathers this door microblogging by the open interface of a door microblogging appointment.
14. method as claimed in claim 12 is characterized in that, after the described collection micro-blog information, described method also comprises:
Adopt the method for automatic cluster that microblogging is classified according to the described microblogging content that gathers, obtain different microblogging classifications.
15. method as claimed in claim 14 is characterized in that, the described microblogging content that gathers is carried out participle, and extracts popular keyword sets and further comprise:
Microblogging content under each the microblogging classification that gathers is carried out respectively participle, and extract respectively the popular keyword sets under each microblogging classification.
16. method as claimed in claim 15 is characterized in that, the microblogging content under each the microblogging classification that gathers is carried out respectively participle, and the popular keyword sets that extracts respectively under each microblogging classification comprises further:
Extract one or more centre words the described microblogging content under each the microblogging classification that gathers;
The described centre word that extracts from same microblogging content is sorted, and the described centre word after will sorting makes up, obtain the center phrase;
Add up the related microblogging quantity of each center phrase under each microblogging classification, and from the phrase of described center, extract popular keyword sets under each microblogging classification according to described microblogging quantity.
17. method as claimed in claim 16 is characterized in that, the described centre word after the ordering is made up, and obtains after the phrase of center, described method also comprises:
From the phrase of described center, filter the rubbish phrase according to rubbish phrase database.
18. method as claimed in claim 15, it is characterized in that, the microblogging quantity that relates to described popular keyword sets is added up, and be weighted calculating according to the microblogging parameter of described microblogging quantity and corresponding microblogging, the temperature value of obtaining described popular keyword sets further comprises:
The microblogging quantity that relates to described popular keyword sets under the same microblogging classification is added up, and be weighted calculating according to the microblogging parameter of described microblogging quantity and corresponding microblogging, obtain the temperature value of described popular keyword sets under each microblogging classification.
19. method as claimed in claim 18 is characterized in that, described microblogging parameter further comprises following one or more combination: microblogging is always transmitted number, microblogging general comment number, the microblogging authenticated transmits number and the microblogging authenticated is commented on number.
20. method as claimed in claim 19 is characterized in that, is weighted calculating according to the microblogging parameter of described microblogging quantity and corresponding microblogging, the temperature value of obtaining described popular keyword sets under each microblogging classification further comprises:
Obtain respectively the temperature value of described popular keyword sets under each microblogging classification according to following formula:
The microblogging quantity * microblogging quantity weight coefficient+microblogging of the described popular keyword sets of the temperature value of popular keyword sets=relate to is always transmitted number * and is always transmitted number weight coefficient+microblogging general comment and count the * general comment and count weight coefficient+microblogging authenticated and transmit number * authenticated and transmit number weight coefficient+microblogging authenticated comment number * authenticated comment number weight coefficient.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210507862.4A CN102945290B (en) | 2012-12-03 | 2012-12-03 | Hot microblog topic excavating gear and method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210507862.4A CN102945290B (en) | 2012-12-03 | 2012-12-03 | Hot microblog topic excavating gear and method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102945290A true CN102945290A (en) | 2013-02-27 |
CN102945290B CN102945290B (en) | 2015-12-23 |
Family
ID=47728234
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210507862.4A Expired - Fee Related CN102945290B (en) | 2012-12-03 | 2012-12-03 | Hot microblog topic excavating gear and method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102945290B (en) |
Cited By (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103309962A (en) * | 2013-05-31 | 2013-09-18 | 华东师范大学 | Microblog service expert positioning method based on content relevance and social contact influence |
CN103580997A (en) * | 2013-11-19 | 2014-02-12 | 湖南蚁坊软件有限公司 | Extraction method and device for hot microblogs in vertical field |
CN103605673A (en) * | 2013-10-29 | 2014-02-26 | 北京奇虎科技有限公司 | Method and device for analyzing multiple network resource points |
CN103714132A (en) * | 2013-12-17 | 2014-04-09 | 北京本果信息技术有限公司 | Method and equipment used for mining hot events based on regions and industries |
CN103761234A (en) * | 2013-10-29 | 2014-04-30 | 北京奇虎科技有限公司 | Method and device for optimizing search ranking of network resource point |
CN104052765A (en) * | 2013-03-12 | 2014-09-17 | 蓝燕君 | Media information communication method and system |
CN104102681A (en) * | 2013-04-15 | 2014-10-15 | 腾讯科技(深圳)有限公司 | Microblog key event acquiring method and device |
CN104281653A (en) * | 2014-09-16 | 2015-01-14 | 南京弘数信息科技有限公司 | Viewpoint mining method for ten million microblog texts |
CN104462118A (en) * | 2013-09-21 | 2015-03-25 | 郑建锋 | Information spreading risk control method and system |
CN104504024A (en) * | 2014-12-11 | 2015-04-08 | 中国科学院计算技术研究所 | Method and system for mining keywords based on microblog content |
CN104516962A (en) * | 2014-12-18 | 2015-04-15 | 北京牡丹电子集团有限责任公司数字电视技术中心 | Monitoring method and system for microblogging public opinion |
CN104598450A (en) * | 2013-10-30 | 2015-05-06 | 北大方正集团有限公司 | Popularity analysis method and system of network public opinion event |
CN104615685A (en) * | 2015-01-22 | 2015-05-13 | 中国科学院计算技术研究所 | Hot degree evaluating method for network topic |
CN104615593A (en) * | 2013-11-01 | 2015-05-13 | 北大方正集团有限公司 | Method and device for automatic detection of microblog hot topics |
CN104778184A (en) * | 2014-01-15 | 2015-07-15 | 腾讯科技(深圳)有限公司 | Feedback keyword determining method and device |
CN104915447A (en) * | 2015-06-30 | 2015-09-16 | 北京奇艺世纪科技有限公司 | Method and device for tracing hot topics and confirming keywords |
CN105159882A (en) * | 2015-09-16 | 2015-12-16 | 中国地质大学(北京) | Method and apparatus for determining microblog hot topic |
CN105828198A (en) * | 2016-04-21 | 2016-08-03 | 深圳市金立通信设备有限公司 | Program recommendation method and terminal |
CN105975517A (en) * | 2016-04-27 | 2016-09-28 | 湖南蚁坊软件有限公司 | Microblog popularity index analysis method |
CN105989066A (en) * | 2015-02-09 | 2016-10-05 | 阿里巴巴集团控股有限公司 | Information processing method and device |
CN105989176A (en) * | 2015-03-05 | 2016-10-05 | 北大方正集团有限公司 | Data processing method and device |
CN105989143A (en) * | 2015-02-28 | 2016-10-05 | 科大讯飞股份有限公司 | Network entity popular degree analysis method and system |
CN106021316A (en) * | 2016-05-06 | 2016-10-12 | 长沙市麓智信息科技有限公司 | Core patent determination system and determination method |
CN106156182A (en) * | 2015-04-20 | 2016-11-23 | 富士通株式会社 | The method and apparatus that microblog topic word is categorized into specific field |
CN106294332A (en) * | 2015-05-11 | 2017-01-04 | 国家计算机网络与信息安全管理中心 | A kind of microblog topic feature extracting method and device |
CN106970924A (en) * | 2016-01-14 | 2017-07-21 | 北京国双科技有限公司 | A kind of topic sort method and device |
CN107122481A (en) * | 2017-05-04 | 2017-09-01 | 成都华栖云科技有限公司 | News temperature real-time online Forecasting Methodology |
CN107784127A (en) * | 2017-11-30 | 2018-03-09 | 杭州数梦工场科技有限公司 | A kind of focus localization method and device |
CN108733791A (en) * | 2018-05-11 | 2018-11-02 | 北京科技大学 | network event detection method |
CN109885688A (en) * | 2019-03-05 | 2019-06-14 | 湖北亿咖通科技有限公司 | File classification method, device, computer readable storage medium and electronic equipment |
CN110990571A (en) * | 2019-12-02 | 2020-04-10 | 精硕科技(北京)股份有限公司 | Method and device for obtaining discussion occupation ratio, storage medium and electronic equipment |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101408883A (en) * | 2008-11-24 | 2009-04-15 | 电子科技大学 | Method for collecting network public feelings viewpoint |
CN101751458A (en) * | 2009-12-31 | 2010-06-23 | 暨南大学 | Network public sentiment monitoring system and method |
CN101923544A (en) * | 2009-06-15 | 2010-12-22 | 北京百分通联传媒技术有限公司 | Method for monitoring and displaying Internet hot spots |
CN102289523A (en) * | 2011-09-20 | 2011-12-21 | 北京金和软件股份有限公司 | Method for intelligently extracting text labels |
CN102346766A (en) * | 2011-09-20 | 2012-02-08 | 北京邮电大学 | Method and device for detecting network hot topics found based on maximal clique |
CN102609475A (en) * | 2012-01-19 | 2012-07-25 | 浙江省公众信息产业有限公司 | Method for monitoring content of microblog and monitoring system |
US20120296920A1 (en) * | 2011-05-19 | 2012-11-22 | Yahoo! Inc. | Method to increase content relevance using insights obtained from user activity updates |
-
2012
- 2012-12-03 CN CN201210507862.4A patent/CN102945290B/en not_active Expired - Fee Related
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101408883A (en) * | 2008-11-24 | 2009-04-15 | 电子科技大学 | Method for collecting network public feelings viewpoint |
CN101923544A (en) * | 2009-06-15 | 2010-12-22 | 北京百分通联传媒技术有限公司 | Method for monitoring and displaying Internet hot spots |
CN101751458A (en) * | 2009-12-31 | 2010-06-23 | 暨南大学 | Network public sentiment monitoring system and method |
US20120296920A1 (en) * | 2011-05-19 | 2012-11-22 | Yahoo! Inc. | Method to increase content relevance using insights obtained from user activity updates |
CN102289523A (en) * | 2011-09-20 | 2011-12-21 | 北京金和软件股份有限公司 | Method for intelligently extracting text labels |
CN102346766A (en) * | 2011-09-20 | 2012-02-08 | 北京邮电大学 | Method and device for detecting network hot topics found based on maximal clique |
CN102609475A (en) * | 2012-01-19 | 2012-07-25 | 浙江省公众信息产业有限公司 | Method for monitoring content of microblog and monitoring system |
Non-Patent Citations (1)
Title |
---|
刘旭: "《博客热点话题挖掘方法》", 《中国优秀硕士学位论文全文数据库(信息科技辑)》, vol. 2012, no. 2, 15 February 2012 (2012-02-15) * |
Cited By (45)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104052765A (en) * | 2013-03-12 | 2014-09-17 | 蓝燕君 | Media information communication method and system |
CN104102681B (en) * | 2013-04-15 | 2017-05-17 | 腾讯科技(深圳)有限公司 | Microblog key event acquiring method and device |
CN104102681A (en) * | 2013-04-15 | 2014-10-15 | 腾讯科技(深圳)有限公司 | Microblog key event acquiring method and device |
CN103309962A (en) * | 2013-05-31 | 2013-09-18 | 华东师范大学 | Microblog service expert positioning method based on content relevance and social contact influence |
CN104462118A (en) * | 2013-09-21 | 2015-03-25 | 郑建锋 | Information spreading risk control method and system |
CN103605673A (en) * | 2013-10-29 | 2014-02-26 | 北京奇虎科技有限公司 | Method and device for analyzing multiple network resource points |
CN103761234A (en) * | 2013-10-29 | 2014-04-30 | 北京奇虎科技有限公司 | Method and device for optimizing search ranking of network resource point |
CN103605673B (en) * | 2013-10-29 | 2017-06-09 | 北京奇虎科技有限公司 | A kind of method and apparatus for analyzing multiple network resource points |
CN104598450A (en) * | 2013-10-30 | 2015-05-06 | 北大方正集团有限公司 | Popularity analysis method and system of network public opinion event |
CN104615593A (en) * | 2013-11-01 | 2015-05-13 | 北大方正集团有限公司 | Method and device for automatic detection of microblog hot topics |
CN104615593B (en) * | 2013-11-01 | 2017-09-29 | 北大方正集团有限公司 | Hot microblog topic automatic testing method and device |
CN103580997B (en) * | 2013-11-19 | 2017-09-29 | 湖南蚁坊软件有限公司 | The extracting method and its device of a kind of popular microblogging in vertical field |
CN103580997A (en) * | 2013-11-19 | 2014-02-12 | 湖南蚁坊软件有限公司 | Extraction method and device for hot microblogs in vertical field |
CN103714132A (en) * | 2013-12-17 | 2014-04-09 | 北京本果信息技术有限公司 | Method and equipment used for mining hot events based on regions and industries |
CN104778184A (en) * | 2014-01-15 | 2015-07-15 | 腾讯科技(深圳)有限公司 | Feedback keyword determining method and device |
CN104281653A (en) * | 2014-09-16 | 2015-01-14 | 南京弘数信息科技有限公司 | Viewpoint mining method for ten million microblog texts |
CN104281653B (en) * | 2014-09-16 | 2018-07-27 | 南京弘数信息科技有限公司 | A kind of opining mining method for millions scale microblogging text |
CN104504024A (en) * | 2014-12-11 | 2015-04-08 | 中国科学院计算技术研究所 | Method and system for mining keywords based on microblog content |
CN104516962A (en) * | 2014-12-18 | 2015-04-15 | 北京牡丹电子集团有限责任公司数字电视技术中心 | Monitoring method and system for microblogging public opinion |
CN104615685B (en) * | 2015-01-22 | 2018-01-26 | 中国科学院计算技术研究所 | A kind of temperature evaluation method of network-oriented topic |
CN104615685A (en) * | 2015-01-22 | 2015-05-13 | 中国科学院计算技术研究所 | Hot degree evaluating method for network topic |
CN105989066A (en) * | 2015-02-09 | 2016-10-05 | 阿里巴巴集团控股有限公司 | Information processing method and device |
CN105989143B (en) * | 2015-02-28 | 2019-09-03 | 科大讯飞股份有限公司 | Network entity temperature analysis method and system |
CN105989143A (en) * | 2015-02-28 | 2016-10-05 | 科大讯飞股份有限公司 | Network entity popular degree analysis method and system |
CN105989176A (en) * | 2015-03-05 | 2016-10-05 | 北大方正集团有限公司 | Data processing method and device |
CN106156182A (en) * | 2015-04-20 | 2016-11-23 | 富士通株式会社 | The method and apparatus that microblog topic word is categorized into specific field |
CN106294332B (en) * | 2015-05-11 | 2020-02-14 | 国家计算机网络与信息安全管理中心 | Microblog topic feature extraction method and device |
CN106294332A (en) * | 2015-05-11 | 2017-01-04 | 国家计算机网络与信息安全管理中心 | A kind of microblog topic feature extracting method and device |
CN104915447B (en) * | 2015-06-30 | 2018-04-20 | 北京奇艺世纪科技有限公司 | A kind of much-talked-about topic tracking and keyword determine method and device |
CN104915447A (en) * | 2015-06-30 | 2015-09-16 | 北京奇艺世纪科技有限公司 | Method and device for tracing hot topics and confirming keywords |
CN105159882A (en) * | 2015-09-16 | 2015-12-16 | 中国地质大学(北京) | Method and apparatus for determining microblog hot topic |
CN106970924A (en) * | 2016-01-14 | 2017-07-21 | 北京国双科技有限公司 | A kind of topic sort method and device |
CN106970924B (en) * | 2016-01-14 | 2020-10-20 | 北京国双科技有限公司 | Topic sorting method and device |
CN105828198A (en) * | 2016-04-21 | 2016-08-03 | 深圳市金立通信设备有限公司 | Program recommendation method and terminal |
CN105828198B (en) * | 2016-04-21 | 2019-05-17 | 深圳市金立通信设备有限公司 | A kind of program commending method and terminal |
CN105975517A (en) * | 2016-04-27 | 2016-09-28 | 湖南蚁坊软件有限公司 | Microblog popularity index analysis method |
CN106021316A (en) * | 2016-05-06 | 2016-10-12 | 长沙市麓智信息科技有限公司 | Core patent determination system and determination method |
CN107122481B (en) * | 2017-05-04 | 2020-06-30 | 成都华栖云科技有限公司 | Real-time online prediction method for news popularity |
CN107122481A (en) * | 2017-05-04 | 2017-09-01 | 成都华栖云科技有限公司 | News temperature real-time online Forecasting Methodology |
CN107784127A (en) * | 2017-11-30 | 2018-03-09 | 杭州数梦工场科技有限公司 | A kind of focus localization method and device |
CN108733791A (en) * | 2018-05-11 | 2018-11-02 | 北京科技大学 | network event detection method |
CN109885688A (en) * | 2019-03-05 | 2019-06-14 | 湖北亿咖通科技有限公司 | File classification method, device, computer readable storage medium and electronic equipment |
CN109885688B (en) * | 2019-03-05 | 2021-05-28 | 湖北亿咖通科技有限公司 | Text classification method and device, computer-readable storage medium and electronic equipment |
CN110990571A (en) * | 2019-12-02 | 2020-04-10 | 精硕科技(北京)股份有限公司 | Method and device for obtaining discussion occupation ratio, storage medium and electronic equipment |
CN110990571B (en) * | 2019-12-02 | 2024-04-02 | 北京秒针人工智能科技有限公司 | Method and device for acquiring discussion duty ratio, storage medium and electronic equipment |
Also Published As
Publication number | Publication date |
---|---|
CN102945290B (en) | 2015-12-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102945290B (en) | Hot microblog topic excavating gear and method | |
CN102982157A (en) | Device and method used for mining microblog hot topics | |
CN102831248B (en) | Network focus method for digging and device | |
Bozarth et al. | Toward a better performance evaluation framework for fake news classification | |
CN103793503B (en) | Opinion mining and classification method based on web texts | |
US10235421B2 (en) | Systems and methods for facilitating the gathering of open source intelligence | |
US8935197B2 (en) | Systems and methods for facilitating open source intelligence gathering | |
CN103617169B (en) | A kind of hot microblog topic extracting method based on Hadoop | |
US20190151758A1 (en) | Unique virtual entity creation based on real world data sources | |
US20130304818A1 (en) | Systems and methods for discovery of related terms for social media content collection over social networks | |
Ackland et al. | Hyperlinks and networked communication: a comparative study of political parties online | |
CN102812475A (en) | System And Method For Determining Sentiment Expressed In Documents | |
CN108170692A (en) | A kind of focus incident information processing method and device | |
CN102207961B (en) | Automatic web page classification method and device | |
Wicaksono | A proposed method for predicting US presidential election by analyzing sentiment in social media | |
CN103559207A (en) | Financial behavior analyzing system based on social media calculation | |
CN103617213B (en) | Method and system for identifying newspage attributive characters | |
CN103177076A (en) | Public sentiment monitoring system and method based on fixed point websites | |
CN105378730A (en) | Social media content analysis and output | |
CN103778225A (en) | Processing method, identifying device and identifying system of advertisement marketing language information | |
CN111340147B (en) | Decision behavior generation method and system based on decision tree | |
CN109766441A (en) | File classification method, apparatus and system | |
CN107220745A (en) | A kind of recognition methods, system and equipment for being intended to behavioral data | |
CN103955480B (en) | A kind of method and apparatus for determining the target object information corresponding to user | |
Edouard | Event detection and analysis on short text messages |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20151223 Termination date: 20211203 |
|
CF01 | Termination of patent right due to non-payment of annual fee |