Nothing Special   »   [go: up one dir, main page]

CN108965938A - Potential paying customer prediction technique and system in smart television - Google Patents

Potential paying customer prediction technique and system in smart television Download PDF

Info

Publication number
CN108965938A
CN108965938A CN201810879462.3A CN201810879462A CN108965938A CN 108965938 A CN108965938 A CN 108965938A CN 201810879462 A CN201810879462 A CN 201810879462A CN 108965938 A CN108965938 A CN 108965938A
Authority
CN
China
Prior art keywords
program
user
feature
watched
paying customer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810879462.3A
Other languages
Chinese (zh)
Other versions
CN108965938B (en
Inventor
姜国运
马军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong University
Original Assignee
Shandong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong University filed Critical Shandong University
Priority to CN201810879462.3A priority Critical patent/CN108965938B/en
Publication of CN108965938A publication Critical patent/CN108965938A/en
Application granted granted Critical
Publication of CN108965938B publication Critical patent/CN108965938B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/251Learning process for intelligent management, e.g. learning user preferences for recommending movies
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/258Client or end-user data management, e.g. managing client capabilities, user preferences or demographics, processing of multiple end-users preferences to derive collaborative data
    • H04N21/25866Management of end-user data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/4508Management of client data or end-user data
    • H04N21/4532Management of client data or end-user data involving end-user characteristics, e.g. viewer profile, preferences
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/466Learning process for intelligent management, e.g. learning user preferences for recommending movies
    • H04N21/4667Processing of monitored end-user data, e.g. trend analysis based on the log file of viewer selections

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computing Systems (AREA)
  • Computer Graphics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses potential paying customer prediction technique and systems in smart television, comprising: building is positively correlated data set and negatively correlated data set;Feature is positively correlated from being positively correlated to extract in data set;Negatively correlated feature is extracted from negatively correlated data set;Data characteristics is derivative;The Embeded Wide&Deep model built in advance is trained using fused positive correlation new feature and fused negatively correlated new feature, obtains trained Embeded Wide&Deep model;The new feature of user to be predicted is input in trained Embeded Wide&Deep model, exporting user to be predicted is potential paying customer or paying customer non-potential.It is effective to excavate potential paying customer, the solution currently in intelligent television field is proposed, promotes firms profitability in order to which enterprise formulates corresponding business strategy.

Description

Potential paying customer prediction technique and system in smart television
Technical field
The present invention relates to ntelligent television technolog field, more particularly to potential paying customer prediction technique in smart television and System.
Background technique
With daily household electrical appliances such as internet+and the artificial intelligence technology applications in household electrical appliances, including TV, refrigerator and air-conditioning Design, manufacture kimonos does honest work and develops to intelligent and networked services directions.There is predictive display, arrives the year two thousand twenty, intelligence TV market occupation rate is up to 90% or more, and global artificial intelligence system will be brought for household appliances enterprise more than 47,000,000,000 dollars Income, shows powerful market potential.Here smart television may be defined as to connect radio and television broadband programme content and It can charge and video program given by free video display set meal and network video provider provided by video on demand producer. It due to the improvement of people's living standards, is statisticallyd analyze according to the user of Hisense of Largest In China TV producer TV, is ready that viewing is paid The user for taking program is more and more.A challenging project is brought therewith, i.e., how to predict that a user can be from mesh Preceding non-payment user becomes paying customer.Obviously, the solution of the above problem will help enterprise to carry out user behavior analysis, business wide It accuses and the accurate dispensing of favor information, the help enterprises such as TV producer and video website brings more interests.Also it helps simultaneously TV user purchase value-added service and obtain better user experience etc..
Fewer and fewer for the research of potential paying customer forecasting problem both at home and abroad at present, current recommender system is for inhaling It quotes family to be known as paying the scheme of member, if red-letter day is invigorated dynamic, the mode extensively casted net provides discount coupon etc., and this mode is for root Originally it is unwilling for the user of payment to be a kind of " bothering ", while inefficiency.
Due to the complexity of user behavior in smart television and the deficient unicity of data information, with traditional electronic commerce Middle user behavior modeling has essential distinction, for the associated solutions in traditional electronic commerce field, is not suitable for smart television Field.
Summary of the invention
In order to solve the deficiencies in the prior art, the present invention provides potential paying customer prediction technique in smart television and it is System;
As the first aspect of the present invention, potential paying customer prediction technique in smart television is provided;
Potential paying customer prediction technique in smart television, comprising:
Data set building: building is positively correlated data set and negatively correlated data set;
Data characteristics is extracted: being positively correlated feature from being positively correlated to extract in data set;Negative is extracted from negatively correlated data set Close feature;
Data characteristics is derivative: deriving to feature is positively correlated, generates and be positively correlated new feature;Feature and positive will be positively correlated It closes new feature and carries out Fusion Features, obtain fused positive correlation feature;Negatively correlated feature is derived, is generated negatively correlated new Feature;Negatively correlated feature and negatively correlated new feature are subjected to Fusion Features, obtain fused negatively correlated feature;
Model training step: using fused positive correlation new feature and fused negatively correlated new feature to building in advance Embeded Wide&Deep model be trained, obtain trained Embeded Wide&Deep model;
Potential paying customer prediction: user itself behavioural characteristic of user to be predicted, the spy of the watched program of user are extracted It seeks peace by the feature of viewing program;To the quilt of user itself behavioural characteristic of the user to be predicted of extraction, the watched program of user It moves feature and carries out feature by the active features of viewing program and derive, form the new feature of user to be predicted;By user to be predicted New feature be input in trained Embeded Wide&Deep model, export user to be predicted be potential paying customer also It is non-potential paying customer.
Further, the positive correlation data set, comprising: the sponsored program of paying customer's history viewing, paying customer go through History viewing non-payment program, paying customer's history pay records, paying customer's history viewing sponsored program recommended information and The recommended information of paying customer's history viewing non-payment program;Paying customer's history pay records, comprising: paying customer goes through The sponsored program and corresponding paid-for time of history viewing.
Further, the negatively correlated data set, comprising: the program and non-paid user of non-paid user history viewing are gone through The recommended information of history viewing program.
Further, the positive correlation feature, comprising: program viewing period, the history pay records frequency, charges paid section Purpose spending amount, the type of program, the performer of program, the director of program, program country, section object language, program it is upper Reflect time, the scoring of program and the corresponding set meal ID of program.
Further, the negatively correlated feature, comprising: program viewing period, the type of program, the performer of program, section Purpose director, the country of program, section object language, the show time of program, the scoring of program and the corresponding set meal ID of program.
Further, the positive correlation new feature, comprising: continuous to be positively correlated feature and continuous-discrete positive correlation feature;
Derivative continuous positive correlation feature: charges paid program is watched from user in set period of time is gone out derived from positive correlation feature Number of days, the average number of programs watched daily of user and the average charges paid program of viewing daily of user duration characteristics;
Derivative continuous-discrete positive correlation feature: paid from going out to watch certain seed type daily in derived from positive correlation feature Take the quantity of program and watches within one day the duration characteristics of type charges paid program in certain per hour.
Further, the negatively correlated new feature, comprising: continuous negative correlation feature and continuous-discrete negatively correlated feature;
Derivative continuous negatively correlated feature: non-paid program is watched from user in set period of time is gone out derived from negatively correlated feature Number of days, the average non-paid number of programs watched daily of user and the average duration characteristics for watching non-paid program daily of user;
Derivative continuous-discrete negatively correlated feature: unpaid from going out to watch certain seed type daily in derived from negatively correlated feature Take the quantity of program and watches within one day the duration characteristics of the non-paid program of type in certain per hour.
Further, itself behavioural characteristic of the user of user to be predicted, comprising: the time of user's viewing TV;
Further, the feature of the watched program of user to be predicted, comprising: program is watched within the scope of user's setting time Number of days, the average number of programs watched daily of user, the average duration watched daily of user;
Further, by the feature of viewing program, comprising: the program viewing period, the type of program, program performer, The director of program, the country of program, section object language, the show time of program, the scoring of program and the corresponding set meal ID of program;
Further, to the passive spy of user itself behavioural characteristic of the user to be predicted of extraction, the watched program of user It seeks peace derivative by the active features progress feature of viewing program, forms the new feature of user to be predicted are as follows:
User watches program number of days, the number of programs watched daily that is averaged, the average time watched daily, is averaged one week seven days Viewing number of days accounting, the one week seven days average number of programs watched daily, the one week seven days average time watched daily, every kind daily Time that type programs are averagely watched, the average number watched of each type program, each type program is averaged one week seven days The average number watched of the time of viewing, one week seven days each type program, each type program are averaged by viewing time, often A performer uses language by watched time, each program belonging country watched time, every kind of program by watched time, each director Watched time, each program show date watched time, different scoring program by watched time or watch program and belong to set meal Number.
Further, the Embeded Wide&Deep model, comprising: linear model and depth model two parts;
The linear model is that Logic Regression Models learn user's low order feature representation, expression are as follows:
xembed=wembed*x
Y=wTxembed+b
Wherein, xembedTo be embedded in vector, x is input layer user vector, wembedFor embeding layer map vector, w is model power Weight vector, b are bigoted item, and y is predicted value.
The depth model part is multiple perceptron model, learns user's Higher Order Abstract feature, expression are as follows:
Y=f[l](w[l]*(f[l-1]*(...f[1](w[1]*x+b[1])...))+b[l])
Wherein, l is network number of plies number, and f is the activation primitive of current layer, and x is user characteristics vector, and w is Model Weight Vector, b are bigoted.
Joint training linear model and depth model splice the input of depth model the last layer and linear model one It rises, exports predicted value after training.
Joint training will optimize all parameters of linear model and depth model simultaneously, by by linear model and depth mould The mode that type is weighted summation in training carries out;
Embeded Wide&Deep model tormulation are as follows:
Wherein, y is the label of two classification, and σ () is sigmoid activation primitive, and x is user characteristics vector, and b is bigoted ?.wwideIt is the weight of linear model, wdeepIt is the weight of depth model.
Note in the training process of joint training linear model and depth model: loss loss function is intersection entropy function; When predicted value is 0 or negative, predicted value is substituted for a minimum, to prevent from Nan value occur in the calculating of Loss loss function Trap;
Depth model is designed as the design of tower layer, and halve principle: wherein bottom is widest, and upper layer articulamentum successively subtracts Few, the width of tower layer successively halves from bottom to top;Use Xavier weights initialisation solution.
As a second aspect of the invention, potential paying customer forecasting system in smart television is provided;
Potential paying customer forecasting system in smart television, comprising: memory, processor and storage are on a memory simultaneously The computer instruction run on a processor when the computer instruction is run by processor, is completed described in any of the above-described method The step of.
Compared with prior art, the beneficial effects of the present invention are:
(1) feature of paying customer and non-paid user are sufficiently excavated, and is potential by feature derivative and Fusion Features The accurate prediction of paying customer provides foundation;
(2) potential paying customer is effectively excavated, the solution currently in intelligent television field is proposed, in order to look forward to Industry formulates corresponding business strategy and promotes firms profitability.
Detailed description of the invention
The accompanying drawings constituting a part of this application is used to provide further understanding of the present application, and the application's shows Meaning property embodiment and its explanation are not constituted an undue limitation on the present application for explaining the application.
Fig. 1 is flow chart of the invention;
Fig. 2 is the division that user of the present invention watches log information paying customer (positive example) log;
Fig. 3 is the division that user of the present invention watches log information non-payment user (negative example) log;
Fig. 4 is Embeded Wide&Deep model support composition of the present invention.
Specific embodiment
It is noted that following detailed description is all illustrative, it is intended to provide further instruction to the application.Unless another It indicates, all technical and scientific terms used herein has usual with the application person of an ordinary skill in the technical field The identical meanings of understanding.
It should be noted that term used herein above is merely to describe specific embodiment, and be not intended to restricted root According to the illustrative embodiments of the application.As used herein, unless the context clearly indicates otherwise, otherwise singular Also it is intended to include plural form, additionally, it should be understood that, when in the present specification using term "comprising" and/or " packet Include " when, indicate existing characteristics, step, operation, device, component and/or their combination.
As one embodiment of the invention, potential paying customer prediction technique in smart television is provided;
As shown in Figure 1, potential paying customer prediction technique in smart television, comprising:
Data set construction step: building is positively correlated data set and negatively correlated data set;
The positive correlation data set, comprising: the sponsored program of paying customer's history viewing, paying customer's history are watched non- Sponsored program, paying customer's history pay records, the recommended information of paying customer's history viewing sponsored program and paying customer go through The recommended information of history viewing non-payment program;Paying customer's history pay records, comprising: the viewing of paying customer's history is paid Take program and corresponding paid-for time;
The negative correlation data set, comprising: the program and non-paid user history viewing section of non-paid user history viewing Purpose recommended information;
Data characteristics extraction step:
Feature is positively correlated from being positively correlated to extract in data set;The positive correlation feature, comprising: the program viewing period is gone through The history pay records frequency, the spending amount of charges paid program, the type of program, the performer of program, the director of program, section country of destination Family, section object language, the show time of program, the scoring of program and the corresponding set meal ID of program;
Negatively correlated feature is extracted from negatively correlated data set;The negative correlation feature, comprising: program viewing period, section Purpose type, the performer of program, the director of program, the country of program, section object language, the show time of program, program are commented Divide set meal ID corresponding with program;
Data characteristics derives step:
Derive to feature is positively correlated, generates and be positively correlated new feature;Feature will be positively correlated and be positively correlated new feature and carried out Fusion Features obtain fused positive correlation feature;
The positive correlation new feature, comprising: continuous to be positively correlated feature and continuous-discrete positive correlation feature;
Derivative continuous positive correlation feature: charges paid program is watched from user in set period of time is gone out derived from positive correlation feature Number of days, the average number of programs watched daily of user and the average charges paid program of viewing daily of user duration characteristics;
Derivative continuous-discrete positive correlation feature: paid from going out to watch certain seed type daily in derived from positive correlation feature Take the quantity of program and watches within one day the duration characteristics of type charges paid program in certain per hour;
Negatively correlated feature is derived, negatively correlated new feature is generated;Negatively correlated feature and negatively correlated new feature are carried out Fusion Features obtain fused negatively correlated feature;
The negative correlation new feature, comprising: continuous negative correlation feature and continuous-discrete negatively correlated feature;
Derivative continuous negatively correlated feature: non-paid program is watched from user in set period of time is gone out derived from negatively correlated feature Number of days, the average non-paid number of programs watched daily of user and the average duration characteristics for watching non-paid program daily of user;
Derivative continuous-discrete negatively correlated feature: unpaid from going out to watch certain seed type daily in derived from negatively correlated feature Take the quantity of program and watches within one day the duration characteristics of the non-paid program of type in certain per hour;
Model training step: using fused positive correlation new feature and fused negatively correlated new feature to building in advance Embeded Wide&Deep model be trained, obtain trained Embeded Wide&Deep model;
User charges prediction steps: user itself behavioural characteristic of user to be predicted, the quilt of the watched program of user are extracted Move feature and by the active features of viewing program;User itself behavioural characteristic of the user to be predicted of extraction, user are watched The passive feature of program and to carry out feature by the active features of viewing program derivative, forms the new feature of user to be predicted;It will be to The new feature of prediction user is input in trained Embeded Wide&Deep model, exports setting for user to be predicted Payment probability in period.
The Embeded Wide&Deep model, comprising: linear model and depth model two parts;
The linear model is that Logic Regression Models learn user's low order feature representation, expression are as follows:
xembed=wembed*x
Y=wTxembed+b
Wherein, xembedTo be embedded in vector, x is input layer user vector, wembedFor embeding layer map vector, w is model power Weight vector, b are bigoted item, and y is predicted value.
The depth model part is multiple perceptron model, learns user's Higher Order Abstract feature, expression are as follows:
Y=f[l](w[l]*(f[l-1]*(...f[1](w[1]*x+b[1])...))+b[l])
Wherein, l is network number of plies number, and f is the activation primitive of current layer, and x is user characteristics vector, and w is Model Weight Vector, b are bigoted.
Joint training linear model and depth model splice the input of depth model the last layer and linear model one It rises, exports predicted value after training.
Joint training will optimize all parameters of linear model and depth model simultaneously, by by linear model and depth mould The mode that type is weighted summation in training carries out;
Model is as shown in figure 4, Embeded Wide&Deep model tormulation are as follows:
Wherein, y is the label of two classification, and σ () is sigmoid activation primitive, and x is user characteristics vector, and b is bigoted ?.wwiDe is the weight of linear model, wdeepIt is the weight of depth model.
Note in the training process of joint training linear model and depth model: loss loss function is intersection entropy function; When predicted value is 0 or negative, predicted value is substituted for a minimum, to prevent from Nan value occur in the calculating of Loss loss function Trap;
Depth model is designed as the design of tower layer, and halve principle: wherein bottom is widest, and upper layer articulamentum successively subtracts Few, the width of tower layer successively halves from bottom to top;Use Xavier weights initialisation solution.
As second embodiment of the invention, potential paying customer forecasting system in smart television is provided;
Potential paying customer forecasting system in smart television, comprising: memory, processor and storage are on a memory simultaneously The computer instruction run on a processor when the computer instruction is run by processor, is completed described in any of the above-described method The step of.
The application proposes characteristic processing frame derived from Information expansion and feature, and the Embeded Wide& being applicable in Deep model.
Step 1: Information expansion includes expanding user according to by original user viewing log and user's purchaser record The information of itself behavior watches extended log files by original user and goes out the passive information of the watched program of user, and by original Log and auxiliary information are expanded by viewing programme information initiative information;
Step 2: feature derivative, which refers to, to be combined by existing data to find new meaning, and individual features are increased;
Step 3:Embeded Wide&Deep model refers to linear according to being designed by corresponding parameter with frame The model of model and depth model.
The Embeded Wide&Deep model refers to according to the multilayer sense being designed by corresponding parameter and frame Know machine model;
User is watched log information to divide according to payment and non-payment user, paying customer is specifically watched into log letter Log information before breath payment day extracts, and non-payment user's whole user journal information extraction is come out, for doing The potential paying customer task stated;
Further, raw information described in the step 1 includes: user watches log information and auxiliary information, Middle auxiliary information includes film information and user's purchaser record.
It is wherein described that user is watched into log information according to payment and non-payment user division such as Fig. 2, shown in Fig. 3.Wherein Fig. 2 indicates that user has purchased set meal member at some time point, and the user journal before this user purchase is extracted as number According to the object of excavation, and label is 1, is expressed as positive example.Wherein Fig. 3 indicates that user did not bought set meal member eventually, by this its institute There is user's viewing log to extract the object as data mining, and label is 0, indicates the example that is negative.
Itself behavioural information of user described in the step 1 refers to the field watched in log information according to original user Or user's purchaser record Information expansion information relevant to user itself behavior in auxiliary information, addition is in phase after being quantified In the journal entries answered.Itself behavioural information of the user refers to: being extended according to time field information in original log information Which period occurred in one day for the current behavior of user out, occurred in the temporal informations such as week is several, because of user's watching behavior With temporal regularity, this time cycle be can be one day, is also possible to one week, and addition is in journal entries after being quantified In;
It is described extended log files watched by original user go out the passive information of the watched program of user refer to from the log The statistics passive information of the watched program of extending user is carried out in information, addition is in journal entries after being quantified.
The watched programme information of user is expanded with auxiliary information by original log to refer to: day is watched according to original user The watched program of user is associated with auxiliary information (programme information) in will, expands the watched program initiative information of user, Addition is in journal entries after being quantified.
User, which is expanded, in user's purchaser record information in auxiliary information has bought information, such as: user has bought Set meal member's frequency, frequency etc. is bought in spending amount, single video payment.
It is described the watched programme information of user is expanded by original log to refer to: section is first counted from all logs Mesh calculates program popularity by the viewing frequency, and addition is in journal entries after being quantified.
It expands the watched programme information of user with programme information in auxiliary information by original log to refer to: according to original The watched program ID of user is associated with auxiliary information (programme information) in log, expands program category, label, classification, Performer, director, country, language are shown the date, video scoring, set meal ID.Because above-mentioned every kind of information has multiple attributes, Mei Gejie Mesh may also correspond to multiple attributes in every kind of information, first quantify every kind of information attribute, to each attribute in every kind of information The continuous unique encodings since 0 are carried out, are stored in dictionary, so as to later period quick obtaining attribute coding.Similar to Pareto Rule (Pareto ' s principle) is also a bar Lai Tedinglv, and in mass data learning process, effective influence factor is certain That relative quantity is biggish, if amount seldom if think its for can not Consideration, therefore, for certain auxiliary informations (director, drill Member) the too many situation of number of attributes, the highest top n attribute of frequency of occurrence is selected according to testing sieve, remaining ranges one A attribute, is then quantified.
Further, feature derivative is by extracting to data in the step 2, and combination finds new data meaning Justice increases individual features, and specific includes derivative continuous feature, the derivative continuous feature of discrete features &.
The wherein continuous feature of the derivative: extracting the continuity Characteristics of user, passes through traversal and counts each user journal Data entries obtain.It is specific: to extract user and watch number of days, the average number of programs watched daily of user, user are average daily The continuous feature such as the duration of viewing.Wherein the user watches number of days: within the time span of model data processing, user Open the number of days of television set viewing program.The average number of programs watched daily of user described in wherein: cumulative user each out sees It sees number of programs sum, watches total number of days divided by user and obtain.The average duration watched daily of user described in wherein: cumulative every out A user watches program total duration, watches total number of days divided by user and obtains.
The continuous feature of derivative discrete features described in wherein: the temporal information similar expanded is combined and is expanded Other information combine to obtain the continuous feature of discrete &.First by the discrete dimension to the equivalent time of feature, for feature per one-dimensional Degree, traversal counts each user journal data entries in obtained data, extracts the continuity Characteristics of respective dimensions.
The continuous feature of derivative discrete features described in wherein: because user's watching behavior has temporal regularity, this A time cycle can be one day, be also possible to one week, and the similar temporal information expanded is combined other expanded Information consolidation obtains the continuous feature of discrete &.Such as: 7 days one week with the group of viewing program category (n attribute, i.e. n dimension) quantity It closes, obtaining 7*n dimensional vector indicates that user user watches the feature of the quantity of certain type programs for one week daily.24 hours one day With the combination of viewing program category (n attribute, i.e. n are tieed up) duration, obtaining 24*n dimensional vector is indicated user user 24 hours one day The feature of the duration of certain type programs is watched per hour.
User's information of drawing a portrait conventionally (discretization or serialization) quantified, above-mentioned is derived above Continuous feature, after the discrete continuous feature of & merges with the quantization of user's Figure Characteristics, final we obtain the user characteristics vector of n dimension, And the corresponding label of assignment, all user characteristics are merged into matrix and form complete data collection.After data set is upset sequence, press Training set and test set are generated according to ratio;
User's portrait information is conventionally quantified, the above above-mentioned continuous feature derived, discrete & connects After continuous feature merges with user's three kinds of information of portrait, final we obtain the user characteristics vector of 2161 dimensions, and assignment is corresponding Label forms complete data collection.After data set is upset sequence, training set and test set are generated according to the ratio of 8:2;
The foregoing is merely preferred embodiment of the present application, are not intended to limit this application, for the skill of this field For art personnel, various changes and changes are possible in this application.Within the spirit and principles of this application, made any to repair Change, equivalent replacement, improvement etc., should be included within the scope of protection of this application.

Claims (10)

1. potential paying customer prediction technique in smart television, characterized in that include:
Data set building: building is positively correlated data set and negatively correlated data set;
Data characteristics is extracted: being positively correlated feature from being positively correlated to extract in data set;It is extracted from negatively correlated data set negatively correlated special Sign;
Data characteristics is derivative: deriving to feature is positively correlated, generates and be positively correlated new feature;Feature will be positively correlated and be positively correlated new Feature carries out Fusion Features, obtains fused positive correlation feature;Negatively correlated feature is derived, is generated negatively correlated new special Sign;Negatively correlated feature and negatively correlated new feature are subjected to Fusion Features, obtain fused negatively correlated feature;
Model training step: using fused positive correlation new feature and fused negatively correlated new feature to building in advance Embeded Wide&Deep model is trained, and obtains trained Embeded Wide&Deep model;
Potential paying customer prediction: extract user itself behavioural characteristic of user to be predicted, the feature of the watched program of user and By the feature of viewing program;To the passive spy of user itself behavioural characteristic of the user to be predicted of extraction, the watched program of user It seeks peace derivative by the active features progress feature of viewing program, forms the new feature of user to be predicted;By the new of user to be predicted Feature is input in trained Embeded Wide&Deep model, and exporting user to be predicted is potential paying customer also right and wrong Potential paying customer.
2. potential paying customer prediction technique in smart television as described in claim 1, characterized in that the positive correlation data Collection, comprising: the sponsored program of paying customer's history viewing, the non-payment program of paying customer's history viewing, paying customer's history The introduction of pay records, the recommended information of paying customer's history viewing sponsored program and paying customer's history viewing non-payment program Information;Paying customer's history pay records, comprising: when the sponsored program of paying customer's history viewing and corresponding payment Between;
The negative correlation data set, comprising: the program and non-paid user history of non-paid user history viewing watch program Recommended information.
3. potential paying customer prediction technique in smart television as described in claim 1, characterized in that
The positive correlation feature, comprising: the program viewing period, the history pay records frequency, charges paid program spending amount, The type of program, the performer of program, the director of program, the country of program, section object language, the show time of program, program Score set meal ID corresponding with program;
The negative correlation feature, comprising: program viewing period, the type of program, the performer of program, the director of program, program Country, section object language, the show time of program, the scoring of program and the corresponding set meal ID of program.
4. potential paying customer prediction technique in smart television as described in claim 1, characterized in that described to be positively correlated new spy Sign, comprising: continuous to be positively correlated feature and continuous-discrete positive correlation feature;
Derive continuous positive correlation feature: from the day for going out user's viewing charges paid program in set period of time derived from positive correlation feature The duration characteristics of number, the average number of programs watched daily of user and the average charges paid program of viewing daily of user;
Derivative continuous-discrete positive correlation feature: certain seed type charges paid section is watched daily from being positively correlated to go out one week derived from feature Purpose quantity and the duration characteristics for watching type charges paid program in certain for one day per hour.
5. potential paying customer prediction technique in smart television as described in claim 1, characterized in that described negatively correlated new special Sign, comprising: continuous negative correlation feature and continuous-discrete negatively correlated feature;
Derivative continuous negatively correlated feature: from the day for going out user in set period of time derived from negatively correlated feature and watching non-paid program Number, the average non-paid number of programs watched daily of user and the average duration characteristics for watching non-paid program daily of user;
Derivative continuous-discrete negatively correlated feature: the non-paid section of certain seed type is watched daily from going out one week derived from negatively correlated feature Purpose quantity and the duration characteristics for watching the non-paid program of type in certain for one day per hour.
6. potential paying customer prediction technique in smart television as described in claim 1, characterized in that the use of user to be predicted Family itself behavioural characteristic, comprising: the time of user's viewing TV;
The feature of the watched program of user to be predicted, comprising: the number of days of program is watched within the scope of user's setting time, user is average Number of programs, the average duration watched daily of user watched daily;
By the feature of viewing program, comprising: program viewing period, the type of program, the performer of program, the director of program, section Purpose country, section object language, the show time of program, the scoring of program and the corresponding set meal ID of program.
7. potential paying customer prediction technique in smart television as described in claim 1, characterized in that
To user itself behavioural characteristic of the user to be predicted of extraction, the passive feature of the watched program of user and by viewing program Active features to carry out feature derivative, form the new feature of user to be predicted are as follows:
User watches program number of days, the average number of programs watched daily, the average time watched daily, average daily one week seven days Viewing number of days accounting, the one week seven days average number of programs watched daily, the one week seven days average time watched daily, each type Time that program is averagely watched, the average number watched of each type program, each type program is averagely watched one week seven days Time, the average number watched of each type program, each type program are average by viewing time, Mei Geyan one week seven days Member is watched by watched time, each program belonging country watched time, every kind of program using language by watched time, each director Number, each program show date watched time, different scoring program by watched time or watch time that program belongs to set meal Number.
8. potential paying customer prediction technique in smart television as described in claim 1, characterized in that
The Embeded Wide&Deep model, comprising: linear model and depth model two parts;
The linear model is that Logic Regression Models learn user's low order feature representation, expression are as follows:
xembed=wembed*x
Y=wTxembed+b
Wherein, xembedTo be embedded in vector, x is input layer user vector, wembedFor embeding layer map vector, w be Model Weight to Amount, b are bigoted item, and y is predicted value;
The depth model part is multiple perceptron model, learns user's Higher Order Abstract feature, expression are as follows:
Y=f[l](w[l]*(f[l-1]*(...f[1](w[1]*x+b[1])...))+b[l])
Wherein, l is network number of plies number, and f is the activation primitive of current layer, and x is user characteristics vector, and w is Model Weight vector, B is bigoted;
The input of depth model the last layer and linear model is stitched together by joint training linear model and depth model, Predicted value is exported after training;
Joint training will optimize all parameters of linear model and depth model simultaneously, by the way that linear model and depth model exist The mode that summation is weighted when training carries out.
9. potential paying customer prediction technique in smart television as described in claim 1, characterized in that
Embeded Wide&Deep model tormulation are as follows:
Wherein, y is the label of two classification, and σ () is sigmoid activation primitive, and x is user characteristics vector, and b is bigoted item;wwide It is the weight of linear model, wdeepIt is the weight of depth model.
10. potential paying customer forecasting system in smart television, characterized in that include: that memory, processor and being stored in is deposited The computer instruction run on reservoir and on a processor when the computer instruction is run by processor, completes aforesaid right It is required that step described in 1-9 either method.
CN201810879462.3A 2018-08-03 2018-08-03 Method and system for predicting potential pay users in smart television Expired - Fee Related CN108965938B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810879462.3A CN108965938B (en) 2018-08-03 2018-08-03 Method and system for predicting potential pay users in smart television

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810879462.3A CN108965938B (en) 2018-08-03 2018-08-03 Method and system for predicting potential pay users in smart television

Publications (2)

Publication Number Publication Date
CN108965938A true CN108965938A (en) 2018-12-07
CN108965938B CN108965938B (en) 2020-03-20

Family

ID=64467577

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810879462.3A Expired - Fee Related CN108965938B (en) 2018-08-03 2018-08-03 Method and system for predicting potential pay users in smart television

Country Status (1)

Country Link
CN (1) CN108965938B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110166803A (en) * 2019-06-21 2019-08-23 奚华文 A kind of interactive approach of the network media
CN110599234A (en) * 2019-08-13 2019-12-20 广东工业大学 Product sales prediction method
CN111191860A (en) * 2020-04-14 2020-05-22 北京热云科技有限公司 Prediction method based on ensemble learning, prediction system and readable storage medium
CN111554118A (en) * 2020-04-24 2020-08-18 深圳职业技术学院 Dynamic prediction method and system for bus arrival time
CN111709764A (en) * 2019-03-18 2020-09-25 腾讯科技(深圳)有限公司 Method and device for determining related parameters of multimedia content and storage medium
CN113761342A (en) * 2021-01-21 2021-12-07 北京沃东天骏信息技术有限公司 Information pushing method and device and computer readable storage medium
CN116934389A (en) * 2023-06-12 2023-10-24 嘉兴华数广电网络有限公司 Digital television value ticket card management system based on cloud computing and cross-platform technology
CN117909601A (en) * 2024-03-15 2024-04-19 厦门她趣信息技术有限公司 Payment social matching method, device, equipment and readable storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105160008A (en) * 2015-09-21 2015-12-16 合一网络技术(北京)有限公司 Method and device for locating suggested users
CN105354725A (en) * 2015-11-20 2016-02-24 珠海多玩信息技术有限公司 Prediction method and system of promotion effect of application
CN105719164A (en) * 2016-01-21 2016-06-29 海信集团有限公司 Paid multimedia resource recommending method and paid multimedia resource recommending device
CN106168976A (en) * 2016-07-14 2016-11-30 武汉斗鱼网络科技有限公司 A kind of specific user's method for digging based on NB Algorithm and system
CN106204063A (en) * 2016-06-30 2016-12-07 北京奇艺世纪科技有限公司 A kind of paying customer's method for digging and device
CN108205766A (en) * 2016-12-19 2018-06-26 阿里巴巴集团控股有限公司 Information-pushing method, apparatus and system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105160008A (en) * 2015-09-21 2015-12-16 合一网络技术(北京)有限公司 Method and device for locating suggested users
CN105354725A (en) * 2015-11-20 2016-02-24 珠海多玩信息技术有限公司 Prediction method and system of promotion effect of application
CN105719164A (en) * 2016-01-21 2016-06-29 海信集团有限公司 Paid multimedia resource recommending method and paid multimedia resource recommending device
CN106204063A (en) * 2016-06-30 2016-12-07 北京奇艺世纪科技有限公司 A kind of paying customer's method for digging and device
CN106168976A (en) * 2016-07-14 2016-11-30 武汉斗鱼网络科技有限公司 A kind of specific user's method for digging based on NB Algorithm and system
CN108205766A (en) * 2016-12-19 2018-06-26 阿里巴巴集团控股有限公司 Information-pushing method, apparatus and system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
H.-T. CHENG等: "Wide & deep learning for recommender systems", 《IN PROC. WORKSHOP DEEP LEARN RECOMMENDER SYST.》 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111709764A (en) * 2019-03-18 2020-09-25 腾讯科技(深圳)有限公司 Method and device for determining related parameters of multimedia content and storage medium
CN111709764B (en) * 2019-03-18 2023-04-18 腾讯科技(深圳)有限公司 Method and device for determining related parameters of multimedia content and storage medium
CN110166803A (en) * 2019-06-21 2019-08-23 奚华文 A kind of interactive approach of the network media
CN110599234A (en) * 2019-08-13 2019-12-20 广东工业大学 Product sales prediction method
CN111191860A (en) * 2020-04-14 2020-05-22 北京热云科技有限公司 Prediction method based on ensemble learning, prediction system and readable storage medium
CN111191860B (en) * 2020-04-14 2020-08-21 北京热云科技有限公司 Prediction method based on ensemble learning, prediction system and readable storage medium
CN111554118A (en) * 2020-04-24 2020-08-18 深圳职业技术学院 Dynamic prediction method and system for bus arrival time
CN113761342A (en) * 2021-01-21 2021-12-07 北京沃东天骏信息技术有限公司 Information pushing method and device and computer readable storage medium
CN116934389A (en) * 2023-06-12 2023-10-24 嘉兴华数广电网络有限公司 Digital television value ticket card management system based on cloud computing and cross-platform technology
CN116934389B (en) * 2023-06-12 2024-02-23 嘉兴华数广电网络有限公司 Digital television value ticket card management system based on cloud computing and cross-platform technology
CN117909601A (en) * 2024-03-15 2024-04-19 厦门她趣信息技术有限公司 Payment social matching method, device, equipment and readable storage medium
CN117909601B (en) * 2024-03-15 2024-05-17 厦门她趣信息技术有限公司 Payment social matching method, device, equipment and readable storage medium

Also Published As

Publication number Publication date
CN108965938B (en) 2020-03-20

Similar Documents

Publication Publication Date Title
CN108965938A (en) Potential paying customer prediction technique and system in smart television
JP5583696B2 (en) Conversion confidence rating
CN108205766A (en) Information-pushing method, apparatus and system
CN106055617A (en) Data pushing method and device
CN106803190A (en) A kind of ad personalization supplying system and method
CN109783730A (en) Products Show method, apparatus, computer equipment and storage medium
CN109447364A (en) Power customer based on label complains prediction technique
CN105930425A (en) Personalized video recommendation method and apparatus
CN109255651A (en) A kind of search advertisements conversion intelligent Forecasting based on big data
CN104205158A (en) Systems, methods, and articles of manufacture to measure online audiences
CN102004768A (en) Adaptative analytics multidimensional processing system
CN104933622A (en) Microblog popularity degree prediction method based on user and microblog theme and microblog popularity degree prediction system based on user and microblog theme
CN110880127B (en) Consumption level prediction method and device, electronic equipment and storage medium
CN109359868A (en) A kind of construction method and system of power grid user portrait
CN112598438A (en) Outdoor advertisement recommendation system and method based on large-scale user portrait
CN102592236A (en) Internet advertising crowd analysis system and analysis method
CN103136280A (en) System and method of automatically searching hot point video
CN105869016A (en) Method for estimating click through rate based on convolution neural network
CN109325845A (en) A kind of financial product intelligent recommendation method and system
CN108885628A (en) Data analysing method candidate's determination device
CN108009850A (en) Marketing advertisement scheme matching method and system
CN106202371B (en) Media file processing method and device and advertisement analysis method
CN110110372A (en) A kind of user's timing behavior automatic segmentation prediction technique
CN108537635A (en) A kind of recommendation method and device of product
CN114331495A (en) Multimedia data processing method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20200320

Termination date: 20210803