Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3219819.3219921acmotherconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

How LinkedIn Economic Graph Bonds Information and Product: Applications in LinkedIn Salary

Published: 19 July 2018 Publication History

Abstract

The LinkedIn Salary product was launched in late 2016 with the goal of providing insights on compensation distribution to job seekers, so that they can make more informed decisions when discovering and assessing career opportunities. The compensation insights are provided based on data collected from LinkedIn members and aggregated in a privacy-preserving manner. Given the simultaneous desire for computing robust, reliable insights and for having insights to satisfy as many job seekers as possible, a key challenge is to reliably infer the insights at the company level when there is limited or no data at all. We propose a two-step framework that utilizes a novel, semantic representation of companies (Company2vec) and a Bayesian statistical model to address this problem. Our approach makes use of the rich information present in the LinkedIn Economic Graph, and in particular, uses the intuition that two companies are likely to be similar if employees are very likely to transition from one company to the other and vice versa. We compute embeddings for companies by analyzing the LinkedIn members' company transition data using machine learning algorithms, then compute pairwise similarities between companies based on these embeddings, and finally incorporate company similarities in the form of peer company groups as part of the proposed Bayesian statistical model to predict insights at the company level. We perform extensive validation using several different evaluation techniques, and show that we can significantly increase the coverage of insights while, in fact, even slightly improving the quality of the obtained insights. For example, we were able to compute salary insights for 35 times as many title-region-company combinations in the U.S. as compared to previous work, corresponding to 4.9 times as many monthly active users. Finally, we highlight the lessons learned from practical deployment of our system.

References

[1]
BLS Handbook of Methods, chapter 3, Occupational Employment Statistics. U.S. Bureau of Labor Statistics, 2008. http://www.bls.gov/opub/hom/pdf/homch3.pdf.
[2]
How to rethink the candidate experience and make better hires. CareerBuilder's Candidate Behavior Study, 2016.
[3]
Job seeker nation study. Jobvite, 2016.
[4]
Glassdoor introduces salary estimates in job listings, 2017. https://www.glassdoor.com/press/glassdoor-introduces -salary-estimates-job-listings-reveals-unfilled-jobs-272-billion/.
[5]
Overview of BLS statistics on pay and benefits, 2018. https://www.bls.gov/bls/wages.htm.
[6]
Payscale data & methodology, 2018. http://www.payscale.com/docs/default-source/pdf/data_one_pager.pdf.
[7]
D. Agarwal, R. Agrawal, R. Khanna, and N. Kota. Estimating rates of rare events with multiple hierarchies through scalable log-linear models. In KDD, 2010.
[8]
F. Borisyuk, L. Zhang, and K. Kenthapadi. LiJAR: A system for job application redistribution towards efficient career marketplace. In KDD, 2017.
[9]
X. Chen, D. Banks, and M. West. Bayesian dynamic modeling and monitoring of network flows. arXiv:1805.04667, 2018.
[10]
X. Chen, K. Irie, D. Banks, R. Haslinger, J. Thomas, and M. West. Scalable bayesian modeling, monitoring and analysis of dynamic network flow data. Journal of the American Statistical Association, 2017.
[11]
H.-T. Cheng et al. Wide & deep learning for recommender systems. In Workshop on Deep Learning for Recommender Systems, 2016.
[12]
L. Finkelstein, E. Gabrilovich, Y. Matias, E. Rivlin, Z. Solan, G. Wolfman, and E. Ruppin. Placing search in context: The concept revisited. ACM Transactions on Information Systems, 20(1), 2002.
[13]
J. Garten, K. Sagae, V. Ustun, and M. Dehghani. Combining distributed vector representations for words. In Workshop on Vector Space Modeling for Natural Language Processing, 2015.
[14]
A. Gelman, J. B. Carlin, H. S. Stern, D. B. Dunson, A. Vehtari, and D. B. Rubin. Bayesian data analysis, volume 2. CRC press, 2014.
[15]
S. Harris. How to make the job market work like a supermarket. LinkedIn Pulse, 2016. https://www.linkedin.com/pulse/how-make-job-market-work-like-supermarket-seth-harris.
[16]
P. D. Hoff. A first course in Bayesian statistical methods. Springer Science & Business Media, 2009.
[17]
E. H. Huang, R. Socher, C. D. Manning, and A. Y. Ng. Improving word representations via global context and multiple word prototypes. In ACL, 2012.
[18]
K. Kenthapadi, S. Ambler, L. Zhang, and D. Agarwal. Bringing salary transparency to the world: Computing robust compensation insights via LinkedIn Salary. In CIKM, 2017. Available at https://arxiv.org/abs/1703.09845.
[19]
K. Kenthapadi, A. Chudhary, and S. Ambler. LinkedIn Salary: A system for secure collection and presentation of structured compensation insights to job seekers. In IEEE PAC, 2017. Available at https://arxiv.org/abs/1705.06976.
[20]
S. Kuzi, A. Shtok, and O. Kurland. Query expansion using word embeddings. In CIKM, 2016.
[21]
T. Mikolov, I. Sutskever, K. Chen, G. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In NIPS, 2013.
[22]
M. Smart and A. Chamberlain. Why do workers quit? The factors that predict employee turnover. Glassdoor, 2017. https://www.glassdoor.com/research/app/uploads/sites/2/2017/02/WhyDoWorkersQuit_Glassdoor.pdf.
[23]
C. Spearman. The proof and measurement of association between two things. The American Journal of Psychology, 1987.
[24]
R. Sumbaly, J. Kreps, L. Gao, A. Feinberg, C. Soman, and S. Shah. Serving large-scale batch computed data with project Voldemort. In FAST, 2012.
[25]
A. Timme. The data-driven approach to finding similar companies, 2015. https://blog.datafox.com/the-data-driven-approach-to-finding-similar-companies.
[26]
J. Weiner. The future of LinkedIn and the Economic Graph. LinkedIn Pulse, 2012.
[27]
T. Yamane. Statistics: An introductory analysis. 1973.
[28]
L. Zhang and D. Agarwal. Fast computation of posterior mode in multi-level hierarchical models. In NIPS, 2009.

Cited By

View all
  • (2024)Company Name Matching Using Job Market Data EnrichmentIT Professional10.1109/MITP.2024.337117926:2(76-82)Online publication date: 3-May-2024
  • (2023)News-Based Sparse Machine Learning Models for Adaptive Asset PricingData Science in Science10.1080/26941899.2023.21878952:1Online publication date: 3-Apr-2023
  • (2021)Bridging Machine Learning and Mechanism Design towards Algorithmic FairnessProceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency10.1145/3442188.3445912(489-503)Online publication date: 3-Mar-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
July 2018
2925 pages
ISBN:9781450355520
DOI:10.1145/3219819
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 July 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. bayesian smoothing
  2. company embeddings
  3. company2vec
  4. job transition analysis
  5. linkedin economic graph
  6. peer company group
  7. salary prediction

Qualifiers

  • Research-article

Conference

KDD '18
Sponsor:

Acceptance Rates

KDD '18 Paper Acceptance Rate 107 of 983 submissions, 11%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)36
  • Downloads (Last 6 weeks)4
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Company Name Matching Using Job Market Data EnrichmentIT Professional10.1109/MITP.2024.337117926:2(76-82)Online publication date: 3-May-2024
  • (2023)News-Based Sparse Machine Learning Models for Adaptive Asset PricingData Science in Science10.1080/26941899.2023.21878952:1Online publication date: 3-Apr-2023
  • (2021)Bridging Machine Learning and Mechanism Design towards Algorithmic FairnessProceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency10.1145/3442188.3445912(489-503)Online publication date: 3-Mar-2021
  • (2021)On Aggregating Salaries of Occupations From Job Post and Review DataIEEE Access10.1109/ACCESS.2021.30662049(43422-43433)Online publication date: 2021
  • (2020)Enhancing Employer Brand Evaluation with Collaborative Topic Regression ModelsACM Transactions on Information Systems10.1145/339273438:4(1-33)Online publication date: 23-May-2020
  • (2019)Large-Scale Talent Flow Forecast with Dynamic Latent Factor Model?The World Wide Web Conference10.1145/3308558.3313525(2312-2322)Online publication date: 13-May-2019
  • (2019)On Analysing Supply and Demand in Labor Markets: Framework, Model and System2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA)10.1109/DSAA.2019.00066(511-520)Online publication date: Oct-2019
  • (2018)Big Social Data Mining in a Cloud Computing Environment2018 International Conference on Cloud Computing, Big Data and Blockchain (ICCBB)10.1109/ICCBB.2018.8756461(1-8)Online publication date: Nov-2018
  • (2018)Mining ‘Following’ Patterns from Big but Sparsely Distributed Social Network Data2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM)10.1109/ASONAM.2018.8508660(916-919)Online publication date: Aug-2018

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media