Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3269206.3272011acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Web-based Startup Success Prediction

Published: 17 October 2018 Publication History

Abstract

We consider the problem of predicting the success of startup companies at their early development stages. We formulate the task as predicting whether a company that has already secured initial (seed or angel) funding will attract a further round of investment in a given period of time. Previous work on this task has mostly been restricted to mining structured data sources, such as databases of the startup ecosystem consisting of investors, incubators and startups. Instead, we investigate the potential of using web-based open sources for the startup success prediction task and model the task using a very rich set of signals from such sources. In particular, we enrich structured data about the startup ecosystem with information from a business- and employment-oriented social networking service and from the web in general. Using these signals, we train a robust machine learning pipeline encompassing multiple base models using gradient boosting. We show that utilizing companies' mentions on the Web yields a substantial performance boost in comparison to only using structured data about the startup ecosystem. We also provide a thorough analysis of the obtained model that allows one to obtain insights into both the types of useful signals discoverable on the Web and market mechanisms underlying the funding process.

References

[1]
R. Aggarwal and H. Singh. Differential influence of blogs across different stages of decision making: The case of venture capitalists. MIS Q., 37 (4): 1093--1112, Dec. 2013.
[2]
M. D.-K. Ayyagari and V. Asli Maksimovic. Small vs. Young Firms across the World: Contribution to Employment, Job Creation, and Growth. The World Bank, 2011.
[3]
R. Biggadike. The risky business of diversification. Harvard Business Review, 57 (3): 103--111, 1979.
[4]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. Journal of machine Learning research, 3 (Jan): 993--1022, 2003.
[5]
L. Bottou. Online algorithms and stochastic approximations. In D. Saad, editor, Online Learning and Neural Networks. Cambridge University Press, Cambridge, UK, 1998. revised, oct 2012.
[6]
}catboostCatBoost. Gradient boosting on decision trees library. https://github.com/catboost, 2017 a .
[7]
}catboost-fstrCatBoost. Regular feature importance. https://tech.yandex.com/catboost/doc/dg/concepts/fstr-docpage/, 2017 b .
[8]
J. F. Coyle and G. D. Polsky. Acqui-hiring. Duke Law Journal, 63, 2012.
[9]
A. Davila, G. Foster, and M. Gupta. Venture capital financing and the growth of startup firms. Journal of Business Venturing, 18 (6): 689--708, 2003.
[10]
P. Gompers, W. Gornall, S. N. Kaplan, and I. A. Strebulaev. How do venture capitalists make decisions? Techn. report, Nat. Bureau of Econ. Research, 2016.
[11]
nd Lussier(2014)}halabi-model-2014C. E. Halab'i and R. N. Lussier. A model for predicting small firm performance: Increasing the probability of entrepreneurial success in chile. Journal of Small Business and Enterprise Development, 21 (1): 4--25, 2014.
[12]
D. K. Hsu, J. M. Haynie, S. A. Simmons, and A. McKelvie. What matters, matters differently: a conjoint analysis of the decision policies of angel and venture capital investors. Venture Capital, 16 (1): 1--25, 2014.
[13]
A. G. Huang and K. Mamo. Do analysts read the news? In Canadian Academic Accounting Association (CAAA) Annual Conference, May 2016.
[14]
A. Hyytinen, M. Pajarinen, and P. Rouvinen. Does innovativeness reduce startup survival rates? Journal of Business Venturing, 30 (4): 564 -- 581, 2015.
[15]
S. Ioffe and C. Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In ICML, pages 448--456, 2015.
[16]
erg}kaplan-should-2009S. N. Kaplan, B. A. Sensoy, and P. Strömberg. Should investors bet on the jockey or the horse? Evidence from the evolution of firms from early business plans to public companies. The Journal of Finance, 64 (1): 75--115, 2009.
[17]
W. R. Kerr, J. Lerner, and A. Schoar. The consequences of entrepreneurial finance: Evidence from angel financings. The Review of Financial Studies, 27 (1): 20--55, 2011.
[18]
S. Kortum and J. Lerner. Does venture capital spur innovation? In Entrepreneurial inputs and outcomes: New studies of entrepreneurship in the United States, pages 1--44. Emerald Group Publishing Limited, 2001.
[19]
Y. E. Liang and S.-T. D. Yuan. Investors are social animals: Predicting investor behavior using social network features via supervised learning approach. In Eleventh Workshop on Mining and Learning with Graphs (MLG 2013). ACM, 2013.
[20]
R. N. Lussier. A business success versus failure prediction model for service industries. Journal of Business and Entrepreneurship, 8 (2): 23, 1996.
[21]
C. M. Mason and T. Botelho. Comparing the initial screening of investment opportunities by business angel group gatekeepers and individual angels. In 2017 Emerging Trends in Entrepreneurial Finance Conference, March 2017.
[22]
sque}maxwell-business-2011A. L. Maxwell, S. A. Jeffrey, and M. Lévesque. Business angel early stage decision making. Journal of Business Venturing, 26 (2): 212 -- 225, 2011.
[23]
S. B. Moeller, F. P. Schlingemann, and R. M. Stulz. Wealth destruction on a massive scale? a study of acquiring-firm returns in the recent merger wave. The Journal of Finance, 60 (2): 757--782, 2005.
[24]
V. Nair and G. E. Hinton. Rectified linear units improve restricted boltzmann machines. In ICML, pages 807--814, 2010.
[25]
N. Srivastava, G. E. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Dropout: a simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15 (1): 1929--1958, 2014.
[26]
T. Stone, W. Zhang, and X. Zhao. An empirical study of top-n recommendation for venture finance. In CIKM, pages 1865--1868. ACM, 2013.
[27]
J. Surowiecki, M. P. Silverman, et al. The wisdom of crowds. American Journal of Physics, 75 (2): 190--192, 2007.
[28]
A. Wong, M. Bhatia, and Z. Freeman. Angel finance: the other venture capital. Strategic Change, 18 (7--8): 221--230, 2009.
[29]
and Liu}xiang-supervised-2012G. Xiang, Z. Zheng, M. Wen, J. I. Hong, C. P. Rosé, and C. Liu. A supervised approach to predict company acquisition with factual and topic features using profiles and news articles on techcrunch. In ICWSM. AAAI, June 2012.
[30]
C. Zhang, E. Chan, and A. Abdulhamid. Link prediction in bipartite venture capital investment networks. Stanford University, 2015.
[31]
Q. Zhang, T. Ye, M. Essaidi, S. Agarwal, V. Liu, and B. T. Loo. Predicting startup crowdfunding success through longitudinal social engagement analysis. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pages 1937--1946. ACM, 2017.
[32]
H. Zhong, C. Liu, J. Zhong, and H. Xiong. Which startup to invest in: a personalized portfolio strategy. Annals of Operations Research, pages 1--22, 2016.

Cited By

View all
  • (2024)РАЗРАБОТКА ПОДХОДА К ПРОГНОЗИРОВАНИЮ УСПЕШНОСТИ ИННОВАЦИОННЫХ ПРЕДПРИНИМАТЕЛЬСКИХ ПРОЕКТОВ ОРГАНИЗАЦИЙ ВЫСШЕГО ОБРАЗОВАНИЯПрогрессивная экономика10.54861/27131211_2024_10_164(164-174)Online publication date: 8-Oct-2024
  • (2024)What is success? – Concepts and perspectives in the Hungarian startup contextVezetéstudomány / Budapest Management Review10.14267/VEZTUD.2024.11.0455:11(41-52)Online publication date: 14-Nov-2024
  • (2024)Predicting startup success using two bias-free machine learning: resolving data imbalance using generative adversarial networksJournal of Big Data10.1186/s40537-024-00993-811:1Online publication date: 3-Sep-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge Management
October 2018
2362 pages
ISBN:9781450360142
DOI:10.1145/3269206
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. gradient boosting
  2. heterogeneous web data
  3. mining open sources
  4. predictive modeling

Qualifiers

  • Research-article

Conference

CIKM '18
Sponsor:

Acceptance Rates

CIKM '18 Paper Acceptance Rate 147 of 826 submissions, 18%;
Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)248
  • Downloads (Last 6 weeks)36
Reflects downloads up to 24 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)РАЗРАБОТКА ПОДХОДА К ПРОГНОЗИРОВАНИЮ УСПЕШНОСТИ ИННОВАЦИОННЫХ ПРЕДПРИНИМАТЕЛЬСКИХ ПРОЕКТОВ ОРГАНИЗАЦИЙ ВЫСШЕГО ОБРАЗОВАНИЯПрогрессивная экономика10.54861/27131211_2024_10_164(164-174)Online publication date: 8-Oct-2024
  • (2024)What is success? – Concepts and perspectives in the Hungarian startup contextVezetéstudomány / Budapest Management Review10.14267/VEZTUD.2024.11.0455:11(41-52)Online publication date: 14-Nov-2024
  • (2024)Predicting startup success using two bias-free machine learning: resolving data imbalance using generative adversarial networksJournal of Big Data10.1186/s40537-024-00993-811:1Online publication date: 3-Sep-2024
  • (2024)Finding Successful Startups by Using Information Flows Among Investors in Higher Order Network of InvestmentsIEEE Transactions on Computational Social Systems10.1109/TCSS.2024.339443911:5(5803-5814)Online publication date: Oct-2024
  • (2024)Machine Learning Models to Investigate Startup Success in Venture Capital Using Crunchbase Dataset2024 21st International Joint Conference on Computer Science and Software Engineering (JCSSE)10.1109/JCSSE61278.2024.10613650(475-481)Online publication date: 19-Jun-2024
  • (2024)Discovering potential founders within academic institutionsInternational Journal of Data Science and Analytics10.1007/s41060-024-00663-1Online publication date: 14-Oct-2024
  • (2024)Staged link prediction in bipartite investment networks based on pseudo-edge generationInformation Technology and Management10.1007/s10799-024-00421-6Online publication date: 23-Apr-2024
  • (2024)Modeling and prediction of business success: a surveyArtificial Intelligence Review10.1007/s10462-023-10664-457:2Online publication date: 9-Feb-2024
  • (2024)Government Scheme Suggestor for Probable Successful Startup Using Hybrid IntelligenceSmart Systems: Innovations in Computing10.1007/978-981-97-3690-4_49(659-669)Online publication date: 30-Sep-2024
  • (2023)Critical success factors of startups based on modern technology; A systematic reviewRoshd -e- Fanavari10.61186/jstpi.37349.19.76.919:76(9-19)Online publication date: 18-Dec-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media