Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2835776.2835791acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article
Open access

Project Success Prediction in Crowdfunding Environments

Published: 08 February 2016 Publication History

Abstract

Crowdfunding has gained widespread attention in recent years. Despite the huge success of crowdfunding platforms, the percentage of projects that succeed in achieving their desired goal amount is only around 40%. Moreover, many of these crowdfunding platforms follow "all-or-nothing" policy which means the pledged amount is collected only if the goal is reached within a certain predefined time duration. Hence, estimating the probability of success for a project is one of the most important research challenges in the crowdfunding domain. To predict the project success, there is a need for new prediction models that can potentially combine the power of both classification (which incorporate both successful and failed projects) and regression (for estimating the time for success). In this paper, we formulate the project success prediction as a survival analysis problem and apply the censored regression approach where one can perform regression in the presence of partial information. We rigorously study the project success time distribution of crowdfunding data and show that the logistic and log-logistic distributions are a natural choice for learning from such data. We investigate various censored regression models using comprehensive data of 18K Kickstarter (a popular crowdfunding platform) projects and 116K corresponding tweets collected from Twitter. We show that the models that take complete advantage of both the successful and failed projects during the training phase will perform significantly better at predicting the success of future projects compared to the ones that only use the successful projects. We provide a rigorous evaluation on many sets of relevant features and show that adding few temporal features that are obtained at the project's early stages can dramatically improve the performance.

References

[1]
J. Andreoni. Impure altruism and donations to public goods: a theory of warm-glow giving. The economic journal, pages 464--477, 1990.
[2]
A. Ashta and D. Assadi. Do social cause and social technology meet? impact of web 2.0 technologies on peer-to-peer lending transactions. Cahiers du CEREN, 29:177--192, 2009.
[3]
S. Bennett. Log-logistic regression models for survival data. Applied Statistics, pages 165--171, 1983.
[4]
T. Bruett. Cows, kiva, and prosper. com: How disintermediation and the internet are changing microfinance. Community Development Investment Review, 3(2):44--50, 2007.
[5]
J. Buckley and I. James. Linear regression with censored data. Biometrika, 66(3):429--436, 1979.
[6]
O. Chapelle. Modeling delayed feedback in display advertising. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 1097--1105. ACM, 2014.
[7]
J. Choo, D. Lee, B. Dilkina, H. Zha, and H. Park. To gather together for a better world: understanding and leveraging communities in micro-lending recommendation. In Proceedings of the 23rd international conference on World wide web, pages 249--260, 2014.
[8]
D. R. Cox. Regression models and life-tables. Journal of the Royal Statistical Society. Series B (Methodological), pages 187--220, 1972.
[9]
B. Efron. The efficiency of cox's likelihood function for censored data. Journal of the American Statistical Association, 72(359):557--565, 1977.
[10]
V. Etter, M. Grossglauser, and P. Thiran. Launch hard or go home!: predicting the success of kickstarter campaigns. In Proceedings of the first ACM conference on Online social networks, pages 177--182, 2013.
[11]
J. Galak, D. Small, and A. T. Stephen. Microfinance decision making: A field study of prosocial lending. Journal of Marketing Research, 48(SPL):S130--S137, 2011.
[12]
E. M. Gerber, J. S. Hui, and P.-Y. Kuo. Crowdfunding: Why people are motivated to post and fund projects on crowdfunding platforms. In CSCW Workshop, 2012.
[13]
F. E. Harrell, R. M. Cali, D. B. Pryor, K. L. Lee, and R. A. Rosati. Evaluating the yield of medical tests. JAMA, 247(18):2543--2546, 1982.
[14]
F. E. Harrell, K. L. Lee, R. M. Cali, D. B. Pryor, and R. A. Rosati. Regression modelling strategies for improved prognostic prediction. Statistics in medicine, 3(2):143--152, 1984.
[15]
S. Heyman. Keeping up with kickstarter. The New York Times, URL http://www.nytimes.com/2015/01/15/arts/ international/keeping-up-with-kickstarter.html, 2015.
[16]
T. Hothorn, P. Buehlmann, T. Kneib, M. Schmid, and B. Hofner. Model-based boosting 2.0. Journal of Machine Learning Research, 11:2109--2113, 2010.
[17]
J. S. Hui, M. D. Greenberg, and E. M. Gerber. Understanding the role of community in crowdfunding work. In Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing, pages 62--74. ACM, 2014.
[18]
E. L. Kaplan and P. Meier. Nonparametric estimation from incomplete observations. Journal of the American statistical association, 53(282):457--481, 1958.
[19]
J. P. Klein and M.-J. Zhang. Survival analysis, software. Wiley Online Library, 2005.
[20]
V. Kuppuswamy and B. L. Bayus. Crowdfunding creative ideas: The dynamics of project backers in kickstarter. SSRN Electronic Journal, 2013.
[21]
E. T. Lee and J. Wang. Statistical methods for survival data analysis, volume 476. Wiley. com, 2003.
[22]
C.-T. Lu, S. Xie, X. Kong, and P. S. Yu. Inferring the impacts of social media on crowdfunding. In Proceedings of the 7th ACM international conference on Web search and data mining, pages 573--582, 2014.
[23]
M. Lunn and D. McNeil. Applying cox regression to competing risks. Biometrics, pages 524--532, 1995.
[24]
A. Mayr and M. Schmid. Boosting the concordance index for survival data--a unified framework to derive and evaluate biomarker combinations. 2014.
[25]
R. G. Miller Jr. Survival analysis, volume 66. John Wiley & Sons, 2011.
[26]
V. Rakesh, J. Choo, and C. K. Reddy. Project recommendation using heterogeneous traits in crowdfunding. In Ninth International AAAI Conference on Web and Social Media, 2015.
[27]
V. Rakesh, W.-C. Lee, and C. K. Reddy. Probabilistic group recommendation model for crowdfunding domains. In Proceedings of the ACM International Conference on Web Search and Data Mining (WSDM). ACM, 2016.
[28]
C. K. Reddy and Y. Li. A review of clinical prediction models. In C. K. Reddy and C. C. Aggarwal, editors, Healthcare Data Analytics. Chapman and Hall/CRC Press, 2015.
[29]
T. Therneau. A package for survival analysis in s. r package version 2.37-4. URL http://CRAN. R-project. org/package= survival. Box, 980032:23298-0032, 2013.
[30]
J. Tobin. Estimation of relationships for limited dependent variables. Econometrica: journal of the Econometric Society, pages 24--36, 1958.
[31]
Z. Wang and C. Wang. Buckley-james boosting for survival analysis with high-dimensional biomarker data. Statistical Applications in Genetics and Molecular Biology, 9(1), 2010.
[32]
A. Xu, X. Yang, H. Rao, W.-T. Fu, S.-W. Huang, and B. P. Bailey. Show me the money!: an analysis of project updates during crowdfunding campaigns. In Proceedings of the 32nd annual ACM conference on Human factors in computing systems, pages 591--600, 2014.

Cited By

View all
  • (2024)Use of Patterns of Service Utilization and Hierarchical Survival Analysis in Planning and Providing Care for Overdose Patients and Predicting the Time-to-Second OverdoseKnowledge10.3390/knowledge40300244:3(444-461)Online publication date: 19-Aug-2024
  • (2024)Making Early and Accurate Deep Learning Predictions to Help Disadvantaged Individuals in Medical CrowdfundingProduction and Operations Management10.1177/10591478241231846Online publication date: 1-Mar-2024
  • (2024)Agile Project Status Prediction Using Interpretable Machine Learning2024 IEEE 12th International Conference on Intelligent Systems (IS)10.1109/IS61756.2024.10705197(1-8)Online publication date: 29-Aug-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
WSDM '16: Proceedings of the Ninth ACM International Conference on Web Search and Data Mining
February 2016
746 pages
ISBN:9781450337168
DOI:10.1145/2835776
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 February 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. crowdfunding.
  2. prediction
  3. project success
  4. regression
  5. survival analysis

Qualifiers

  • Research-article

Funding Sources

Conference

WSDM 2016
WSDM 2016: Ninth ACM International Conference on Web Search and Data Mining
February 22 - 25, 2016
California, San Francisco, USA

Acceptance Rates

WSDM '16 Paper Acceptance Rate 67 of 368 submissions, 18%;
Overall Acceptance Rate 498 of 2,863 submissions, 17%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)292
  • Downloads (Last 6 weeks)40
Reflects downloads up to 25 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Use of Patterns of Service Utilization and Hierarchical Survival Analysis in Planning and Providing Care for Overdose Patients and Predicting the Time-to-Second OverdoseKnowledge10.3390/knowledge40300244:3(444-461)Online publication date: 19-Aug-2024
  • (2024)Making Early and Accurate Deep Learning Predictions to Help Disadvantaged Individuals in Medical CrowdfundingProduction and Operations Management10.1177/10591478241231846Online publication date: 1-Mar-2024
  • (2024)Agile Project Status Prediction Using Interpretable Machine Learning2024 IEEE 12th International Conference on Intelligent Systems (IS)10.1109/IS61756.2024.10705197(1-8)Online publication date: 29-Aug-2024
  • (2024)Multimodal dynamic graph convolutional network for crowdfunding success predictionApplied Soft Computing10.1016/j.asoc.2024.111313154(111313)Online publication date: Mar-2024
  • (2024)6G secure quantum communication: a success probability prediction modelAutomated Software Engineering10.1007/s10515-024-00427-y31:1Online publication date: 29-Mar-2024
  • (2024)Modeling and prediction of business success: a surveyArtificial Intelligence Review10.1007/s10462-023-10664-457:2Online publication date: 9-Feb-2024
  • (2023)Stabilising Job Survival Analysis for Disability Employment Services in Unseen EnvironmentsProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599908(4970-4980)Online publication date: 6-Aug-2023
  • (2023)Personal or General? A Hybrid Strategy with Multi-factors for News RecommendationACM Transactions on Information Systems10.1145/355537341:2(1-29)Online publication date: 13-Apr-2023
  • (2023)Feature selection for Turkish Crowdfunding projects with using filtering and wrapping methodsElectronic Commerce Research and Applications10.1016/j.elerap.2023.101340(101340)Online publication date: Dec-2023
  • (2023)A Survey of Machine Learning Methodologies for Loan Evaluation in Peer-to-Peer (P2P) LendingData Analytics for Management, Banking and Finance10.1007/978-3-031-36570-6_1(1-49)Online publication date: 5-Jun-2023
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media