How Founders, Investors, and Investment Rounds Inform Startup Success Predictions

The feature space for the startup success model includes data on founders, investors, and investment rounds, with categorical and numerical features aggregated and encoded. Social media links and investment history are considered, with NMF used to process text tags. All features are carefully engineered to avoid information leakage and ensure accuracy in predictions.


This content originally appeared on HackerNoon and was authored by ExitStrategy

:::info Authors:

(1) Mark Potanin, a Corresponding (authorpotanin.m.st@gmail.com);

(2) Andrey Chertok, (a.v.chertok@gmail.com);

(3) Konstantin Zorin, (berzqwer@gmail.com);

(4) Cyril Shtabtsovsky, (cyril@aloniq.com).

:::

Abstract and 1. Introduction

2 Related works

3 Dataset Overview, Preprocessing, and Features

3.1 Successful Companies Dataset and 3.2 Unsuccessful Companies Dataset

3.3 Features

4 Model Training, Evaluation, and Portfolio Simulation and 4.1 Backtest

4.2 Backtest settings

4.3 Results

4.4 Capital Growth

5 Other approaches

5.1 Investors ranking model

5.2 Founders ranking model and 5.3 Unicorn recommendation model

6 Conclusion

7 Further Research, References and Appendix

3.3 Features

The feature space of the model includes:

\ Founders Features Categorical: countrycode, region, city, institutionname, degreetype, subject. Numerical: twitterurl, linkedinurl, facebookurl, gender, iscompleted, numdegrees, numlaststartups, numlastjobs, numberoffounders.

\ We incorporated three binary flags into our model to represent the presence of founders’ social media links. Since a company can have multiple founders, it was essential to aggregate information on all the founders for each company. For categorical variables, the most frequent value from the list was used, and the median for numerical variables.

\ Investors Features Categorical: type, countrycode, region, city, investortypes. Numerical: investmentcount, totalfundingusd, twitterurl, linkedinurl, facebookurl, raisedamountusd, investorcount, numfull.

\ Figure 1: Model architecture

\ Functions that generate features based on the founders’ and investors’ data incorporate a date parameter as input. This approach is necessary to the model from using future information. For example, details about the number of companies founded or the founder’s previous job experience that took place after the date of interest should not be incorporated into the feature set to avoid information leakage from the future.

\ Rounds features Categorical: countrycode, investmenttype, region, city, investorname. Numerical: sum, mean, max of raisedamountusd, investorcount, postmoneyvaluation_usd.

\ It is crucial to emphasize that all features related to a company’s investment rounds are gathered at a time point prior to the beginning of the time window of interest.

\ Categories There are two additional types of text data - text tags representing the companies’ field of work. For example:

\ • category_list: Internet, Social Media, Social Network

\ • categorygroupslist: Data and Analytics, Information Technology, Software

\ The set of tags used in our study consists of a list of keywords separated by commas. We used the NMF (Non-Negative Matrix Factorization) matrix factorization method to generate features from these tags. This process involves creating a binary table with companies represented as rows and tags as columns, where each value in the table indicates whether a given company is associated with a specific tag (1) or not (0). The trained matrix factorization then converts each binary vector into a smaller dimension vector (in our case, 30).

\ All categorical features are encoded using the OrdinalEncoder, while numerical features are normalized.

\

:::info This paper is available on arxiv under CC 4.0 license.

:::

\


This content originally appeared on HackerNoon and was authored by ExitStrategy


Print Share Comment Cite Upload Translate Updates
APA

ExitStrategy | Sciencx (2024-08-07T18:20:18+00:00) How Founders, Investors, and Investment Rounds Inform Startup Success Predictions. Retrieved from https://www.scien.cx/2024/08/07/how-founders-investors-and-investment-rounds-inform-startup-success-predictions/

MLA
" » How Founders, Investors, and Investment Rounds Inform Startup Success Predictions." ExitStrategy | Sciencx - Wednesday August 7, 2024, https://www.scien.cx/2024/08/07/how-founders-investors-and-investment-rounds-inform-startup-success-predictions/
HARVARD
ExitStrategy | Sciencx Wednesday August 7, 2024 » How Founders, Investors, and Investment Rounds Inform Startup Success Predictions., viewed ,<https://www.scien.cx/2024/08/07/how-founders-investors-and-investment-rounds-inform-startup-success-predictions/>
VANCOUVER
ExitStrategy | Sciencx - » How Founders, Investors, and Investment Rounds Inform Startup Success Predictions. [Internet]. [Accessed ]. Available from: https://www.scien.cx/2024/08/07/how-founders-investors-and-investment-rounds-inform-startup-success-predictions/
CHICAGO
" » How Founders, Investors, and Investment Rounds Inform Startup Success Predictions." ExitStrategy | Sciencx - Accessed . https://www.scien.cx/2024/08/07/how-founders-investors-and-investment-rounds-inform-startup-success-predictions/
IEEE
" » How Founders, Investors, and Investment Rounds Inform Startup Success Predictions." ExitStrategy | Sciencx [Online]. Available: https://www.scien.cx/2024/08/07/how-founders-investors-and-investment-rounds-inform-startup-success-predictions/. [Accessed: ]
rf:citation
» How Founders, Investors, and Investment Rounds Inform Startup Success Predictions | ExitStrategy | Sciencx | https://www.scien.cx/2024/08/07/how-founders-investors-and-investment-rounds-inform-startup-success-predictions/ |

Please log in to upload a file.




There are no updates yet.
Click the Upload button above to add an update.

You must be logged in to translate posts. Please log in or register.