| Résumé | Star scientists are highly influential researchers who have made significant contributions to their field, gained widespread recognition, and often attracted substantial research funding. They are critical for the advancement of science and innovation and significantly influence the transfer of knowledge and technology to industry. Identifying potential star scientists before their performance becomes outstanding is important for recruitment, collaboration, networking, and research funding decisions. This study utilizes machine learning techniques and builds four different classifiers, i.e., random forest, support vector machines, naïve bayes, and logistic regression, to predict star scientists in the field of artificial intelligence while highlighting features related to their success. The analysis is based on publication data collected from Scopus from 2000 to 2019, incorporating a diverse set of features such as gender, ethnic diversity, and collaboration network structural properties. The random forest model achieved the best performance with an AUC of 0.75. Our results confirm that star scientists follow different patterns compared to their non-star counterparts in almost all the early-career features. We found that certain features, such as gender and ethnic diversity, play important roles in scientific collaboration and can significantly impact an author’s career development and success. The most important features in predicting star scientists in the field of artificial intelligence were the number of articles, betweenness centrality, research impact indicators, and weighted degree centrality. Our approach offers valuable insights for researchers, practitioners, and funding agencies interested in identifying and supporting talented researchers. |
|---|