No Cover Image

Journal article 329 views 61 downloads

Number of Publications on New Clinical Prediction Models: A Bibliometric Review

Banafsheh Arshi Orcid Logo, Laure Wynants Orcid Logo, Eline Rijnhart Orcid Logo, Kelly Reeve Orcid Logo, Laura Cowley, Luc J Smits Orcid Logo

JMIR Medical Informatics, Volume: 13, Start page: e62710

Swansea University Author: Laura Cowley

  • 69987.VOR.pdf

    PDF | Version of Record

    © Banafsheh Arshi, Laure Wynants, Eline Rijnhart, Kelly Reeve, Laura Elizabeth Cowley, Luc J Smits. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY).

    Download (496.34KB)

Check full text

DOI (Published version): 10.2196/62710

Abstract

Concerns have been expressed about the abundance of new clinical prediction models (CPMs) proposed in the literature. However, the extent of this proliferation in prediction research remains unclear. This study aimed to estimate the total and annual number of CPM development-related publications ava...

Full description

Published in: JMIR Medical Informatics
ISSN: 2291-9694
Published: JMIR Publications 2025
Online Access: Check full text

URI: https://cronfa.swan.ac.uk/Record/cronfa69987
first_indexed 2025-07-17T09:33:16Z
last_indexed 2025-07-18T05:00:02Z
id cronfa69987
recordtype SURis
fullrecord <?xml version="1.0"?><rfc1807><datestamp>2025-07-17T10:34:52.2545484</datestamp><bib-version>v2</bib-version><id>69987</id><entry>2025-07-17</entry><title>Number of Publications on New Clinical Prediction Models: A Bibliometric Review</title><swanseaauthors><author><sid>a80501f280e89fee276510b25fc68e77</sid><firstname>Laura</firstname><surname>Cowley</surname><name>Laura Cowley</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2025-07-17</date><deptcode>MEDS</deptcode><abstract>Concerns have been expressed about the abundance of new clinical prediction models (CPMs) proposed in the literature. However, the extent of this proliferation in prediction research remains unclear. This study aimed to estimate the total and annual number of CPM development-related publications available across all medical fields. Using a validated search strategy, we conducted a systematic search of literature for prediction model studies published in Pubmed and Embase between 1995 and the end of 2020. By taking random samples for each year, we identified eligible studies that developed a multivariable model (ie, diagnostic or prognostic) for individual-level prediction of a health outcome across all medical fields. Exclusion criteria included development of models with a single predictor, studies not involving humans, methodological studies, conference abstracts, articles with unavailable full text, and those not available in English. We estimated the total and annual number of published regression-based multivariable CPM development articles, based on the total number of publications, proportion of included articles, and the search sensitivity. Furthermore, we used an adjusted Poisson regression to extrapolate our results to the period 1950-2024. Additionally, we estimated the number of articles that developed CPMs using techniques other than regression (eg, machine learning). From a random sample of 10,660 articles published between 1995 and 2020, 109 regression-based CPM development articles were included. We estimated that 82,772 (95% CI 65,313-100,231) CPM development articles using regression were published, with an acceleration in model development from 2010 onward. With the addition of articles that developed non-regression-based CPMs, the number increased to 147,714 (95% CI 125,201-170,226). After extrapolation to the years 1950-2024, the number of articles increased to 156,673 and 248,431 for regression-based models and total CPMs, respectively. Based on a representative sample of publications from the literature, we estimated that nearly 250,000 articles reporting the development of CPMs across all medical fields were published until 2024. CPM development-related publications continue to increase in number. To prevent research waste and close the gap between research and clinical practice, focus should shift away from developing new CPMs to facilitating model validation and impact assessment of the plethora of existing CPMs. Limitations of this study include restriction of search to articles available in English and development of the validated search strategy prior to the popularity of artificial intelligence and machine learning models.</abstract><type>Journal Article</type><journal>JMIR Medical Informatics</journal><volume>13</volume><journalNumber/><paginationStart>e62710</paginationStart><paginationEnd/><publisher>JMIR Publications</publisher><placeOfPublication/><isbnPrint/><isbnElectronic/><issnPrint/><issnElectronic>2291-9694</issnElectronic><keywords>prediction modeling; model development; external validation; public health; research waste; clinical prediction; clinical practice; online databases; health outcomes; publication records; publication</keywords><publishedDay>4</publishedDay><publishedMonth>7</publishedMonth><publishedYear>2025</publishedYear><publishedDate>2025-07-04</publishedDate><doi>10.2196/62710</doi><url/><notes>Review</notes><college>COLLEGE NANME</college><department>Medical School</department><CollegeCode>COLLEGE CODE</CollegeCode><DepartmentCode>MEDS</DepartmentCode><institution>Swansea University</institution><apcterm>Another institution paid the OA fee</apcterm><funders/><projectreference/><lastEdited>2025-07-17T10:34:52.2545484</lastEdited><Created>2025-07-17T10:07:58.3598836</Created><path><level id="1">Faculty of Medicine, Health and Life Sciences</level><level id="2">Swansea University Medical School - Health Data Science</level></path><authors><author><firstname>Banafsheh</firstname><surname>Arshi</surname><orcid>0000-0003-0223-7872</orcid><order>1</order></author><author><firstname>Laure</firstname><surname>Wynants</surname><orcid>0000-0002-3037-122X</orcid><order>2</order></author><author><firstname>Eline</firstname><surname>Rijnhart</surname><orcid>0000-0001-5300-998X</orcid><order>3</order></author><author><firstname>Kelly</firstname><surname>Reeve</surname><orcid>0000-0001-9325-6467</orcid><order>4</order></author><author><firstname>Laura</firstname><surname>Cowley</surname><order>5</order></author><author><firstname>Luc J</firstname><surname>Smits</surname><orcid>0000-0003-0785-1345</orcid><order>6</order></author></authors><documents><document><filename>69987__34782__475fc5eb8ed5401985252bd2d494ddfd.pdf</filename><originalFilename>69987.VOR.pdf</originalFilename><uploaded>2025-07-17T10:31:52.3355404</uploaded><type>Output</type><contentLength>508254</contentLength><contentType>application/pdf</contentType><version>Version of Record</version><cronfaStatus>true</cronfaStatus><documentNotes>&#xA9; Banafsheh Arshi, Laure Wynants, Eline Rijnhart, Kelly Reeve, Laura Elizabeth Cowley, Luc J Smits. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY).</documentNotes><copyrightCorrect>true</copyrightCorrect><language>eng</language><licence>https://creativecommons.org/licenses/by/4.0/</licence></document></documents><OutputDurs/></rfc1807>
spelling 2025-07-17T10:34:52.2545484 v2 69987 2025-07-17 Number of Publications on New Clinical Prediction Models: A Bibliometric Review a80501f280e89fee276510b25fc68e77 Laura Cowley Laura Cowley true false 2025-07-17 MEDS Concerns have been expressed about the abundance of new clinical prediction models (CPMs) proposed in the literature. However, the extent of this proliferation in prediction research remains unclear. This study aimed to estimate the total and annual number of CPM development-related publications available across all medical fields. Using a validated search strategy, we conducted a systematic search of literature for prediction model studies published in Pubmed and Embase between 1995 and the end of 2020. By taking random samples for each year, we identified eligible studies that developed a multivariable model (ie, diagnostic or prognostic) for individual-level prediction of a health outcome across all medical fields. Exclusion criteria included development of models with a single predictor, studies not involving humans, methodological studies, conference abstracts, articles with unavailable full text, and those not available in English. We estimated the total and annual number of published regression-based multivariable CPM development articles, based on the total number of publications, proportion of included articles, and the search sensitivity. Furthermore, we used an adjusted Poisson regression to extrapolate our results to the period 1950-2024. Additionally, we estimated the number of articles that developed CPMs using techniques other than regression (eg, machine learning). From a random sample of 10,660 articles published between 1995 and 2020, 109 regression-based CPM development articles were included. We estimated that 82,772 (95% CI 65,313-100,231) CPM development articles using regression were published, with an acceleration in model development from 2010 onward. With the addition of articles that developed non-regression-based CPMs, the number increased to 147,714 (95% CI 125,201-170,226). After extrapolation to the years 1950-2024, the number of articles increased to 156,673 and 248,431 for regression-based models and total CPMs, respectively. Based on a representative sample of publications from the literature, we estimated that nearly 250,000 articles reporting the development of CPMs across all medical fields were published until 2024. CPM development-related publications continue to increase in number. To prevent research waste and close the gap between research and clinical practice, focus should shift away from developing new CPMs to facilitating model validation and impact assessment of the plethora of existing CPMs. Limitations of this study include restriction of search to articles available in English and development of the validated search strategy prior to the popularity of artificial intelligence and machine learning models. Journal Article JMIR Medical Informatics 13 e62710 JMIR Publications 2291-9694 prediction modeling; model development; external validation; public health; research waste; clinical prediction; clinical practice; online databases; health outcomes; publication records; publication 4 7 2025 2025-07-04 10.2196/62710 Review COLLEGE NANME Medical School COLLEGE CODE MEDS Swansea University Another institution paid the OA fee 2025-07-17T10:34:52.2545484 2025-07-17T10:07:58.3598836 Faculty of Medicine, Health and Life Sciences Swansea University Medical School - Health Data Science Banafsheh Arshi 0000-0003-0223-7872 1 Laure Wynants 0000-0002-3037-122X 2 Eline Rijnhart 0000-0001-5300-998X 3 Kelly Reeve 0000-0001-9325-6467 4 Laura Cowley 5 Luc J Smits 0000-0003-0785-1345 6 69987__34782__475fc5eb8ed5401985252bd2d494ddfd.pdf 69987.VOR.pdf 2025-07-17T10:31:52.3355404 Output 508254 application/pdf Version of Record true © Banafsheh Arshi, Laure Wynants, Eline Rijnhart, Kelly Reeve, Laura Elizabeth Cowley, Luc J Smits. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). true eng https://creativecommons.org/licenses/by/4.0/
title Number of Publications on New Clinical Prediction Models: A Bibliometric Review
spellingShingle Number of Publications on New Clinical Prediction Models: A Bibliometric Review
Laura Cowley
title_short Number of Publications on New Clinical Prediction Models: A Bibliometric Review
title_full Number of Publications on New Clinical Prediction Models: A Bibliometric Review
title_fullStr Number of Publications on New Clinical Prediction Models: A Bibliometric Review
title_full_unstemmed Number of Publications on New Clinical Prediction Models: A Bibliometric Review
title_sort Number of Publications on New Clinical Prediction Models: A Bibliometric Review
author_id_str_mv a80501f280e89fee276510b25fc68e77
author_id_fullname_str_mv a80501f280e89fee276510b25fc68e77_***_Laura Cowley
author Laura Cowley
author2 Banafsheh Arshi
Laure Wynants
Eline Rijnhart
Kelly Reeve
Laura Cowley
Luc J Smits
format Journal article
container_title JMIR Medical Informatics
container_volume 13
container_start_page e62710
publishDate 2025
institution Swansea University
issn 2291-9694
doi_str_mv 10.2196/62710
publisher JMIR Publications
college_str Faculty of Medicine, Health and Life Sciences
hierarchytype
hierarchy_top_id facultyofmedicinehealthandlifesciences
hierarchy_top_title Faculty of Medicine, Health and Life Sciences
hierarchy_parent_id facultyofmedicinehealthandlifesciences
hierarchy_parent_title Faculty of Medicine, Health and Life Sciences
department_str Swansea University Medical School - Health Data Science{{{_:::_}}}Faculty of Medicine, Health and Life Sciences{{{_:::_}}}Swansea University Medical School - Health Data Science
document_store_str 1
active_str 0
description Concerns have been expressed about the abundance of new clinical prediction models (CPMs) proposed in the literature. However, the extent of this proliferation in prediction research remains unclear. This study aimed to estimate the total and annual number of CPM development-related publications available across all medical fields. Using a validated search strategy, we conducted a systematic search of literature for prediction model studies published in Pubmed and Embase between 1995 and the end of 2020. By taking random samples for each year, we identified eligible studies that developed a multivariable model (ie, diagnostic or prognostic) for individual-level prediction of a health outcome across all medical fields. Exclusion criteria included development of models with a single predictor, studies not involving humans, methodological studies, conference abstracts, articles with unavailable full text, and those not available in English. We estimated the total and annual number of published regression-based multivariable CPM development articles, based on the total number of publications, proportion of included articles, and the search sensitivity. Furthermore, we used an adjusted Poisson regression to extrapolate our results to the period 1950-2024. Additionally, we estimated the number of articles that developed CPMs using techniques other than regression (eg, machine learning). From a random sample of 10,660 articles published between 1995 and 2020, 109 regression-based CPM development articles were included. We estimated that 82,772 (95% CI 65,313-100,231) CPM development articles using regression were published, with an acceleration in model development from 2010 onward. With the addition of articles that developed non-regression-based CPMs, the number increased to 147,714 (95% CI 125,201-170,226). After extrapolation to the years 1950-2024, the number of articles increased to 156,673 and 248,431 for regression-based models and total CPMs, respectively. Based on a representative sample of publications from the literature, we estimated that nearly 250,000 articles reporting the development of CPMs across all medical fields were published until 2024. CPM development-related publications continue to increase in number. To prevent research waste and close the gap between research and clinical practice, focus should shift away from developing new CPMs to facilitating model validation and impact assessment of the plethora of existing CPMs. Limitations of this study include restriction of search to articles available in English and development of the validated search strategy prior to the popularity of artificial intelligence and machine learning models.
published_date 2025-07-04T05:30:09Z
_version_ 1856805598004248576
score 11.09611