No Cover Image

Journal article 504 views 105 downloads

Bibliometric Analysis to Scan and Scrape New Datasets: It’s all about that BASS

Athanasios Anastasiou, Brian Perkins, Karen Tingay Orcid Logo

International Journal for Population Data Science, Volume: 1, Issue: 1

Swansea University Author: Karen Tingay Orcid Logo

Check full text

DOI (Published version): 10.23889/ijpds.v1i1.96

Abstract

Objectives The main objective of this poster is to present a pilot project in determining emerging population health themes and identifying key research-enabling datasets ahead of time. At present, large-scale databanks, such as the Secure Anonymised Information Linkage (SAIL) Databank at Swansea Un...

Full description

Published in: International Journal for Population Data Science
ISSN: 2399-4908
Published: IPDLN Conference (August 2016) 2017
Online Access: Check full text

URI: https://cronfa.swan.ac.uk/Record/cronfa39087
first_indexed 2018-03-29T19:34:59Z
last_indexed 2018-03-29T19:34:59Z
id cronfa39087
recordtype SURis
fullrecord <?xml version="1.0"?><rfc1807><datestamp>2018-03-29T14:45:02.8122272</datestamp><bib-version>v2</bib-version><id>39087</id><entry>2018-03-15</entry><title>Bibliometric Analysis to Scan and Scrape New Datasets: It&#x2019;s all about that BASS</title><swanseaauthors><author><sid>e54a01719bded3d289478a854ca6a016</sid><ORCID>0000-0002-0257-9365</ORCID><firstname>Karen</firstname><surname>Tingay</surname><name>Karen Tingay</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2018-03-15</date><deptcode>MEDS</deptcode><abstract>Objectives The main objective of this poster is to present a pilot project in determining emerging population health themes and identifying key research-enabling datasets ahead of time. At present, large-scale databanks, such as the Secure Anonymised Information Linkage (SAIL) Databank at Swansea University Medical School, already manage large quantities of health and administrative linked datasets. While these datasets are valuable for research purposes, complementary datasets may be required by collaborating researchers to answer detailed population health research questions. Dataset acquisition can take several years, which is a serious delay to a project with time-limited funding. The ability to pre-emptively acquire datasets so that these are ready for use before a researcher requests them would obviously be beneficial. However, a recent study conducted by the Farr Cipher team at Swansea University identified over 800 health and administrative datasets in Wales alone. With limited resources such as available funding and time, which of these datasets is worth its effort in acquiring? ApproachBibliometrics has long been a means of measuring the impact of papers on the wider academic community. Lately, the focus of analyses has been extended to include the topics, authorship and citations of the publications. Existing bibliometric data mining techniques suggest that it is possible to identify emerging topic trends and through this assist in prioritising dataset identification and acquisition. The project explored mining available literature through bibliometric analysis in order to predict emerging trends and through these identify potentially relevant and valuable datasets for acquisition on behalf of the Dementias Platform UK (DPUK). Literature searches were conducted for papers published on the topic of &#x201C;dementia&#x201D; over the last 20 years. Additional keywords and topics were extracted to identify emerging areas of research and clinical interest. These were then compared against an existing list of over 800 Welsh datasets currently not held in SAIL. ResultsResults focus on: &#x2022; Using bibliometric methods in the context of DPUK cohort publications &#x2022; Identifying emerging trends in the field of dementia research. &#x2022; Identifying and prioritising datasets which might be useful for the SAIL Databank to acquire</abstract><type>Journal Article</type><journal>International Journal for Population Data Science</journal><volume>1</volume><journalNumber>1</journalNumber><publisher>IPDLN Conference (August 2016)</publisher><issnElectronic>2399-4908</issnElectronic><keywords/><publishedDay>13</publishedDay><publishedMonth>4</publishedMonth><publishedYear>2017</publishedYear><publishedDate>2017-04-13</publishedDate><doi>10.23889/ijpds.v1i1.96</doi><url/><notes/><college>COLLEGE NANME</college><department>Medical School</department><CollegeCode>COLLEGE CODE</CollegeCode><DepartmentCode>MEDS</DepartmentCode><institution>Swansea University</institution><apcterm/><lastEdited>2018-03-29T14:45:02.8122272</lastEdited><Created>2018-03-15T10:28:51.4584340</Created><path><level id="1">Faculty of Medicine, Health and Life Sciences</level><level id="2">Swansea University Medical School - Medicine</level></path><authors><author><firstname>Athanasios</firstname><surname>Anastasiou</surname><order>1</order></author><author><firstname>Brian</firstname><surname>Perkins</surname><order>2</order></author><author><firstname>Karen</firstname><surname>Tingay</surname><orcid>0000-0002-0257-9365</orcid><order>3</order></author></authors><documents><document><filename>0039087-29032018144306.pdf</filename><originalFilename>39087.pdf</originalFilename><uploaded>2018-03-29T14:43:06.3870000</uploaded><type>Output</type><contentLength>218037</contentLength><contentType>application/pdf</contentType><version>Version of Record</version><cronfaStatus>true</cronfaStatus><embargoDate>2018-03-29T00:00:00.0000000</embargoDate><documentNotes>&#xA9;The Authors. Open Access under CC BY-NC-ND 4.0</documentNotes><copyrightCorrect>true</copyrightCorrect><language>eng</language></document></documents><OutputDurs/></rfc1807>
spelling 2018-03-29T14:45:02.8122272 v2 39087 2018-03-15 Bibliometric Analysis to Scan and Scrape New Datasets: It’s all about that BASS e54a01719bded3d289478a854ca6a016 0000-0002-0257-9365 Karen Tingay Karen Tingay true false 2018-03-15 MEDS Objectives The main objective of this poster is to present a pilot project in determining emerging population health themes and identifying key research-enabling datasets ahead of time. At present, large-scale databanks, such as the Secure Anonymised Information Linkage (SAIL) Databank at Swansea University Medical School, already manage large quantities of health and administrative linked datasets. While these datasets are valuable for research purposes, complementary datasets may be required by collaborating researchers to answer detailed population health research questions. Dataset acquisition can take several years, which is a serious delay to a project with time-limited funding. The ability to pre-emptively acquire datasets so that these are ready for use before a researcher requests them would obviously be beneficial. However, a recent study conducted by the Farr Cipher team at Swansea University identified over 800 health and administrative datasets in Wales alone. With limited resources such as available funding and time, which of these datasets is worth its effort in acquiring? ApproachBibliometrics has long been a means of measuring the impact of papers on the wider academic community. Lately, the focus of analyses has been extended to include the topics, authorship and citations of the publications. Existing bibliometric data mining techniques suggest that it is possible to identify emerging topic trends and through this assist in prioritising dataset identification and acquisition. The project explored mining available literature through bibliometric analysis in order to predict emerging trends and through these identify potentially relevant and valuable datasets for acquisition on behalf of the Dementias Platform UK (DPUK). Literature searches were conducted for papers published on the topic of “dementia” over the last 20 years. Additional keywords and topics were extracted to identify emerging areas of research and clinical interest. These were then compared against an existing list of over 800 Welsh datasets currently not held in SAIL. ResultsResults focus on: • Using bibliometric methods in the context of DPUK cohort publications • Identifying emerging trends in the field of dementia research. • Identifying and prioritising datasets which might be useful for the SAIL Databank to acquire Journal Article International Journal for Population Data Science 1 1 IPDLN Conference (August 2016) 2399-4908 13 4 2017 2017-04-13 10.23889/ijpds.v1i1.96 COLLEGE NANME Medical School COLLEGE CODE MEDS Swansea University 2018-03-29T14:45:02.8122272 2018-03-15T10:28:51.4584340 Faculty of Medicine, Health and Life Sciences Swansea University Medical School - Medicine Athanasios Anastasiou 1 Brian Perkins 2 Karen Tingay 0000-0002-0257-9365 3 0039087-29032018144306.pdf 39087.pdf 2018-03-29T14:43:06.3870000 Output 218037 application/pdf Version of Record true 2018-03-29T00:00:00.0000000 ©The Authors. Open Access under CC BY-NC-ND 4.0 true eng
title Bibliometric Analysis to Scan and Scrape New Datasets: It’s all about that BASS
spellingShingle Bibliometric Analysis to Scan and Scrape New Datasets: It’s all about that BASS
Karen Tingay
title_short Bibliometric Analysis to Scan and Scrape New Datasets: It’s all about that BASS
title_full Bibliometric Analysis to Scan and Scrape New Datasets: It’s all about that BASS
title_fullStr Bibliometric Analysis to Scan and Scrape New Datasets: It’s all about that BASS
title_full_unstemmed Bibliometric Analysis to Scan and Scrape New Datasets: It’s all about that BASS
title_sort Bibliometric Analysis to Scan and Scrape New Datasets: It’s all about that BASS
author_id_str_mv e54a01719bded3d289478a854ca6a016
author_id_fullname_str_mv e54a01719bded3d289478a854ca6a016_***_Karen Tingay
author Karen Tingay
author2 Athanasios Anastasiou
Brian Perkins
Karen Tingay
format Journal article
container_title International Journal for Population Data Science
container_volume 1
container_issue 1
publishDate 2017
institution Swansea University
issn 2399-4908
doi_str_mv 10.23889/ijpds.v1i1.96
publisher IPDLN Conference (August 2016)
college_str Faculty of Medicine, Health and Life Sciences
hierarchytype
hierarchy_top_id facultyofmedicinehealthandlifesciences
hierarchy_top_title Faculty of Medicine, Health and Life Sciences
hierarchy_parent_id facultyofmedicinehealthandlifesciences
hierarchy_parent_title Faculty of Medicine, Health and Life Sciences
department_str Swansea University Medical School - Medicine{{{_:::_}}}Faculty of Medicine, Health and Life Sciences{{{_:::_}}}Swansea University Medical School - Medicine
document_store_str 1
active_str 0
description Objectives The main objective of this poster is to present a pilot project in determining emerging population health themes and identifying key research-enabling datasets ahead of time. At present, large-scale databanks, such as the Secure Anonymised Information Linkage (SAIL) Databank at Swansea University Medical School, already manage large quantities of health and administrative linked datasets. While these datasets are valuable for research purposes, complementary datasets may be required by collaborating researchers to answer detailed population health research questions. Dataset acquisition can take several years, which is a serious delay to a project with time-limited funding. The ability to pre-emptively acquire datasets so that these are ready for use before a researcher requests them would obviously be beneficial. However, a recent study conducted by the Farr Cipher team at Swansea University identified over 800 health and administrative datasets in Wales alone. With limited resources such as available funding and time, which of these datasets is worth its effort in acquiring? ApproachBibliometrics has long been a means of measuring the impact of papers on the wider academic community. Lately, the focus of analyses has been extended to include the topics, authorship and citations of the publications. Existing bibliometric data mining techniques suggest that it is possible to identify emerging topic trends and through this assist in prioritising dataset identification and acquisition. The project explored mining available literature through bibliometric analysis in order to predict emerging trends and through these identify potentially relevant and valuable datasets for acquisition on behalf of the Dementias Platform UK (DPUK). Literature searches were conducted for papers published on the topic of “dementia” over the last 20 years. Additional keywords and topics were extracted to identify emerging areas of research and clinical interest. These were then compared against an existing list of over 800 Welsh datasets currently not held in SAIL. ResultsResults focus on: • Using bibliometric methods in the context of DPUK cohort publications • Identifying emerging trends in the field of dementia research. • Identifying and prioritising datasets which might be useful for the SAIL Databank to acquire
published_date 2017-04-13T07:14:02Z
_version_ 1827005732021600256
score 11.056895