No Cover Image

Journal article 824 views 136 downloads

A probabilistic framework for behavioral identification from animal-borne accelerometers

Jane E. Dentinger, Luca Borger Orcid Logo, Mark Holton Orcid Logo, Ruholla Jafari-Marandi, Durham A. Norman, Brian K. Smith, Seth F. Oppenheimer, Bronson K. Strickland, Rory Wilson Orcid Logo, Garrett M. Street

Ecological Modelling, Volume: 464, Start page: 109818

Swansea University Authors: Luca Borger Orcid Logo, Mark Holton Orcid Logo, Rory Wilson Orcid Logo

  • 58741.pdf

    PDF | Accepted Manuscript

    ©2021 All rights reserved. All article content, except where otherwise noted, is licensed under a Creative Commons Attribution Non-Commercial No Derivatives License (CC-BY-NC-ND)

    Download (971.88KB)

Abstract

Many studies of animal distributions use habitat and climactic variables to explain patterns of observed space use. However, without behavioral information, we can only speculate as to why and how these characteristics are important to species persistence. Animal-borne accelerometer and magnetometer...

Full description

Published in: Ecological Modelling
ISSN: 0304-3800
Published: Elsevier BV 2022
Online Access: Check full text

URI: https://cronfa.swan.ac.uk/Record/cronfa58741
Tags: Add Tag
No Tags, Be the first to tag this record!
first_indexed 2021-11-22T13:17:13Z
last_indexed 2021-12-07T04:15:39Z
id cronfa58741
recordtype SURis
fullrecord <?xml version="1.0"?><rfc1807><datestamp>2021-12-06T16:24:24.6811929</datestamp><bib-version>v2</bib-version><id>58741</id><entry>2021-11-22</entry><title>A probabilistic framework for behavioral identification from animal-borne accelerometers</title><swanseaauthors><author><sid>8416d0ffc3cccdad6e6d67a455e7c4a2</sid><ORCID>0000-0001-8763-5997</ORCID><firstname>Luca</firstname><surname>Borger</surname><name>Luca Borger</name><active>true</active><ethesisStudent>false</ethesisStudent></author><author><sid>0e1d89d0cc934a740dcd0a873aed178e</sid><ORCID>0000-0001-8834-3283</ORCID><firstname>Mark</firstname><surname>Holton</surname><name>Mark Holton</name><active>true</active><ethesisStudent>false</ethesisStudent></author><author><sid>017bc6dd155098860945dc6249c4e9bc</sid><ORCID>0000-0003-3177-0177</ORCID><firstname>Rory</firstname><surname>Wilson</surname><name>Rory Wilson</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2021-11-22</date><deptcode>SBI</deptcode><abstract>Many studies of animal distributions use habitat and climactic variables to explain patterns of observed space use. However, without behavioral information, we can only speculate as to why and how these characteristics are important to species persistence. Animal-borne accelerometer and magnetometer data loggers can be used to detect behaviors and when coupled with telemetry improve our understanding of animal space use and habitat requirements. However, these loggers collect tremendous quantities of data requiring automated machine learning techniques to identify patterns in the data. Supervised machine learning requires a set of training signals with known behaviors to train the model to identify the unique signal characteristics associated with each behavior. In contrast, unsupervised approaches aggregate unlabeled signals into groups based purely on signal similarity but, without additional information, do not identify specific behaviors. In this paper, we propose a probabilistic framework for interpreting uncertainty in machine learning techniques&#x2014;the probability profile&#x2014;and demonstrate how to post hoc identify behaviors within signal groups. We assess model performance using a matrix-based measure of dissimilarity. We used a Random Forest (RF) and a clustered self-organizing map (CSOM) for comparison and demonstrate the use of a behavioral profile for each using a data set of high-frequency accelerometer and magnetometer data collected from 7 captive wild pigs (Sus scrofa) moving in a 1&#xA0;ha outdoor enclosure. We found that the RF had more discrimination than the CSOM which had fewer clusters associated with high probabilities of a single behavior (&gt;50%). The leave-p-out cross validation statistic of the probability matrix ( L 1 &#xAF; ) indicated that there was an average maximum dissimilarity of 20% and 65% between the training and test data sets for the RF and CSOM methods, respectively. Using a probability profile to describe groups predicted from machine learning allows the variation and error inherent in behavioral prediction to be incorporated directly into the model to better reflect the nuances of behavior derived from accelerometer and/or magnetometer signals. We discuss the data requirements of this framework, demonstrate its application to field data, highlight critical assumptions and caveats, and examine how it may be used to generate new ecological inference.</abstract><type>Journal Article</type><journal>Ecological Modelling</journal><volume>464</volume><journalNumber/><paginationStart>109818</paginationStart><paginationEnd/><publisher>Elsevier BV</publisher><placeOfPublication/><isbnPrint/><isbnElectronic/><issnPrint>0304-3800</issnPrint><issnElectronic/><keywords>Accelerometers; Behavior; Machine learning; k-means clustering; SOM; Random forest</keywords><publishedDay>1</publishedDay><publishedMonth>2</publishedMonth><publishedYear>2022</publishedYear><publishedDate>2022-02-01</publishedDate><doi>10.1016/j.ecolmodel.2021.109818</doi><url/><notes/><college>COLLEGE NANME</college><department>Biosciences</department><CollegeCode>COLLEGE CODE</CollegeCode><DepartmentCode>SBI</DepartmentCode><institution>Swansea University</institution><apcterm/><funders>Mississippi Agriculture and Forestry Research Station Strategic Research Initiative Noble Research Institute</funders><lastEdited>2021-12-06T16:24:24.6811929</lastEdited><Created>2021-11-22T13:13:42.3130708</Created><path><level id="1">Faculty of Science and Engineering</level><level id="2">School of Biosciences, Geography and Physics - Biosciences</level></path><authors><author><firstname>Jane E.</firstname><surname>Dentinger</surname><order>1</order></author><author><firstname>Luca</firstname><surname>Borger</surname><orcid>0000-0001-8763-5997</orcid><order>2</order></author><author><firstname>Mark</firstname><surname>Holton</surname><orcid>0000-0001-8834-3283</orcid><order>3</order></author><author><firstname>Ruholla</firstname><surname>Jafari-Marandi</surname><order>4</order></author><author><firstname>Durham A.</firstname><surname>Norman</surname><order>5</order></author><author><firstname>Brian K.</firstname><surname>Smith</surname><order>6</order></author><author><firstname>Seth F.</firstname><surname>Oppenheimer</surname><order>7</order></author><author><firstname>Bronson K.</firstname><surname>Strickland</surname><order>8</order></author><author><firstname>Rory</firstname><surname>Wilson</surname><orcid>0000-0003-3177-0177</orcid><order>9</order></author><author><firstname>Garrett M.</firstname><surname>Street</surname><order>10</order></author></authors><documents><document><filename>58741__21770__b0ec8512a2d348abaeb678e65713e9ae.pdf</filename><originalFilename>58741.pdf</originalFilename><uploaded>2021-12-02T10:29:09.5733483</uploaded><type>Output</type><contentLength>995208</contentLength><contentType>application/pdf</contentType><version>Accepted Manuscript</version><cronfaStatus>true</cronfaStatus><embargoDate>2022-11-14T00:00:00.0000000</embargoDate><documentNotes>&#xA9;2021 All rights reserved. All article content, except where otherwise noted, is licensed under a Creative Commons Attribution Non-Commercial No Derivatives License (CC-BY-NC-ND)</documentNotes><copyrightCorrect>true</copyrightCorrect><language>eng</language><licence>https://www.creativecommons.org/licenses/by-nc-nd/4.0/</licence></document></documents><OutputDurs/></rfc1807>
spelling 2021-12-06T16:24:24.6811929 v2 58741 2021-11-22 A probabilistic framework for behavioral identification from animal-borne accelerometers 8416d0ffc3cccdad6e6d67a455e7c4a2 0000-0001-8763-5997 Luca Borger Luca Borger true false 0e1d89d0cc934a740dcd0a873aed178e 0000-0001-8834-3283 Mark Holton Mark Holton true false 017bc6dd155098860945dc6249c4e9bc 0000-0003-3177-0177 Rory Wilson Rory Wilson true false 2021-11-22 SBI Many studies of animal distributions use habitat and climactic variables to explain patterns of observed space use. However, without behavioral information, we can only speculate as to why and how these characteristics are important to species persistence. Animal-borne accelerometer and magnetometer data loggers can be used to detect behaviors and when coupled with telemetry improve our understanding of animal space use and habitat requirements. However, these loggers collect tremendous quantities of data requiring automated machine learning techniques to identify patterns in the data. Supervised machine learning requires a set of training signals with known behaviors to train the model to identify the unique signal characteristics associated with each behavior. In contrast, unsupervised approaches aggregate unlabeled signals into groups based purely on signal similarity but, without additional information, do not identify specific behaviors. In this paper, we propose a probabilistic framework for interpreting uncertainty in machine learning techniques—the probability profile—and demonstrate how to post hoc identify behaviors within signal groups. We assess model performance using a matrix-based measure of dissimilarity. We used a Random Forest (RF) and a clustered self-organizing map (CSOM) for comparison and demonstrate the use of a behavioral profile for each using a data set of high-frequency accelerometer and magnetometer data collected from 7 captive wild pigs (Sus scrofa) moving in a 1 ha outdoor enclosure. We found that the RF had more discrimination than the CSOM which had fewer clusters associated with high probabilities of a single behavior (>50%). The leave-p-out cross validation statistic of the probability matrix ( L 1 ¯ ) indicated that there was an average maximum dissimilarity of 20% and 65% between the training and test data sets for the RF and CSOM methods, respectively. Using a probability profile to describe groups predicted from machine learning allows the variation and error inherent in behavioral prediction to be incorporated directly into the model to better reflect the nuances of behavior derived from accelerometer and/or magnetometer signals. We discuss the data requirements of this framework, demonstrate its application to field data, highlight critical assumptions and caveats, and examine how it may be used to generate new ecological inference. Journal Article Ecological Modelling 464 109818 Elsevier BV 0304-3800 Accelerometers; Behavior; Machine learning; k-means clustering; SOM; Random forest 1 2 2022 2022-02-01 10.1016/j.ecolmodel.2021.109818 COLLEGE NANME Biosciences COLLEGE CODE SBI Swansea University Mississippi Agriculture and Forestry Research Station Strategic Research Initiative Noble Research Institute 2021-12-06T16:24:24.6811929 2021-11-22T13:13:42.3130708 Faculty of Science and Engineering School of Biosciences, Geography and Physics - Biosciences Jane E. Dentinger 1 Luca Borger 0000-0001-8763-5997 2 Mark Holton 0000-0001-8834-3283 3 Ruholla Jafari-Marandi 4 Durham A. Norman 5 Brian K. Smith 6 Seth F. Oppenheimer 7 Bronson K. Strickland 8 Rory Wilson 0000-0003-3177-0177 9 Garrett M. Street 10 58741__21770__b0ec8512a2d348abaeb678e65713e9ae.pdf 58741.pdf 2021-12-02T10:29:09.5733483 Output 995208 application/pdf Accepted Manuscript true 2022-11-14T00:00:00.0000000 ©2021 All rights reserved. All article content, except where otherwise noted, is licensed under a Creative Commons Attribution Non-Commercial No Derivatives License (CC-BY-NC-ND) true eng https://www.creativecommons.org/licenses/by-nc-nd/4.0/
title A probabilistic framework for behavioral identification from animal-borne accelerometers
spellingShingle A probabilistic framework for behavioral identification from animal-borne accelerometers
Luca Borger
Mark Holton
Rory Wilson
title_short A probabilistic framework for behavioral identification from animal-borne accelerometers
title_full A probabilistic framework for behavioral identification from animal-borne accelerometers
title_fullStr A probabilistic framework for behavioral identification from animal-borne accelerometers
title_full_unstemmed A probabilistic framework for behavioral identification from animal-borne accelerometers
title_sort A probabilistic framework for behavioral identification from animal-borne accelerometers
author_id_str_mv 8416d0ffc3cccdad6e6d67a455e7c4a2
0e1d89d0cc934a740dcd0a873aed178e
017bc6dd155098860945dc6249c4e9bc
author_id_fullname_str_mv 8416d0ffc3cccdad6e6d67a455e7c4a2_***_Luca Borger
0e1d89d0cc934a740dcd0a873aed178e_***_Mark Holton
017bc6dd155098860945dc6249c4e9bc_***_Rory Wilson
author Luca Borger
Mark Holton
Rory Wilson
author2 Jane E. Dentinger
Luca Borger
Mark Holton
Ruholla Jafari-Marandi
Durham A. Norman
Brian K. Smith
Seth F. Oppenheimer
Bronson K. Strickland
Rory Wilson
Garrett M. Street
format Journal article
container_title Ecological Modelling
container_volume 464
container_start_page 109818
publishDate 2022
institution Swansea University
issn 0304-3800
doi_str_mv 10.1016/j.ecolmodel.2021.109818
publisher Elsevier BV
college_str Faculty of Science and Engineering
hierarchytype
hierarchy_top_id facultyofscienceandengineering
hierarchy_top_title Faculty of Science and Engineering
hierarchy_parent_id facultyofscienceandengineering
hierarchy_parent_title Faculty of Science and Engineering
department_str School of Biosciences, Geography and Physics - Biosciences{{{_:::_}}}Faculty of Science and Engineering{{{_:::_}}}School of Biosciences, Geography and Physics - Biosciences
document_store_str 1
active_str 0
description Many studies of animal distributions use habitat and climactic variables to explain patterns of observed space use. However, without behavioral information, we can only speculate as to why and how these characteristics are important to species persistence. Animal-borne accelerometer and magnetometer data loggers can be used to detect behaviors and when coupled with telemetry improve our understanding of animal space use and habitat requirements. However, these loggers collect tremendous quantities of data requiring automated machine learning techniques to identify patterns in the data. Supervised machine learning requires a set of training signals with known behaviors to train the model to identify the unique signal characteristics associated with each behavior. In contrast, unsupervised approaches aggregate unlabeled signals into groups based purely on signal similarity but, without additional information, do not identify specific behaviors. In this paper, we propose a probabilistic framework for interpreting uncertainty in machine learning techniques—the probability profile—and demonstrate how to post hoc identify behaviors within signal groups. We assess model performance using a matrix-based measure of dissimilarity. We used a Random Forest (RF) and a clustered self-organizing map (CSOM) for comparison and demonstrate the use of a behavioral profile for each using a data set of high-frequency accelerometer and magnetometer data collected from 7 captive wild pigs (Sus scrofa) moving in a 1 ha outdoor enclosure. We found that the RF had more discrimination than the CSOM which had fewer clusters associated with high probabilities of a single behavior (>50%). The leave-p-out cross validation statistic of the probability matrix ( L 1 ¯ ) indicated that there was an average maximum dissimilarity of 20% and 65% between the training and test data sets for the RF and CSOM methods, respectively. Using a probability profile to describe groups predicted from machine learning allows the variation and error inherent in behavioral prediction to be incorporated directly into the model to better reflect the nuances of behavior derived from accelerometer and/or magnetometer signals. We discuss the data requirements of this framework, demonstrate its application to field data, highlight critical assumptions and caveats, and examine how it may be used to generate new ecological inference.
published_date 2022-02-01T04:15:31Z
_version_ 1763754042774781952
score 11.035634