E-Thesis 243 views 277 downloads
A sufficient condition for the improvement of Restricted Boltzmann Machines / MARK THOMAS
Swansea University Author: MARK THOMAS
Abstract
This thesis explores Restricted Boltzmann Machines (RBMs) and their training, focusing on the minimization of the Kullback-Leibler (KL) divergence. Neural networks and the importance of the KL divergence are introduced and motivated. Examples of KL divergence calculations are demonstrated for variou...
| Published: |
Swansea, Wales, UK
2023
|
|---|---|
| Institution: | Swansea University |
| Degree level: | Master of Research |
| Degree name: | MSc by Research |
| Supervisor: | Aarts, Gert ; Cucini, Biagio |
| URI: | https://cronfa.swan.ac.uk/Record/cronfa66097 |
| first_indexed |
2024-04-20T10:18:28Z |
|---|---|
| last_indexed |
2024-11-25T14:17:30Z |
| id |
cronfa66097 |
| recordtype |
RisThesis |
| fullrecord |
<?xml version="1.0"?><rfc1807><datestamp>2024-04-20T11:32:11.0047751</datestamp><bib-version>v2</bib-version><id>66097</id><entry>2024-04-20</entry><title>A sufficient condition for the improvement of Restricted Boltzmann Machines</title><swanseaauthors><author><sid>f160e9dde3581f7c178dfeb2436f5df6</sid><firstname>MARK</firstname><surname>THOMAS</surname><name>MARK THOMAS</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2024-04-20</date><abstract>This thesis explores Restricted Boltzmann Machines (RBMs) and their training, focusing on the minimization of the Kullback-Leibler (KL) divergence. Neural networks and the importance of the KL divergence are introduced and motivated. Examples of KL divergence calculations are demonstrated for various model and target distributions. A demonstration of the non-universality of the ability to improve models by introducing a new parameter without re-training the existing ones is made. The Ising model is explored as an example of available training data, and the work of G. Cossu et al., ‘Machine learning determination of dynamical parameters: The Ising model case,’ Phys. Rev. B, 100, 064304 (2019) in training a set of RBMs on the one-dimensional Ising model is successfully reproduced. Connections between the mathematics of RBMs and lattice Quantum Field Theory (QFT) are explored, and insights from QFT are utilized to inform the design choices of RBMs to consider. Leveraging these insights, a linearisation procedure is employed to produce a sufficient condition for the possibility of improvement of an RBM with bilinear inter-layer mixing and a Gaussian hidden layer through the introduction of new parameters, without the need to re-train already-existing parameters. This condition is tested and potential issues with the linearisation procedure performed are highlighted.</abstract><type>E-Thesis</type><journal/><volume/><journalNumber/><paginationStart/><paginationEnd/><publisher/><placeOfPublication>Swansea, Wales, UK</placeOfPublication><isbnPrint/><isbnElectronic/><issnPrint/><issnElectronic/><keywords/><publishedDay>19</publishedDay><publishedMonth>12</publishedMonth><publishedYear>2023</publishedYear><publishedDate>2023-12-19</publishedDate><doi/><url/><notes/><college>COLLEGE NANME</college><CollegeCode>COLLEGE CODE</CollegeCode><institution>Swansea University</institution><supervisor>Aarts, Gert ; Cucini, Biagio</supervisor><degreelevel>Master of Research</degreelevel><degreename>MSc by Research</degreename><apcterm/><funders/><projectreference/><lastEdited>2024-04-20T11:32:11.0047751</lastEdited><Created>2024-04-20T11:14:43.2600472</Created><path><level id="1">Faculty of Science and Engineering</level><level id="2">School of Biosciences, Geography and Physics - Physics</level></path><authors><author><firstname>MARK</firstname><surname>THOMAS</surname><order>1</order></author></authors><documents><document><filename>66097__30079__828ad801073242c1921bba32f075e8d1.pdf</filename><originalFilename>Thomas_Mark_D_A_MSc_Thesis_Final_Redacted_Signature.pdf</originalFilename><uploaded>2024-04-20T11:25:49.0860476</uploaded><type>Output</type><contentLength>3111405</contentLength><contentType>application/pdf</contentType><version>E-Thesis – open access</version><cronfaStatus>true</cronfaStatus><documentNotes>Copyright: The Author, Mark Thomas, 2023.</documentNotes><copyrightCorrect>true</copyrightCorrect><language>eng</language></document></documents><OutputDurs/></rfc1807> |
| spelling |
2024-04-20T11:32:11.0047751 v2 66097 2024-04-20 A sufficient condition for the improvement of Restricted Boltzmann Machines f160e9dde3581f7c178dfeb2436f5df6 MARK THOMAS MARK THOMAS true false 2024-04-20 This thesis explores Restricted Boltzmann Machines (RBMs) and their training, focusing on the minimization of the Kullback-Leibler (KL) divergence. Neural networks and the importance of the KL divergence are introduced and motivated. Examples of KL divergence calculations are demonstrated for various model and target distributions. A demonstration of the non-universality of the ability to improve models by introducing a new parameter without re-training the existing ones is made. The Ising model is explored as an example of available training data, and the work of G. Cossu et al., ‘Machine learning determination of dynamical parameters: The Ising model case,’ Phys. Rev. B, 100, 064304 (2019) in training a set of RBMs on the one-dimensional Ising model is successfully reproduced. Connections between the mathematics of RBMs and lattice Quantum Field Theory (QFT) are explored, and insights from QFT are utilized to inform the design choices of RBMs to consider. Leveraging these insights, a linearisation procedure is employed to produce a sufficient condition for the possibility of improvement of an RBM with bilinear inter-layer mixing and a Gaussian hidden layer through the introduction of new parameters, without the need to re-train already-existing parameters. This condition is tested and potential issues with the linearisation procedure performed are highlighted. E-Thesis Swansea, Wales, UK 19 12 2023 2023-12-19 COLLEGE NANME COLLEGE CODE Swansea University Aarts, Gert ; Cucini, Biagio Master of Research MSc by Research 2024-04-20T11:32:11.0047751 2024-04-20T11:14:43.2600472 Faculty of Science and Engineering School of Biosciences, Geography and Physics - Physics MARK THOMAS 1 66097__30079__828ad801073242c1921bba32f075e8d1.pdf Thomas_Mark_D_A_MSc_Thesis_Final_Redacted_Signature.pdf 2024-04-20T11:25:49.0860476 Output 3111405 application/pdf E-Thesis – open access true Copyright: The Author, Mark Thomas, 2023. true eng |
| title |
A sufficient condition for the improvement of Restricted Boltzmann Machines |
| spellingShingle |
A sufficient condition for the improvement of Restricted Boltzmann Machines MARK THOMAS |
| title_short |
A sufficient condition for the improvement of Restricted Boltzmann Machines |
| title_full |
A sufficient condition for the improvement of Restricted Boltzmann Machines |
| title_fullStr |
A sufficient condition for the improvement of Restricted Boltzmann Machines |
| title_full_unstemmed |
A sufficient condition for the improvement of Restricted Boltzmann Machines |
| title_sort |
A sufficient condition for the improvement of Restricted Boltzmann Machines |
| author_id_str_mv |
f160e9dde3581f7c178dfeb2436f5df6 |
| author_id_fullname_str_mv |
f160e9dde3581f7c178dfeb2436f5df6_***_MARK THOMAS |
| author |
MARK THOMAS |
| author2 |
MARK THOMAS |
| format |
E-Thesis |
| publishDate |
2023 |
| institution |
Swansea University |
| college_str |
Faculty of Science and Engineering |
| hierarchytype |
|
| hierarchy_top_id |
facultyofscienceandengineering |
| hierarchy_top_title |
Faculty of Science and Engineering |
| hierarchy_parent_id |
facultyofscienceandengineering |
| hierarchy_parent_title |
Faculty of Science and Engineering |
| department_str |
School of Biosciences, Geography and Physics - Physics{{{_:::_}}}Faculty of Science and Engineering{{{_:::_}}}School of Biosciences, Geography and Physics - Physics |
| document_store_str |
1 |
| active_str |
0 |
| description |
This thesis explores Restricted Boltzmann Machines (RBMs) and their training, focusing on the minimization of the Kullback-Leibler (KL) divergence. Neural networks and the importance of the KL divergence are introduced and motivated. Examples of KL divergence calculations are demonstrated for various model and target distributions. A demonstration of the non-universality of the ability to improve models by introducing a new parameter without re-training the existing ones is made. The Ising model is explored as an example of available training data, and the work of G. Cossu et al., ‘Machine learning determination of dynamical parameters: The Ising model case,’ Phys. Rev. B, 100, 064304 (2019) in training a set of RBMs on the one-dimensional Ising model is successfully reproduced. Connections between the mathematics of RBMs and lattice Quantum Field Theory (QFT) are explored, and insights from QFT are utilized to inform the design choices of RBMs to consider. Leveraging these insights, a linearisation procedure is employed to produce a sufficient condition for the possibility of improvement of an RBM with bilinear inter-layer mixing and a Gaussian hidden layer through the introduction of new parameters, without the need to re-train already-existing parameters. This condition is tested and potential issues with the linearisation procedure performed are highlighted. |
| published_date |
2023-12-19T05:17:35Z |
| _version_ |
1859432119117283328 |
| score |
11.099876 |

