generated from academicpages/academicpages.github.io
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
5 changed files
with
30 additions
and
4 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,6 +1,6 @@ | ||
--- | ||
layout: archive | ||
title: "Publications" | ||
title: "Reports" | ||
permalink: /publications/ | ||
author_profile: true | ||
--- | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,26 @@ | ||
--- | ||
title: "Validation report 002: Go Policy Networks" | ||
collection: publications | ||
permalink: /publication/2024-01-11-GO | ||
excerpt: 'This paper is about fixing template issue #693.' | ||
date: 2024-01-11 | ||
venue: 'Explainable Machine Learning 2023/2024 course' | ||
paperurl: 'https://modeloriented.github.io/CVE-AI/files/2023_GO.pdf' | ||
citation: 'Antoni Hanke, Michal Grotkowski. (2024). "Comparative Analysis of Convolutional and Transformer Architectures in Go Policy Networks." <i>Github: CVE-AI/ModelOriented</i>.' | ||
tags: | ||
- Go Policy Network | ||
- Convolution | ||
- Transformer | ||
--- | ||
|
||
In this report we aim to spot shortcomings of using a convolutional architecture as a Go policy network. By comparing it to an equivalently trained Transformer policy and employing XAI methods such as Ceteris Paribus, we can see where each network under and overperforms. This work points in the direction of further research of Transformer architectures in positional games such as Go, where previously it was believed Convolutions were SoTA. | ||
|
||
<hr/> | ||
|
||
Our experiments show the limitations of convolutional policy networks. They are great at capturing local features, however limited in attending to global phenomena. In some tasks, like the game of Go, this might be detrimental. Transformers on the other hand are able to seamlessly incorporate both the local and global understanding via its flexible attention mechanism. This points in the direction of future work on implementing Transformers as policy networks for large positional games. | ||
|
||
|
||
<hr/> | ||
|
||
Link to original publication with a model: (see the report) | ||
|
Binary file not shown.