feat: positional encoding blog #2483
Conversation
Right now links are to my personal site, will change over to the …
assets/you-could-have-designed-SOTA-positional-encoding/thumbnail.png
🔥
As we can see, without any positional information, the output of a (multi-headed) self-attention operation is **identical for the same token in different positions**, despite the tokens clearly representing distinct entities. Let's begin designing a method of enhancing self-attention with positional information, such that it can determine relationships between words encoded by their positions.
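To make the quoted claim concrete, here is a minimal, self-contained sketch (not taken from the post; a single attention head with toy random projections is enough to show the effect): without positional information, a repeated token produces identical outputs at both of its positions.

```python
# Minimal sketch (not from the post): self-attention without positional
# information gives identical outputs for identical tokens, wherever they sit.
import torch
import torch.nn.functional as F

torch.manual_seed(0)
d_model = 8
vocab = {"the": 0, "dog": 1, "chased": 2, "other": 3}

# "the dog chased the other dog" -- "dog" occurs at positions 1 and 5
token_ids = torch.tensor([vocab[w] for w in "the dog chased the other dog".split()])
x = torch.nn.Embedding(len(vocab), d_model)(token_ids)  # no positions added

# One attention head with random projections (enough to show the effect)
W_q, W_k, W_v = (torch.randn(d_model, d_model) for _ in range(3))
q, k, v = x @ W_q, x @ W_k, x @ W_v
out = F.softmax(q @ k.T / d_model**0.5, dim=-1) @ v

print(torch.allclose(out[1], out[5]))  # True: same token, same output
```

Because the two "dog" positions share the same query vector and attend over the same keys and values, their attention rows, and hence their outputs, are identical.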
This is great. I wonder if we should compare it with the actual embeddings from the model (with positional embedding applied) to show they are different, but it's probably not necessary.
I wanted to do this, but I also wanted to keep the code samples as short as possible. If you're happy, I'll leave it for now.
Sure!
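For reference, a rough sketch of the comparison discussed above (again not part of the PR, and using a standard sinusoidal encoding from Vaswani et al. 2017 rather than whatever the final post settles on): adding positional encodings to the same toy setup makes the two "dog" outputs diverge.

```python
# Sketch continuing the example above (not from the post): adding a
# standard sinusoidal positional encoding makes the two "dog" outputs differ.
import torch
import torch.nn.functional as F

torch.manual_seed(0)
d_model, seq_len = 8, 6
token_ids = torch.tensor([0, 1, 2, 0, 3, 1])  # "the dog chased the other dog"
x = torch.nn.Embedding(4, d_model)(token_ids)

# Sinusoidal positional encoding (Vaswani et al., 2017)
pos = torch.arange(seq_len, dtype=torch.float32).unsqueeze(1)
dim = torch.arange(0, d_model, 2, dtype=torch.float32)
angles = pos / (10000 ** (dim / d_model))
pe = torch.zeros(seq_len, d_model)
pe[:, 0::2], pe[:, 1::2] = torch.sin(angles), torch.cos(angles)
x = x + pe  # positions now baked into the token representations

W_q, W_k, W_v = (torch.randn(d_model, d_model) for _ in range(3))
q, k, v = x @ W_q, x @ W_k, x @ W_v
out = F.softmax(q @ k.T / d_model**0.5, dim=-1) @ v

print(torch.allclose(out[1], out[5]))  # False: positions distinguish them
```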
Co-authored-by: Pedro Cuenca <[email protected]>
Congratulations! You've made it this far! Once merged, the article will appear at https://huggingface.co/blog. Official articles require additional reviews. Alternatively, you can write a community article following the process here.
Preparing the Article
You're not quite done yet, though. Please make sure to follow this process (as documented here), including adding metadata (such as authors) to your `md` file. You can also specify `guest` or `org` for the authors.

Here is an example of a complete PR: #2382
Getting a Review
Please make sure to get a review from someone on your team or a co-author.
Once this is done and once all the steps above are completed, you should be able to merge.
There is no need for additional reviews if you and your co-authors are happy and meet all of the above.
Feel free to add @pcuenca as a reviewer if you want a final check. Keep in mind he'll be biased toward light reviews (e.g., checking for proper metadata) rather than content reviews unless explicitly asked.