Update transformer_tutorial.py | Resolving issue #1778 #2402
Conversation
Add description for positional encoding calculation for Transformers
# input length (in this case, ``10000``). Dividing this term by ``d_model`` scales
# the values to be within a reasonable range for the exponential function.
# The negative sign in front of the logarithm ensures that the values decrease exponentially.
# The reason for writing ``math.log(10000.0)`` instead of ``4`` in the code is to make it clear |
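For context, here is a minimal sketch of the computation these lines describe (the `d_model` value of 512 is only an illustrative assumption, not something fixed by the snippet):

```python
import math
import torch

d_model = 512  # illustrative embedding size, not fixed by the snippet above

# Scaling the exponent by -log(10000.0) / d_model keeps the exp() inputs in a
# modest range, so the resulting values fall between 1 and roughly 1/10000 and
# decrease exponentially as the dimension index grows.
div_term = torch.exp(torch.arange(0, d_model, 2) * (-math.log(10000.0) / d_model))

print(div_term[0].item())   # 1.0 -> highest-frequency term
print(div_term[-1].item())  # ~1.0e-4, close to 1/10000 -> lowest-frequency term
```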
I don't understand this comment. math.log(10000.0) is 9.2, not 4.
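For reference, a quick check of the numbers (the value 4 would only appear with a base-10 logarithm, since 10000 = 10**4):

```python
import math

print(math.log(10000.0))    # 9.210340371976184 (natural log)
print(math.log10(10000.0))  # 4.0 (base-10 log)
print(4 * math.log(10.0))   # 9.210340371976184, same as the first line
```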
Sorry, I removed the redundant description.
Wow. Nice explanation. Thanks.
# for positional encoding. The purpose of this calculation is to create
# a range of values that decrease exponentially.
# This allows the model to learn to attend to positions based on their relative distances.
# The ``math.log(10000.0)`` term in the exponent represents the maximum effective |
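For concreteness, a small self-contained sketch of how this exponent enters the sinusoidal encoding (the `d_model` and `max_len` values are illustrative, and this follows the standard sine/cosine scheme rather than quoting the tutorial file exactly):

```python
import math
import torch

d_model, max_len = 512, 5000  # illustrative sizes, not taken from the snippet above

position = torch.arange(max_len).unsqueeze(1)
div_term = torch.exp(torch.arange(0, d_model, 2) * (-math.log(10000.0) / d_model))

pe = torch.zeros(max_len, d_model)
pe[:, 0::2] = torch.sin(position * div_term)  # even dimensions
pe[:, 1::2] = torch.cos(position * div_term)  # odd dimensions

# The wavelengths form a geometric progression from 2*pi up to roughly
# 10000 * 2*pi, so 10000 bounds the longest wavelength; the maximum input
# length itself is still controlled by max_len.
print(2 * math.pi / div_term[0].item())   # ~6.28, shortest wavelength
print(2 * math.pi / div_term[-1].item())  # ~6.1e4, close to 10000 * 2*pi (~62832)
```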
I'm not sure this is correct; the maximum input length is max_len, not 10000. Am I missing something?
The purpose of this value is to spread the frequencies of the sine and cosine functions over a wide range. This is important because it helps to ensure that the positional encodings are unique for each position in the sequence. Right?
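A small, self-contained check in that spirit (the sizes are arbitrary, and the sine/cosine halves are simply concatenated because the interleaving order does not affect uniqueness):

```python
import math
import torch

d_model, num_positions = 64, 1000  # arbitrary illustrative sizes

position = torch.arange(num_positions).unsqueeze(1)
div_term = torch.exp(torch.arange(0, d_model, 2) * (-math.log(10000.0) / d_model))
pe = torch.cat([torch.sin(position * div_term), torch.cos(position * div_term)], dim=1)

# Every position gets a distinct combination of sine/cosine values,
# so no two rows of pe coincide for these sizes.
print(torch.unique(pe, dim=0).shape[0] == num_positions)  # True
```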
I think I need to update this, too.
Also, could you please make the description shorter and use simpler language? Thank you!
@YoushaaMurhij please submit a PR updating the description as suggested. Tag @NicolasHug and @kit1980 to review your update.
Add description for positional encoding calculation for Transformers
Fixes #1778
Description
Add an explanation of how the positional encoding for Transformers is calculated by taking the logarithm first and then applying the exponential function.
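As a hedged sketch of that equivalence (the `d_model` value is illustrative): both forms compute 1 / 10000^(i / d_model), and the log-then-exp variant is the one commonly used in this kind of code.

```python
import math
import torch

d_model = 512  # illustrative embedding size
i = torch.arange(0, d_model, 2)

# Log-then-exp form, as in the tutorial's div_term:
via_log = torch.exp(i * (-math.log(10000.0) / d_model))
# Direct power form it is equivalent to:
direct = 1.0 / torch.pow(torch.tensor(10000.0), i / d_model)

print(torch.allclose(via_log, direct))  # True (up to floating-point error)
```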
Checklist
cc @suraj813