Skip to content

gh-86155: Fix htmlparser "unclosed script tag causes data loss" #22658

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 8 commits into from
May 10, 2025

Conversation

waylan
Copy link
Contributor

@waylan waylan commented Oct 12, 2020

When calling .close() the htmlparser should flush all remaining content,
even when that content is in an unclosed script or style tag.

https://bugs.python.org/issue41989

When calling .close() the htmlparser should flush all remaining content,
even when that content is in an unclosed script or style tag.
@waylan waylan requested a review from ezio-melotti as a code owner October 12, 2020 01:12
@the-knights-who-say-ni
Copy link

Hello, and thanks for your contribution!

I'm a bot set up to make sure that the project can legally accept this contribution by verifying everyone involved has signed the PSF contributor agreement (CLA).

CLA Missing

Our records indicate the following people have not signed the CLA:

@waylan

For legal reasons we need all the people listed to sign the CLA before we can look at your contribution. Please follow the steps outlined in the CPython devguide to rectify this issue.

If you have recently signed the CLA, please wait at least one business day
before our records are updated.

You can check yourself to see if the CLA has been received.

Thanks again for the contribution, we look forward to reviewing it!

@waylan waylan changed the title bpo:41989 Fix htmlparser "unclosed script tag causes data loss" bpo-41989 Fix htmlparser "unclosed script tag causes data loss" Oct 12, 2020
@github-actions
Copy link

This PR is stale because it has been open for 30 days with no activity.

@github-actions github-actions bot added the stale Stale PR or inactive for long period of time. label Dec 17, 2020
@github-actions github-actions bot removed the stale Stale PR or inactive for long period of time. label Jul 30, 2022
@bedevere-bot
Copy link

Most changes to Python require a NEWS entry.

Please add it using the blurb_it web app or the blurb command-line tool.

@arhadthedev arhadthedev changed the title bpo-41989 Fix htmlparser "unclosed script tag causes data loss" gh-86155: Fix htmlparser "unclosed script tag causes data loss" Feb 13, 2023
@arhadthedev arhadthedev added the stdlib Python modules in the Lib dir label Feb 13, 2023
@arhadthedev
Copy link
Member

@ezio-melotti (as an html expert)

@python-cla-bot
Copy link

python-cla-bot bot commented Apr 18, 2025

All commit authors signed the Contributor License Agreement.

CLA signed

Copy link
Member

@serhiy-storchaka serhiy-storchaka left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

@serhiy-storchaka serhiy-storchaka added needs backport to 3.13 bugs and security fixes needs backport to 3.14 bugs and security fixes labels May 10, 2025
@serhiy-storchaka serhiy-storchaka enabled auto-merge (squash) May 10, 2025 16:53
@serhiy-storchaka serhiy-storchaka merged commit 53383e9 into python:main May 10, 2025
39 checks passed
@miss-islington-app
Copy link

Thanks @waylan for the PR, and @serhiy-storchaka for merging it 🌮🎉.. I'm working now to backport this PR to: 3.13, 3.14.
🐍🍒⛏🤖

miss-islington pushed a commit to miss-islington/cpython that referenced this pull request May 10, 2025
…TMLParser (pythonGH-22658)

When calling .close() the HTMLParser should flush all remaining content,
even when that content is in an unclosed script or style tag.
(cherry picked from commit 53383e9)

Co-authored-by: Waylan Limberg <[email protected]>
miss-islington pushed a commit to miss-islington/cpython that referenced this pull request May 10, 2025
…TMLParser (pythonGH-22658)

When calling .close() the HTMLParser should flush all remaining content,
even when that content is in an unclosed script or style tag.
(cherry picked from commit 53383e9)

Co-authored-by: Waylan Limberg <[email protected]>
@bedevere-app
Copy link

bedevere-app bot commented May 10, 2025

GH-133844 is a backport of this pull request to the 3.14 branch.

@bedevere-app bedevere-app bot removed the needs backport to 3.14 bugs and security fixes label May 10, 2025
@bedevere-app
Copy link

bedevere-app bot commented May 10, 2025

GH-133845 is a backport of this pull request to the 3.13 branch.

@bedevere-app bedevere-app bot removed the needs backport to 3.13 bugs and security fixes label May 10, 2025
serhiy-storchaka pushed a commit that referenced this pull request May 10, 2025
…HTMLParser (GH-22658) (GH-133845)

When calling .close() the HTMLParser should flush all remaining content,
even when that content is in an unclosed script or style tag.
(cherry picked from commit 53383e9)

Co-authored-by: Waylan Limberg <[email protected]>
serhiy-storchaka pushed a commit that referenced this pull request May 10, 2025
…HTMLParser (GH-22658) (GH-133844)

When calling .close() the HTMLParser should flush all remaining content,
even when that content is in an unclosed script or style tag.
(cherry picked from commit 53383e9)

Co-authored-by: Waylan Limberg <[email protected]>
@waylan waylan deleted the htmlparser branch May 12, 2025 15:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
stdlib Python modules in the Lib dir
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants