Python: Add SSRF queries #7420

RasmusWL · 2021-12-16T01:03:19Z

I've added 2 queries:

one that detects full SSRF, where an attacker can control the full URL, which is always bad
and one for partial SSRF, where an attacker can only control parts of an URL (such as the path, query parameters, or fragment), which is not a big problem in many cases (but could still be exploitable)

full SSRF should run by default, and partial SSRF should not (but having the query included makes it easy to run). I got inspired by this setup from Java where they have a precise and imprecise version of the same query.

Current status

Most of the query work/library modeling is done (although we could always add support for more libraries). Still need to do:

write qhelp
write change-note
Polish sanitizer for full SSRF query so we're able to detect "https://" + user_input is in fact controlling the full URL.
verify FP rates from run across many repos
verify performance looks ok
make some updates to the Ruby code so we're better aligned (but that can wait until after this PR is merged I think)

Commits

Some of the commits changes the concepts that was added in the very first commit. I've kept things this way so it could help to illustrate why I wanted to diverge from the Ruby code.

What is SSRF even?

See https://portswigger.net/web-security/ssrf if you need a refresher on SSRF 😊

Taken from Ruby, except that `getURL` member predicate was changed to `getUrl` to keep consistency with the rest of our concepts, and stick to our naming convention.

For the snippet below, our current query is able to show _why_ we consider `var` to be a falsey value that would disable SSL/TLS verification. I'm not sure we're going to need the part that Ruby did, for being able to specify _where_ the verification was removed, but we'll see. ``` requests.get(url, verify=var) ```

Also adjusts test slightly. Writing `clientRequestDisablesCertValidation=False` to mean that certificate validation was disabled by the `False` expression is just confusing, as it easily reads as _certificate validate was NOT disabled_ :| The new one ties to each request that is being made, which seems like the right setup.

I think `getUrl` is a bit too misleading, since from the name, I would only ever expect ONE result for one request being made. `getAUrlPart` captures that there could be multiple results, and that they might not constitute a whole URl. Which is the same naming I used when I tried to model this a long time ago https://github.com/github/codeql/blob/a80860cdc6b06b363b0d0919600ab383a470b449/python/ql/lib/semmle/python/web/Http.qll#L102-L111

I've added 2 queries: - one that detects full SSRF, where an attacker can control the full URL, which is always bad - and one for partial SSRF, where an attacker can control parts of an URL (such as the path, query parameters, or fragment), which is not a big problem in many cases (but might still be exploitable) full SSRF should run by default, and partial SSRF should not (but makes it easy to see the other results). Some elements of the full SSRF queries needs a bit more polishing, like being able to detect `"https://" + user_input` is in fact controlling the full URL.

python/ql/test/library-tests/frameworks/requests/taint_test.py

Co-authored-by: yoff <[email protected]>

Since that might not be the same place where the vulnerable URL part is.

Now full-ssrf will only alert if **all** URL parts are fully user-controlled.

They were very misleading before, because a sanitizer that happened early, would remove taint from the rest of the cases by use-use flow :|

python/ql/lib/semmle/python/frameworks/Requests.qll

Accidentally committed :|

I included examples of both types in the qhelp of both queries, to provide context of what each of them actually are.

python/ql/test/query-tests/Security/CWE-918-ServerSideRequestForgery/full_partial_test.py

…orgery/full_partial_test.py

yoff

LGTM - thanks for the offline explanations

That was changed in 9866214

yoff

Lgtm

RasmusWL · 2021-12-17T15:20:35Z

RasmusWL added 12 commits December 13, 2021 11:09

Python: Add HTTP::Client::Request concept

5de79b4

Taken from Ruby, except that `getURL` member predicate was changed to `getUrl` to keep consistency with the rest of our concepts, and stick to our naming convention.

Python: Clearer sourceType for client response body

08f6d1a

Python: Add modeling of requests

b68d280

Python: Consider taint of client http requests

35cba17

Python: Model requests Responses

cf2ee06

Python: Add tests of http.client.HTTPResponse

a5bae30

Python: Add modeling of http.client.HTTPResponse

6f81685

Python: Remove getResponse and do manual taint steps

579de0c

RasmusWL requested a review from yoff December 16, 2021 01:03

github-actions bot added documentation Python labels Dec 16, 2021

yoff reviewed Dec 16, 2021

View reviewed changes

python/ql/test/library-tests/frameworks/requests/taint_test.py Outdated Show resolved Hide resolved

RasmusWL and others added 8 commits December 16, 2021 15:19

Python: Apply suggestions from code review

6ce1524

Co-authored-by: yoff <[email protected]>

Python: Minor adjustments to QLDoc of HTTP::Client::Request

5a7efd0

Python: Add interesting test-case

b1bca85

Python: Adjust SSRF location to request call

cb934e1

Since that might not be the same place where the vulnerable URL part is.

Python: Improve full/partial SSRF split

4b5599f

Now full-ssrf will only alert if **all** URL parts are fully user-controlled.

Python: Fix SSRF sanitizer tests

6f297f4

They were very misleading before, because a sanitizer that happened early, would remove taint from the rest of the cases by use-use flow :|

Python: Add tricky .format SSRF tests

8d9a797

Python: Allow http[s]:// prefix for SSRF

1d00730

intrigus-lgtm reviewed Dec 17, 2021

View reviewed changes

python/ql/lib/semmle/python/frameworks/Requests.qll Show resolved Hide resolved

RasmusWL added 2 commits December 17, 2021 09:44

Python: Remove debug predicate

e309d82

Accidentally committed :|

Python: Add SSRF change-note

e7abe43

RasmusWL requested a review from yoff December 17, 2021 09:22

Python: Add SSRF qhelp

83f1b2c

I included examples of both types in the qhelp of both queries, to provide context of what each of them actually are.

RasmusWL marked this pull request as ready for review December 17, 2021 10:49

RasmusWL requested a review from a team as a code owner December 17, 2021 10:49

yoff reviewed Dec 17, 2021

View reviewed changes

python/ql/test/query-tests/Security/CWE-918-ServerSideRequestForgery/full_partial_test.py Outdated Show resolved Hide resolved

yoff and others added 2 commits December 17, 2021 14:26

Update python/ql/test/query-tests/Security/CWE-918-ServerSideRequestF…

9866214

…orgery/full_partial_test.py

Python: Fix typo

626009e

yoff previously approved these changes Dec 17, 2021

View reviewed changes

Python: Adjust .expected based on new comment

83f87f0

That was changed in 9866214

RasmusWL dismissed yoff’s stale review via 83f87f0 December 17, 2021 14:30

RasmusWL requested a review from yoff December 17, 2021 14:31

yoff approved these changes Dec 17, 2021

View reviewed changes

codeql-ci merged commit 5054d5b into github:main Dec 17, 2021

RasmusWL deleted the ssrf-new branch December 17, 2021 15:21

RasmusWL mentioned this pull request Jan 4, 2022

Python: Draft for SSRF query #2933

Closed

RasmusWL mentioned this pull request Mar 22, 2022

Ruby: Minor change of SSRF concept #8524

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python: Add SSRF queries #7420

Python: Add SSRF queries #7420

RasmusWL commented Dec 16, 2021 •

edited

Loading

yoff left a comment

yoff left a comment

RasmusWL commented Dec 17, 2021

Python: Add SSRF queries #7420

Python: Add SSRF queries #7420

Conversation

RasmusWL commented Dec 16, 2021 • edited Loading

Current status

Commits

What is SSRF even?

yoff left a comment

Choose a reason for hiding this comment

yoff left a comment

Choose a reason for hiding this comment

RasmusWL commented Dec 17, 2021

RasmusWL commented Dec 16, 2021 •

edited

Loading