Replace jsonpickle with json to serialize entity #275

lupengamzn · 2021-03-17T21:05:57Z

Issue #, if available:

Security issue regarding decode API in jsonpickle: jsonpickle/jsonpickle#335
The X-Ray Python SDK doesn't use the decode API, but it is still better to find a replacement.

Description of changes:

Replace jsonpickle with json to serialize entity data
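
As a rough illustration of the approach described above, the sketch below serializes a custom object with the standard-library `json` module and a `default` hook. The `Point` class and the `to_serializable` helper are hypothetical names chosen for this example; they are not part of the SDK, and the PR's actual implementation may differ.

```python
import json


class Point(object):
    """Hypothetical stand-in for an entity; not an SDK class."""

    def __init__(self, x, y):
        self.x = x
        self.y = y


def to_serializable(obj):
    # json.dumps calls this hook only for objects it cannot encode natively.
    # Returning the instance's attribute dict is an illustrative choice.
    if hasattr(obj, "__dict__"):
        return vars(obj)
    raise TypeError("%r is not JSON serializable" % (obj,))


# sort_keys makes the output deterministic across Python versions.
payload = json.dumps({"point": Point(1, 2)}, default=to_serializable, sort_keys=True)
# payload == '{"point": {"x": 1, "y": 2}}'
```

With its default settings, `json.loads` only builds plain dicts, lists, strings, numbers, booleans, and `None`, so output produced this way cannot be used to reconstruct arbitrary objects on decode.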

Benchmark:

Some initial benchmark results using pytest-benchmark on Python 2.7:

I will automate the benchmark across the library shortly.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

codecov-io · 2021-03-17T21:11:45Z

Codecov Report

Merging #275 (7abff72) into master (508f929) will increase coverage by 0.17%.
The diff coverage is 94.02%.

@@            Coverage Diff             @@
##           master     #275      +/-   ##
==========================================
+ Coverage   79.34%   79.52%   +0.17%     
==========================================
  Files          82       83       +1     
  Lines        3249     3277      +28     
==========================================
+ Hits         2578     2606      +28     
  Misses        671      671

Impacted Files	Coverage Δ
aws_xray_sdk/core/utils/conversion.py	`85.71% <85.71%> (ø)`
aws_xray_sdk/core/models/entity.py	`92.85% <100.00%> (+0.23%)`	⬆️
aws_xray_sdk/core/models/segment.py	`97.14% <100.00%> (-0.12%)`	⬇️
aws_xray_sdk/core/models/subsegment.py	`96.82% <100.00%> (-0.15%)`	⬇️
aws_xray_sdk/core/models/throwable.py	`89.13% <100.00%> (+9.13%)`	⬆️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

srprash

Looks good. Some minor comments.

aws_xray_sdk/core/models/entity.py

aws_xray_sdk/core/utils/conversion.py

srprash · 2021-03-22T07:49:16Z

tox.ini

@@ -18,8 +18,8 @@ deps =
    requests
    bottle >= 0.10
    flask >= 0.10
-    sqlalchemy
-    Flask-SQLAlchemy
+    sqlalchemy==1.3.*


I think we should only pin for py 3.4 and 3.5, and should still test with the latest versions of sqlalchemy and flask-sqlalchemy for the supported Python versions. Ideally, the SDK should work with every sqlalchemy version supported for each Python version.

lupengamzn

The other py SDKs will fail the tests with the latest sqlalchemy and flask-sqlalchemy versions.

srprash

Why do the X-Ray SDK tests fail with the latest version of sqlalchemy? We are not using any of the internal APIs of sqlalchemy in our tests, right? Does that mean the SDK is not compatible with the latest sqlalchemy for the supported Python versions?

lupengamzn

Some of the unit tests use the API (for example), which fails with the latest sqlalchemy and flask-sqlalchemy. I'm not sure whether the Python SDK is compatible with the latest sqlalchemy, but at least some unit tests need to update the sqlalchemy and flask-sqlalchemy APIs they use.

srprash

LGTM

bhautikpip

Should we also add performance results with this PR? This would help customers understand the performance improvement in the next release due to this serialization logic change.

bhautikpip · 2021-03-22T21:51:30Z

aws_xray_sdk/core/models/entity.py

+
+        for key, value in vars(self).items():
+            if isinstance(value, bool) or value:
+                if key == 'subsegments':


How do we handle the case where key is annotations, aws, or http? I see that you're handling the metadata case separately. Shouldn't we do the same for annotations too, since in that case we also don't know how many key-value pairs there will be?

lupengamzn

annotations, aws, and http are all dicts with serializable values such as strings, ints, or booleans, so there is no need to handle these cases separately.
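
The exchange above can be sketched as follows. This is a simplified, hypothetical `SketchEntity` class that borrows the loop shape from the diff hunk; the field set, the `str` fallback for metadata, and the class name are assumptions made for illustration, not the SDK's actual code.

```python
import datetime
import json


class SketchEntity(object):
    """Hypothetical, simplified entity; field names follow the review thread."""

    def __init__(self, name):
        self.name = name
        self.error = False      # booleans are kept even when False
        self.annotations = {}   # empty containers are dropped
        self.http = {}
        self.metadata = {}
        self.subsegments = []

    def to_dict(self):
        entity_dict = {}
        for key, value in vars(self).items():
            # Keep booleans unconditionally; skip other falsy values.
            if isinstance(value, bool) or value:
                if key == 'subsegments':
                    # Nested entities are not JSON-native, so convert each one.
                    entity_dict[key] = [s.to_dict() for s in value]
                elif key == 'metadata':
                    # Metadata may hold arbitrary objects; str() is a
                    # placeholder fallback chosen for this sketch only.
                    entity_dict[key] = json.loads(json.dumps(value, default=str))
                else:
                    # annotations, aws, http: plain dicts of str/int/bool values.
                    entity_dict[key] = value
        return entity_dict


parent = SketchEntity('parent')
parent.annotations['user'] = 'alice'
parent.metadata['when'] = datetime.date(2021, 3, 22)
parent.subsegments.append(SketchEntity('child'))

document = json.dumps(parent.to_dict(), sort_keys=True)
```

In this sketch, `annotations` passes straight through because its values are already JSON-native, while `metadata` and `subsegments` need conversion first; the empty `http` dict is omitted from the output entirely.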

aws_xray_sdk/core/utils/conversion.py

lupengamzn · 2021-03-22T22:56:36Z

Should we also add performance results with this PR? This would help customers understand the performance improvement in the next release due to this serialization logic change.

We can address this in a follow-up PR, since the pytest-benchmark library currently doesn't seem to support all of the Python platforms supported by the X-Ray Python SDK, so we will need a workaround.

aws/aws-xray-sdk-python#275

Removal of jsonpickle: aws/aws-xray-sdk-python#275 git-svn-id: file:///srv/repos/svn-community/svn@904452 9fca08f4-af9d-4005-b8df-a31f2cc04f65

heitorlessa · 2021-04-05T14:31:36Z

aws_xray_sdk/core/models/entity.py

+                    entity_dict[key] = subsegments
+                elif key == 'cause':
+                    entity_dict[key] = {}
+                    entity_dict[key]['working_directory'] = self.cause['working_directory']


@lupengamzn

I couldn't dive into this implementation yet to provide a fix, but this is causing a regression in the AWS Lambda Powertools Tracer feature, leading to a TypeError: string indices must be integers

aws-powertools/powertools-lambda-python#383
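
The thread does not identify the root cause of this regression. As a hedged illustration only: in Python, this kind of TypeError is raised when a string is indexed with a string key, which would happen with the pattern in the hunk above if `cause` were ever a plain string rather than a dict. That premise is an assumption, and both helpers below are hypothetical names, not SDK functions.

```python
def serialize_cause(cause):
    """Hypothetical helper mirroring the indexing pattern in the diff."""
    return {'working_directory': cause['working_directory']}


def serialize_cause_defensively(cause):
    """Hypothetical variant that only indexes into cause when it is a dict."""
    if isinstance(cause, dict):
        return {'working_directory': cause['working_directory']}
    # Any other value (for example, a string identifier) is passed through.
    return cause


error = None
try:
    # Indexing a str with a str key raises TypeError.
    serialize_cause('abc123')
except TypeError as exc:
    error = exc

passthrough = serialize_cause_defensively('abc123')
structured = serialize_cause_defensively({'working_directory': '/tmp'})
```

The exact wording of the error message varies between Python versions, so the sketch checks only the exception type.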

* Replace jsonpickle with json to serialize entity
* Added workflow to create release tag
* Pinned sqlalchemy and Flask-SQLAlchemy for unit test
* Fixed version
* Changed log to debug level
* Update logging
* Added empty line

Replace jsonpickle with json to serialize entity

3e8642b

Added workflow to create release tag

2d66ba3

lupengamzn requested review from srprash and bhautikpip March 19, 2021 16:35

lupengamzn added 2 commits March 19, 2021 11:09

Pinned sqlalchemy and Flask-SQLAlchemy for unit test

7abff72

Fixed version

827b83e

srprash requested changes Mar 22, 2021

lupengamzn added 2 commits March 22, 2021 10:19

Changed log to debug level

389eacc

Update logging

7bc1cc5

srprash approved these changes Mar 22, 2021

bhautikpip reviewed Mar 22, 2021

Added empty line

7fcf969

bhautikpip approved these changes Mar 22, 2021

lupengamzn merged commit 266bc82 into aws:master Mar 22, 2021

BastianZim added a commit to regro-cf-autotick-bot/aws-xray-sdk-feedstock that referenced this pull request Mar 24, 2021

Remove jsonpickle

87e6bf1

aws/aws-xray-sdk-python#275

heitorlessa mentioned this pull request Apr 5, 2021

Xray: TypeError: string indices must be integers aws-powertools/powertools-lambda-python#383

Closed

heitorlessa reviewed Apr 5, 2021
