-
Notifications
You must be signed in to change notification settings - Fork 315
load_table_from_dataframe produces incorrect results when used in list of dict #781
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@Lmmejia11 Thanks for the runnable example, it appears This was both with the latest BigQuery client ( However, when I downgraded Would it be possible to upgrade I will nevertheless try to find why this is the case, because Edit: Interesting, this seems to start happening when the number of dataframe rows is 513 or more, while with 512 rows or less it works just fine. On the other hand, Edit 2: After additional tests it actually seems that |
Since we want to keep the supported dependency ranges wide ( Also increasing priority, as possible silent data corruption is Bad™. |
Thanks! thankfully I can upgrade pyarrow |
As you said, I noticed the bug appeared when there are many lines. But it can also depend on the length of the lists. If I remember correctly, 1000 lines with list of length 5 also worked fine. I dont know if it could bug with less than 512 rows if the lists are longer or the values heavier, it might be linked to space. |
Thanks, I'll keep this in mind. It might actually be better to not try to narrow down the conditions when the bug could occur, but instead always issue a warning when a less recent |
Environment details
python --version
: Python 3.8.10pip --version
: pip 21.1.2google-cloud-bigquery
version:pip show google-cloud-bigquery
: google-cloud-bigquery==2.6.2, pyarrow==2.0.0Steps to reproduce
Code example
At least for this example, the 'value' attribute is transcribed in the correct order (first item has the smallest value, and it increases). The 'name' value was sampled with possibility of repetition. All table lines have the same 'name' values in the same order, and it can change if the code is reexecuted.
The text was updated successfully, but these errors were encountered: