Update Google Docs Meta Data #1546

github-actions · 2024-10-07T15:28:43Z

Updating Google Docs Meta Data

addition of "Signal Set" column
addition of two chng signals: 7dav_inpatient_covid and 7dav_outpatient_covid
a bunch of fixes to extended ascii apostrophes and quotation marks (replaced with regular ascii equivalents)

The signal name for "covid_naat_pct_positive_7dav" was lost in an apparent accidental paste, but i fixed it here w/ a commit to the branch PR, and manually in the spreadsheet

sonarqubecloud · 2024-10-08T20:18:07Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarCloud

melange396 · 2024-10-08T20:43:54Z

It turns out that there are still extended ascii chars in here (they are actually unicode chars)... They are findable by running:

from collections import defaultdict
highchars = defaultdict(int)
with open('db_signals.csv') as f:
    for line in f:
        for char in line:
            val = ord(char)
            if val>=127:
                highchars[val] += 1

the current db_signals.csv file gets the following results:

>>> highchars
defaultdict(<class 'int'>, {8220: 9, 8217: 30, 8221: 9})
>>> chr(8220)
'“'
>>> chr(8221)
'”'
>>> chr(8217)
'’'
>>>

I am not going to simply replace them in the file itself because of escaping concerns, so after merging this PR, i will replace them in the google spreadsheet and then run the csv sync utility (GH action) again.

melange396 · 2024-10-09T03:37:40Z

in case it helps someone in the future, heres some ugly code that i used to help compare the two versions of these files:

import csv

dev = []
with open('dev__db_signals.csv') as f:
    for r in csv.reader(f):
        dev.append(r)

new = []
with open('new__db_signals.csv') as f:
    for r in csv.reader(f):
        new.append(r)

def compare_rows(a, b):
    if len(a) != len(b):
        print("length mismatch")
    for i in range(len(a)):
        if a[i] != b[i]:
            print("    ", i, a[i].replace("\n", ""))
            print("    ", i, b[i].replace("\n", ""))

for i in range(len(dev)):
    offset = 0
    if i in (7,8):
        # skip added rows                                                                                                                                                                                          
        continue
    if i > 8:
        # account for added rows                                                                                                                                                                                   
        offset = 2
    n = new[i][:10] + new[i][11:] # skip added column @ index 10                                                                                                                                                   
    d = dev[i-offset]
    if n != d:
        print(i)
        compare_rows(n, d)

chore: update docs

aa30c1a

github-actions bot added the chore label Oct 7, 2024

github-actions bot assigned melange396 Oct 7, 2024

github-actions bot requested a review from melange396 October 7, 2024 15:28

fix lost 'covid_naat_pct_positive_7dav' signal name

97d6124

melange396 approved these changes Oct 8, 2024

View reviewed changes

melange396 merged commit a9a2535 into dev Oct 8, 2024
7 checks passed

melange396 deleted the bot/update-docs branch October 8, 2024 20:45

melange396 mentioned this pull request Oct 9, 2024

Properly decode UTF-8 from gsheet csv #1548

Merged

This was referenced Dec 5, 2024

Automate parts of metadata csv update comparison #1564

Open

Release Delphi Epidata 4.1.27 #1566

Merged

melange396 mentioned this pull request Feb 8, 2025

Update Google Docs Meta Data #1596

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update Google Docs Meta Data #1546

Update Google Docs Meta Data #1546

github-actions bot commented Oct 7, 2024 •

edited by melange396

Loading

sonarqubecloud bot commented Oct 8, 2024

melange396 commented Oct 8, 2024

melange396 commented Oct 9, 2024

Update Google Docs Meta Data #1546

Update Google Docs Meta Data #1546

Conversation

github-actions bot commented Oct 7, 2024 • edited by melange396 Loading

sonarqubecloud bot commented Oct 8, 2024

Quality Gate passed

melange396 commented Oct 8, 2024

melange396 commented Oct 9, 2024

github-actions bot commented Oct 7, 2024 •

edited by melange396

Loading