tested and working #3

Maheidem · 2025-04-08T20:45:21Z

Feel free to accept or not, but it's helping me a lot.

JordiNeil · 2025-04-09T03:10:33Z

main.py

+        cmd = [
+            "curl", "-s",
+            "-H", f"Authorization: Bearer {DATABRICKS_TOKEN}",
+            "-H", "Content-Type: application/json",
+            f"https://{DATABRICKS_HOST}/api/2.0/dbfs/list?path={path}"
+        ]


I think we can use the request function here, as in the other calls. I'm not sure why this one goes through subprocess and a curl command.
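As a rough sketch of what that could look like, the same DBFS list request can be built in-process instead of shelling out to curl. The function name `build_dbfs_list_request` and its arguments are made up for illustration and are not part of the PR. The sketch only builds the URL and headers, and it encodes the `path` query parameter, which the curl version interpolates unencoded.

```python
from urllib.parse import urlencode


def build_dbfs_list_request(host: str, token: str, path: str):
    """Hypothetical helper: return (url, headers) for a DBFS list call."""
    # urlencode escapes spaces, slashes and other reserved characters,
    # so unusual paths cannot break or alter the query string.
    query = urlencode({"path": path})
    url = f"https://{host}/api/2.0/dbfs/list?{query}"
    headers = {"Authorization": f"Bearer {token}"}
    return url, headers
```

The returned pair can then be passed to `requests.get(url, headers=headers)`, the same way the other calls in this diff do, with no subprocess involved.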

Databricks API call

JordiNeil · 2025-04-09T03:13:14Z

main.py

+        # Build URL with query parameters directly
+        encoded_path = urllib.parse.quote(path)
+        url = f"https://{DATABRICKS_HOST}/api/2.0/dbfs/read?path={encoded_path}&length={length}"
+
+        print(f"Requesting URL: {url}")
+        response = requests.get(url, headers=headers)
+        print(f"Status code: {response.status_code}")


Same here, we can reuse the databricks_api_request function
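To illustrate the idea, here is a minimal sketch of routing the read call through one shared helper. The signature of the project's `databricks_api_request` is not shown in this thread, so `api_get` below is a hypothetical stand-in with an assumed signature, and the host and token values are placeholders. The `http_get` parameter is also an assumption, added only so the sketch can be exercised with a stub.

```python
from typing import Any, Callable, Mapping, Optional

# Placeholder values for illustration; the real ones come from the project's config.
DATABRICKS_HOST = "example.cloud.databricks.com"
DATABRICKS_TOKEN = "dapi-example-token"


def api_get(
    endpoint: str,
    params: Optional[Mapping[str, Any]] = None,
    http_get: Optional[Callable[..., Any]] = None,
) -> Any:
    """Hypothetical stand-in for the project's databricks_api_request helper."""
    if http_get is None:
        # Imported lazily so the sketch can be run with a stub transport.
        import requests

        http_get = requests.get

    url = f"https://{DATABRICKS_HOST}/api/2.0/{endpoint}"
    headers = {"Authorization": f"Bearer {DATABRICKS_TOKEN}"}
    # Passing params separately lets the HTTP client do the URL encoding.
    return http_get(url, headers=headers, params=params)


def read_dbfs_file(path: str, length: int, http_get=None) -> Any:
    """The read call from the hunk above, expressed through the shared helper."""
    return api_get("dbfs/read", {"path": path, "length": length}, http_get=http_get)
```

With a helper like this, the header dictionary and URL prefix live in one place, and each endpoint function shrinks to its endpoint name and parameters.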

JordiNeil · 2025-04-09T03:13:51Z

main.py

+        headers = {
+            "Authorization": f"Bearer {DATABRICKS_TOKEN}",
+            "Content-Type": "application/json"
+        }
+
+        # DBFS file upload requires three steps:
+        # 1. Create a handle
+        create_url = f"https://{DATABRICKS_HOST}/api/2.0/dbfs/create"
+        create_data = {
+            "path": dbfs_path,
+            "overwrite": overwrite
+        }
+
+        create_response = requests.post(create_url, headers=headers, json=create_data)
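
Only step 1 is visible in this hunk. In the DBFS REST API, the handle returned by `create` is then used with `add-block` (base64-encoded data) and finally `close`. The sketch below covers just the middle step: splitting the content into `add-block` payloads. The function name and the 512 KiB block size are assumptions for illustration, not taken from the PR. The block size is chosen to stay under the API's documented 1 MB limit per block even after base64 expansion.

```python
import base64
from typing import Dict, Iterator, Union

# Assumed raw chunk size; base64 grows data by about a third,
# so 512 KiB stays well under the 1 MB per-block limit.
BLOCK_SIZE = 512 * 1024


def iter_add_block_payloads(
    handle: int, content: bytes, block_size: int = BLOCK_SIZE
) -> Iterator[Dict[str, Union[int, str]]]:
    """Yield one JSON-serializable add-block payload per chunk of content."""
    for start in range(0, len(content), block_size):
        chunk = content[start:start + block_size]
        yield {
            "handle": handle,
            "data": base64.b64encode(chunk).decode("ascii"),
        }
```

Each payload would be posted to `/api/2.0/dbfs/add-block` in order, followed by a single post to `/api/2.0/dbfs/close` with the same handle.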


JordiNeil · 2025-04-09T03:18:06Z

main.py

+        headers = {
+            "Authorization": f"Bearer {DATABRICKS_TOKEN}",
+            "Content-Type": "application/json"
+        }
+
+        # Build URL with query parameters directly
+        encoded_path = urllib.parse.quote(path)
+        url = f"https://{DATABRICKS_HOST}/api/2.0/workspace/list?path={encoded_path}"
+
+        print(f"Requesting URL: {url}")
+        response = requests.get(url, headers=headers)
+        print(f"Status code: {response.status_code}")
+        print(f"Response: {response.text}")


JordiNeil · 2025-04-09T03:18:28Z

main.py

+        headers = {
+            "Authorization": f"Bearer {DATABRICKS_TOKEN}",
+            "Content-Type": "application/json"
+        }
+
+        # Build request
+        export_url = f"https://{DATABRICKS_HOST}/api/2.0/workspace/export"
+        export_data = {
+            "path": path,
+            "format": format
+        }
+
+        response = requests.get(export_url, headers=headers, params=export_data)


JordiNeil · 2025-04-09T03:19:13Z

main.py

+        headers = {
+            "Authorization": f"Bearer {DATABRICKS_TOKEN}",
+            "Content-Type": "application/json"
+        }
+
+        # Encode the content as base64
+        import base64
+        content_bytes = content.encode("utf-8")
+        encoded_content = base64.b64encode(content_bytes).decode("utf-8")
+
+        # Build request
+        import_url = f"https://{DATABRICKS_HOST}/api/2.0/workspace/import"
+        import_data = {
+            "path": path,
+            "content": encoded_content,
+            "language": language,
+            "format": format,
+            "overwrite": overwrite
+        }
+
+        response = requests.post(import_url, headers=headers, json=import_data)
+        response.raise_for_status()


JordiNeil · 2025-04-09T03:57:21Z

Thank you so much for this contribution!

Let me know if you want to apply the changes, if not, I'll apply them later 😄

Maheidem · 2025-04-09T13:29:40Z

My honest take is that I actually did a lot of this with AI to get it working faster. I actually use it for my job.
If you see improvements to be made, please feel free.
I am coming at this from data science, not software engineering.

JordiNeil · 2025-04-09T14:25:06Z

Hey, yes, I noticed you used AI 😅
I'll take your suggestions and apply them with the fixes. I'll let you know when they're in the main branch.

Thank you! 🙏

tested and working

2ed3ab1

JordiNeil reviewed Apr 9, 2025


Maheidem and others added 4 commits April 9, 2025 10:14

udpating the docker run file

1086c38

add packaging to dependencies

47cf28f

clean up code

c8bd1f2

tested and working

a30fabc
