Gateway-in-server early prototype #1718

jvstme · 2024-09-23T18:54:51Z

This commit implements most of the reverse
proxying logic for gateway-in-server. It also
includes a dependency injection mechanism that
will allow the new gateway app to work with
different repo (storage) implementations in-server
and remotely.

For this prototype, gateway-in-server duplicates
remote gateways, i.e. all services are available
both on a remote gateway and on gateway-in-server.
This will be changed later.

Behind the GATEWAY_IN_SERVER feature flag.

Part of #1595

Note: unit tests will likely follow later, as it should be
easier to test with the in-memory repo implementation
that will be added later for remote gateways.

This commit implements most of the reverse proxying logic for gateway-in-server. It also includes a dependency injection mechanism that will allow the new gateway app to work with different repo (storage) implementations in-server and remotely. For this prototype, gateway-in-server duplicates remote gateways, i.e. all services are available both on a remote gateway and on gateway-in-server. This will be changed later. Behind the GATEWAY_IN_SERVER feature flag.

src/dstack/_internal/gateway/services/service_proxy.py

r4victor · 2024-09-24T13:16:40Z

src/dstack/_internal/gateway/services/service_proxy.py

+    if "Upgrade" in request.headers:
+        raise fastapi.exceptions.HTTPException(
+            fastapi.status.HTTP_400_BAD_REQUEST, "Upgrading connections is not supported"
+        )


Does it mean websocket is not supported?

Yes, it will be supported later

src/dstack/_internal/gateway/deps.py

r4victor · 2024-09-24T13:37:57Z

unit tests will likely follow later, as it should be
easier to test with the in-memory repo implementation
that will be added later for remote gateways.

Which isn't coming soon necessarily, right? I think it's crucial to cover proxy() since it adds non-trivial logic that's going to be a pain to maintain/develop without tests.

un-def · 2024-09-24T14:38:56Z

src/dstack/_internal/gateway/services/service_proxy.py

+
+async def stream_response(
+    response: httpx.Response, replica_id: str
+) -> AsyncGenerator[bytes, None]:


I think AsyncIterator[YieldType] could be used in place of AsyncGenerator[YieldType, None]

I subjectively prefer more generic return types in public interfaces and more specific return types in private ones. AsyncGenerator is more specific than AsyncIterator here

r4victor · 2024-09-25T05:14:46Z

src/dstack/_internal/server/app.py

@@ -166,6 +174,10 @@ def register_routes(app: FastAPI, ui: bool = True):
    app.include_router(gateways.router)
    app.include_router(volumes.root_router)
    app.include_router(volumes.project_router)
+    if FeatureFlags.GATEWAY_IN_SERVER:
+        app.include_router(
+            service_proxy.router, prefix="/gateway/services", tags=["gateway-in-server"]


/gateway/ in the path would be misleading when the server is proxying the requests. gateway-in-server should remain an internal concept.

Also, I remember we agreed that we need two separate concepts: gateways and service proxies, so I'd expect not to see "gateway-in-server" in the code as well. Currently, it's unclear what gateway is since the term is used for different things.

Gateway is a component of dstack used to access services and models. It can run in two modes: in-server and remotely. Each mode has it's own specifics, e.g. remote gateways need to be created, while gateway-in-server exists always, remote gateways can serve services at subdomains, while gateway-in-server can't, etc.

I think this is easier for users to understand than a new term. "Service proxy" isn't a suitable term because this component will also be used for LLMs, not just services. So it will need a more generic name, like "Proxy". And having two similar terms like "Gateway" and "Proxy" can cause confusion.

r4victor · 2024-09-25T05:35:59Z

Overall, I don't see a good reason to have a common repository interface (BaseGatewayRepo) and the injection mechanism unless we're going to move remote gateway from nginx to Python proxying (which I don't think is a good idea). The server and remote gateways would need different repo interfaces, although they might share some methods.

jvstme · 2024-09-26T01:13:19Z

I don't see a good reason to have a common repository interface (BaseGatewayRepo) and the injection mechanism

Even if we keep Nginx for remote gateways, two gateway modes will still have much shared logic that will benefit from a common repo interface: retrieving model mappings for the OpenAI endpoint, service and replica details for restarting SSH tunnels after gateway restarts, user tokens for authentication, backend credentials for #1631, and any other data for future features.

Some repo methods will be useless for one of the gateway operation modes, so these methods will have to be empty in the respective repo implementation. This may be a complication, but the alternatives are hard repo dependencies in gateway code with lots of if-else statements, or completely different applications for remote gateways and gateway-in-server. Both seem like a greater complication to me, so I'd rather go with BaseGatewayRepo. If it turns out to be inconvenient, it won't be difficult to remove it at a later stage.

jvstme requested a review from r4victor September 23, 2024 19:03

r4victor reviewed Sep 24, 2024

View reviewed changes

src/dstack/_internal/gateway/services/service_proxy.py Outdated Show resolved Hide resolved

r4victor reviewed Sep 24, 2024

View reviewed changes

src/dstack/_internal/gateway/deps.py Show resolved Hide resolved

un-def reviewed Sep 24, 2024

View reviewed changes

r4victor reviewed Sep 25, 2024

View reviewed changes

Add tests and other fixes

4cd5911

jvstme force-pushed the issue_1595_proxy branch from 41751db to 4cd5911 Compare September 26, 2024 01:34

jvstme requested a review from r4victor September 26, 2024 01:44

r4victor approved these changes Sep 26, 2024

View reviewed changes

jvstme merged commit 3684ef1 into master Sep 26, 2024
46 checks passed

jvstme deleted the issue_1595_proxy branch September 26, 2024 13:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Gateway-in-server early prototype #1718

Gateway-in-server early prototype #1718

Uh oh!

jvstme commented Sep 23, 2024

Uh oh!

Uh oh!

r4victor Sep 24, 2024

Uh oh!

jvstme Sep 25, 2024

Uh oh!

Uh oh!

r4victor commented Sep 24, 2024

Uh oh!

un-def Sep 24, 2024

Uh oh!

jvstme Sep 26, 2024

Uh oh!

r4victor Sep 25, 2024

Uh oh!

r4victor Sep 25, 2024

Uh oh!

jvstme Sep 25, 2024

Uh oh!

r4victor commented Sep 25, 2024

Uh oh!

jvstme commented Sep 26, 2024

Uh oh!

Uh oh!

Uh oh!

Gateway-in-server early prototype #1718

Gateway-in-server early prototype #1718

Uh oh!

Conversation

jvstme commented Sep 23, 2024

Uh oh!

Uh oh!

r4victor Sep 24, 2024

Choose a reason for hiding this comment

Uh oh!

jvstme Sep 25, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

r4victor commented Sep 24, 2024

Uh oh!

un-def Sep 24, 2024

Choose a reason for hiding this comment

Uh oh!

jvstme Sep 26, 2024

Choose a reason for hiding this comment

Uh oh!

r4victor Sep 25, 2024

Choose a reason for hiding this comment

Uh oh!

r4victor Sep 25, 2024

Choose a reason for hiding this comment

Uh oh!

jvstme Sep 25, 2024

Choose a reason for hiding this comment

Uh oh!

r4victor commented Sep 25, 2024

Uh oh!

jvstme commented Sep 26, 2024

Uh oh!

Uh oh!

Uh oh!