Opinions on running envpool on a dedicated simulator server with e.g. REST API #300

harpone · 2024-04-22T19:02:13Z

Maximizing GPU utilization is usually pretty hard in RL, even with parallel environments, so I'm thinking of running a parallel simulator on a separate CPU only server with max possible number of CPUs and do the actual training on a GPU node (within same placement group + all possible networking optimizations) in whatever cloud.

Could this work or is the extra network latency too much to make this feasible?

Trinkle23897 · 2024-04-23T03:52:40Z

In that case you should use python asyncio

harpone · 2024-04-23T05:24:00Z

In that case you should use python asyncio

yeah, maybe, but I'm more concerned about the actual feasibility in terms of latency etc. I would imagine this would be more common practice if it's feasible, but haven't been able to find any references...

mavenlin · 2024-04-30T00:48:20Z

In that case you should use python asyncio

yeah, maybe, but I'm more concerned about the actual feasibility in terms of latency etc. I would imagine this would be more common practice if it's feasible, but haven't been able to find any references...

@harpone I believe this is feasible, we have a customized game implementation based on GRPC internally. But the code written at that time is no longer compatible with the current public version of envpool. And also it is async API only.

The basic idea is to initiate a GRPC server at the GPU server, and many GRPC clients at CPU cluster. The GRPC server asynchronously writes the StateBufferQueue and sends out the actions.

I can provide some help if you need this functionality and would like to implement on top of envpool.

harpone assigned Trinkle23897 Apr 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Opinions on running envpool on a dedicated simulator server with e.g. REST API #300

Opinions on running envpool on a dedicated simulator server with e.g. REST API #300

harpone commented Apr 22, 2024

Trinkle23897 commented Apr 23, 2024

harpone commented Apr 23, 2024

mavenlin commented Apr 30, 2024

Opinions on running envpool on a dedicated simulator server with e.g. REST API #300

Opinions on running envpool on a dedicated simulator server with e.g. REST API #300

Comments

harpone commented Apr 22, 2024

Trinkle23897 commented Apr 23, 2024

harpone commented Apr 23, 2024

mavenlin commented Apr 30, 2024