-
Notifications
You must be signed in to change notification settings - Fork 1.7k
[feature-request] interactively add runs without stopping tensorboard #1708
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hi @Hafplo! Thanks for reaching out to us. TensorBoard finds runs in any descendant directory of the logdir. If you It sounds like you’re saying that pointing TensorBoard at your As a workaround in the meantime, you could create a symlink directory
If you want to add a new logdir at runtime, just create a new symlink:
When you’re done, you can just delete the directory (
Does this help? |
I want to clarify this, just because it’s a bit tricky and intrinsically |
@wchargin On the same note, is there a way to reduce the size of our events file (perhaps by spliting them and taking only the latest steps)? |
We don’t currently offer this functionality. As you may know, when In the long term, we want to provide more flexibility in this area, |
@wchargin Thank you for your quick responses and insights.
We'll be looking out for these updates. Regarding your workaround with symlinks: We are using GCS (Google Cloud Storage) to store our models and event files. According to this answer, symlinks are unavailable for cloud objects. Following your logic, I thought of creating a dedicated bucket for "current runs" and using gsutil to copy in and out of it. This solution might work if we implement a small tool to handle it. But it sounds cumbersome to go over all this effort just to make use of Tensorboard's finding "descendant directory of the logdir". |
Understood. Yes, it’s also my understanding that GCS does not support
Agreed. It looks like we don’t really have a good solution for you at I’ll keep this feature request open. Thanks again. |
We are using tensorboard all the time to monitor our training sessions and to compare between our models.
Since it takes a lot of time to load and read large events files, it would be very helpful to be able to interactively load and remove runs.
Currently the only we know is to terminate tensorboard and run it again with new "--logdir" argument.
Loading all the runs in advance is not an option since:
Thank you.
The text was updated successfully, but these errors were encountered: