[RFC] Tuner Revamp #11012
Comments
Not sure if there's an easy workaround currently, but I'd love an easy way of combining
Hey @rohitgr7, IMO, subclassing the Trainer is not the way forward. I would rather work on better snapshotting and restoring of the Trainer state, and extend support to validation, test and predict. Furthermore, Flash fine-tuning relies on trainer.fit and simply adds a callback internally. If the Tuner refactor were flexible enough to run on any Trainer entry point, fine-tuning could also be supported.
I agree with @tchaton. Personally, I would prefer having them as callbacks.
Thank you @tchaton and @justusschock for your comments. Making them callbacks looks like a reasonable solution here. I'll start with that for now, to see whether the tuner features work as callbacks. I can't think of a better snapshotting mechanism yet, though. Would love to discuss that more :)
Hey @lukasschmit!
Proposed refactor
Issues
PS: please add more issues here if you have any regarding the tuner.
Possible solutions
Well, this solution could possibly solve 1 & 2, but possibly can't be configured to solve 4.
Another solution, proposed by @Borda, is to make them callbacks, so that users can easily configure them independently. This can help resolve 4, but might not resolve 1 & 2.
Another solution @Borda and @SkafteNicki suggested, for now, is to move lr_finder to bolts and experiment there, and to improve scale_batch_size within Lightning. But possibly it can't guarantee to solve 4.
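To make the callback option a bit more concrete, here is a rough, framework-free sketch of what a batch-size finder could look like as a callback that snapshots and restores trainer state around its trial runs. Everything in it is hypothetical and for illustration only: `ToyTrainer`, `Callback`, `BatchSizeFinder`, `run_steps` and the simulated memory limit are made-up stand-ins, not the Lightning Trainer or its callback API.

```python
from dataclasses import dataclass, field


class Callback:
    """Hypothetical hook interface (not the Lightning Callback class)."""

    def on_fit_start(self, trainer):
        pass


@dataclass
class ToyTrainer:
    """Stand-in trainer used only for this sketch."""

    callbacks: list = field(default_factory=list)
    batch_size: int = 2
    global_step: int = 0
    max_batch_size_that_fits: int = 64  # simulated memory limit (assumption)

    def run_steps(self, n):
        # Pretend that batches above the limit run out of memory.
        if self.batch_size > self.max_batch_size_that_fits:
            raise MemoryError("simulated out-of-memory")
        self.global_step += n

    def fit(self):
        for cb in self.callbacks:
            cb.on_fit_start(self)
        self.run_steps(10)  # the "real" training run


class BatchSizeFinder(Callback):
    """Doubles the batch size until a trial run fails, then restores state."""

    def __init__(self, trial_steps=3, max_trials=10):
        self.trial_steps = trial_steps
        self.max_trials = max_trials

    def on_fit_start(self, trainer):
        # Snapshot the state that the trial runs will modify.
        snapshot = {"global_step": trainer.global_step}
        best = trainer.batch_size
        try:
            for _ in range(self.max_trials):
                try:
                    trainer.run_steps(self.trial_steps)
                except MemoryError:
                    break
                best = trainer.batch_size
                trainer.batch_size *= 2
        finally:
            # Restore the snapshot and keep only the tuned value.
            trainer.global_step = snapshot["global_step"]
            trainer.batch_size = best


trainer = ToyTrainer(callbacks=[BatchSizeFinder()])
trainer.fit()
print(trainer.batch_size, trainer.global_step)
```

Because the finder is just another entry in `callbacks`, a user could configure it (or a similar learning-rate finder) independently of the others, and the same hook pattern could in principle be attached to other entry points such as validate, test or predict.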
Additional context
Other tuner issues we currently need to address:
#9625
#10560
#10557
Thanks to @Borda @SkafteNicki @ethanwharris @akihironitta for helping out with the discussion and possible solutions.
cc @justusschock @awaelchli @akihironitta @Borda