[RFC] Support passing pluggable Accelerators to Trainer #10687

Closed
kaushikb11 opened this issue Nov 23, 2021 · 3 comments
Assignees: kaushikb11
Labels: accelerator, discussion (In a discussion stage), feature (Is an improvement or enhancement), priority: 1 (Medium priority task)

Comments

@kaushikb11
Contributor

kaushikb11 commented Nov 23, 2021

🚀 Feature

With the present Lightning Accelerator design, new accelerators cannot be passed to the Trainer unless they are part of the built-in Lightning accelerators. For example, the following is not possible today:

trainer = Trainer(accelerator=NewSOTAAccelerator(), devices=4)
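
At the moment, the accelerator argument only accepts Lightning's built-in accelerator flags (or instances of the built-in accelerator classes), for example:

trainer = Trainer(accelerator="gpu", devices=4)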

There is a lot of innovation happening in the space of ML accelerators, and the list will continue to grow. We should enable this functionality and make it easier for users to experiment with new accelerators using Lightning.

This proposal also aims to clean up the accelerator connector by moving hardware-specific logic into the accelerators themselves.

For example, the HPUAccelerator PR, which is still in development, adds support for Habana's Gaudi accelerator. Based on the above points, its Accelerator interface would look like this:

from typing import Any, Dict, List, Union

import torch

from pytorch_lightning.accelerators import Accelerator


class HPUAccelerator(Accelerator):
    """Accelerator for HPU devices."""

    @property
    def accelerator_type(self) -> str:
        """Accelerator type."""
        return "hpu"

    @staticmethod
    def parse_devices(devices: Union[int, str, List[int]]) -> int:
        """Parse the `devices` flag into the number of HPU devices."""
        # Include the HPU device parsing logic here
        return devices

    @staticmethod
    def auto_device_count() -> int:
        """Get the number of HPU devices when `devices="auto"`."""
        # `habana` stands in for the Habana device API in this sketch
        return habana.device_count()

    @staticmethod
    def get_parallel_devices(devices: int) -> List[torch.device]:
        """Gets parallel devices for the given number of HPU devices."""
        # Logic moved here from the accelerator connector
        return [torch.device("hpu")] * devices

    def get_device_stats(self, device: Union[str, torch.device]) -> Dict[str, Any]:
        """Gets stats for the given HPU device."""
        return {}

After defining HPUAccelerator, the user could pass it to the Trainer even though it is not one of the built-in Lightning accelerators:

trainer = Trainer(accelerator=HPUAccelerator(), devices=4, strategy=HPUPlugin())
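
The HPUPlugin strategy referenced above is not defined in this proposal; a minimal sketch, assuming it can be built on Lightning's existing SingleDevicePlugin (the base class and constructor here are assumptions for illustration, not part of this RFC), might look like:

import torch

from pytorch_lightning.plugins import SingleDevicePlugin


class HPUPlugin(SingleDevicePlugin):
    """Hypothetical single-device training plugin for a Gaudi/HPU device."""

    def __init__(self) -> None:
        # Assumption: SingleDevicePlugin takes the root torch.device to train on.
        super().__init__(torch.device("hpu"))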

cc @Borda @tchaton @rohitgr7 @akihironitta

@kaushikb11 kaushikb11 added feature Is an improvement or enhancement accelerator labels Nov 23, 2021
@kaushikb11 kaushikb11 self-assigned this Nov 23, 2021
@kaushikb11 kaushikb11 added the discussion In a discussion stage label Nov 23, 2021
@SeanNaren
Contributor

I love this! However, it's a double-edged sword; we should not detract from the importance of bringing these accelerators/strategies into core Lightning.

I.e., if we have supporters/maintainers for the Habana accelerator, we should bring it into Lightning.

@kaushikb11
Contributor Author

Yes, agreed! The Habana accelerator was just an example. The aim of the proposal is to add flexibility for users so they are not limited to our built-in options.

At the same time, we should aim to support as many accelerators/strategies as possible in core Lightning for the community.

@kaushikb11
Contributor Author

This is now supported via #12030.
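
For readers landing here later, a minimal usage sketch of the functionality added there (CustomAccelerator is a hypothetical user-defined subclass implementing hooks like the HPUAccelerator example above; the exact hook set required by the merged API may differ):

from pytorch_lightning import Trainer

trainer = Trainer(accelerator=CustomAccelerator(), devices=2)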
