feat: rmsnorm lowering #3440

bowang007 · 2025-03-12T18:05:34Z

RMSNORM lowering pass

Checklist:

My code follows the style guidelines of this project (You can use the linters)
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas and hacks
I have made corresponding changes to the documentation
I have added tests to verify my fix or my feature
New and existing unit tests pass locally with my changes
I have added the relevant labels to my PR in so that relevant reviewers are notified

py/torch_tensorrt/dynamo/lowering/passes/replace_rmsnorm.py

py/torch_tensorrt/dynamo/lowering/passes/_aten_lowering_pass.py

zewenli98 · 2025-04-11T01:14:05Z

py/torch_tensorrt/dynamo/conversion/plugins/_generate_plugin.py

+        shape_calc_fns = [None] * output.ndim
+
+        for i in range(output.ndim):
+            input_node_expr = input_node_expr = list(


zewenli98 · 2025-04-11T01:14:59Z

py/torch_tensorrt/dynamo/conversion/plugins/_generate_plugin.py

-        shape_calc_fns = [None] * args[0].ndim
-        for i in range(args[0].ndim):
-            input_node_expr = [syms_arg[i].node.expr for syms_arg in syms_args]
+        shape_calc_fns = [None] * output.ndim


It looks like fake_mode above was defined twice.

narendasan · 2025-04-11T01:36:55Z

py/torch_tensorrt/dynamo/conversion/plugins/_generate_plugin_converter.py

        tensor_inputs = plugin.input_tensor_names
        tensor_args = args[0 : len(tensor_inputs)]
+
+        random_id = random.randint(0, 10000)


Use UUID (as short as possible) instead of random for now

narendasan · 2025-04-11T01:39:35Z

examples/dynamo/llama2_flashinfer_rmsnorm.py

+torch_tensorrt.dynamo.conversion.plugins.custom_op(
+    "flashinfer::rmsnorm", supports_dynamic_shapes=True
+)
+


TODO: After merge, extend this example to include an aot_impl

There needs to be a modification to the plugin_converter generation that figure out if you can use aot_impl

rmsnorm lowering

a29c757

facebook-github-bot added the cla signed label Mar 12, 2025

github-actions bot added component: lowering Issues re: The lowering / preprocessing passes component: api [Python] Issues re: Python API component: dynamo Issues relating to the `torch.compile` or `torch._dynamo.export` paths labels Mar 12, 2025

github-actions bot requested a review from gs-olive March 12, 2025 18:05

This comment was marked as outdated.

Sign in to view

narendasan reviewed Mar 13, 2025

View reviewed changes

py/torch_tensorrt/dynamo/lowering/passes/replace_rmsnorm.py Outdated Show resolved Hide resolved

narendasan reviewed Mar 13, 2025

View reviewed changes

py/torch_tensorrt/dynamo/lowering/passes/_aten_lowering_pass.py Outdated Show resolved Hide resolved

bowang007 added 3 commits March 14, 2025 04:04

update

8612f63

update

c912fef

update

cc199f1

github-actions bot added component: conversion Issues re: Conversion stage and removed component: lowering Issues re: The lowering / preprocessing passes labels Apr 10, 2025

bowang007 marked this pull request as ready for review April 10, 2025 23:25

bowang007 requested review from peri044 and zewenli98 and removed request for gs-olive April 10, 2025 23:26

zewenli98 reviewed Apr 11, 2025

View reviewed changes

narendasan added the needs-release-cherrypick label Apr 11, 2025

github-actions bot requested a review from narendasan April 11, 2025 01:26

narendasan reviewed Apr 11, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: rmsnorm lowering #3440

feat: rmsnorm lowering #3440

bowang007 commented Mar 12, 2025

This comment was marked as outdated.

zewenli98 Apr 11, 2025

zewenli98 Apr 11, 2025

narendasan Apr 11, 2025

narendasan Apr 11, 2025

narendasan Apr 11, 2025

feat: rmsnorm lowering #3440

Are you sure you want to change the base?

feat: rmsnorm lowering #3440

Conversation

bowang007 commented Mar 12, 2025

Checklist:

This comment was marked as outdated.

zewenli98 Apr 11, 2025

Choose a reason for hiding this comment

zewenli98 Apr 11, 2025

Choose a reason for hiding this comment

narendasan Apr 11, 2025

Choose a reason for hiding this comment

narendasan Apr 11, 2025

Choose a reason for hiding this comment

narendasan Apr 11, 2025

Choose a reason for hiding this comment