|
3 | 3 | Dates are in YYYY-MM-DD format.
|
4 | 4 |
|
5 | 5 |
|
| 6 | +## v0.33.0 (2021-09-16) |
| 7 | +### Added |
| 8 | +- Added various examples, a [CLI User Guide](polygraphy/tools/) and [directory for how-to guides](./how-to). |
| 9 | +- Added an experimental `template trt-config` tool to generate template scripts that create TensorRT builder configurations. |
| 10 | +- Added `--hide-fail-output` to make `debug` subtools suppress output from failed iterations. |
| 11 | +- Added experimental support for DLA. |
| 12 | +- Added a `data to-input` tool that can combine inputs/outputs created by `--save-inputs`/`--save-outputs`. |
| 13 | + The resulting file is compatible with `--load-inputs`. |
| 14 | + |
| 15 | +### Changed |
| 16 | +- Updated `debug` subtools to show captured output on failed iterations. |
| 17 | +- The logger will now emit all `CRITICAL` messages to `stderr` instead of `stdout`. |
| 18 | +- Renamed `CompareFunc.basic_compare_func` to `CompareFunc.simple`. The old name is preserved as an alias. |
| 19 | +- The `--good` and `--bad` arguments in `diff-tactics` can now also accept single files instead of directories. |
| 20 | + |
| 21 | +### Fixed |
| 22 | +- Fixed a bug where `debug reduce` would crash when ONNX models included `Constant` nodes whose outputs |
| 23 | + needed to be marked as model outputs. |
| 24 | + |
| 25 | + |
| 26 | +## v0.32.0 (2021-08-10) |
| 27 | +### Added |
| 28 | +- Added support for `K`, `M`, and `G` suffixes to CLI arguments that expect a number of bytes (e.g. `--workspace`). |
| 29 | + These correspond to `KiB`, `MiB`, and `GiB` respectively. |
| 30 | + For example, `--workspace=16M` is equivalent to `--workspace=16777216`. |
| 31 | +- Added a `copy_outputs_to_host` parameter in `TrtRunner.infer()`, which, when set to `False`, will cause the runner |
| 32 | + to return `DeviceView`s instead of NumPy arrays for inference outputs. This allows us to avoid a |
| 33 | + device-to-host and host-to-device copy if we want outputs to remain on the device. |
| 34 | +- Added a `view()` method to `DeviceArray`s to create read-only `DeviceView`s over their data. |
| 35 | +- Added a `PluginRefRunner` which provides CPU reference implementations for TensorRT plugins |
| 36 | + and a corresponding `--pluginref` runner option in `polygraphy run`. |
| 37 | + |
| 38 | +### Changed |
| 39 | +- Marked old shape syntax (`<name>,dim0xdim1x...xdimN,<dtype>`) as deprecated since it leads to ambiguity when |
| 40 | + parsing shapes including named dynamic dimensions. |
| 41 | + |
| 42 | + For example, compare: |
| 43 | + ``` |
| 44 | + --input-shapes input0,xxyxz |
| 45 | + ``` |
| 46 | +
|
| 47 | + and: |
| 48 | + ``` |
| 49 | + --input-shapes input0:[x,y,z] |
| 50 | + ``` |
| 51 | +
|
| 52 | + For now, the old syntax continues to work for shapes without named dimensions, |
| 53 | + but it will be removed in a future version of Polygraphy. |
| 54 | +
|
| 55 | + The newer syntax, which was originally introduced in Polygraphy 0.25.0, |
| 56 | + uses the list syntax already present in other parts of Polygraphy. |
| 57 | + For example, `--val-range [0,1]` in `run` and `--attrs axes=[0,1]` in `surgeon insert` use the same syntax. |
| 58 | +- Made several performance improvements in the Polygraphy CUDA wrapper. |
| 59 | +- Added a loud warning when the deprecated `--int-min`/`--int-max` or `--float-min`/`--float-max` options are used. |
| 60 | + These are superseded by `--val-range` which allows you to specify data ranges on a per-input basis. |
| 61 | +
|
| 62 | +### Removed |
| 63 | +- Removed various deprecated aliases: `ModifyOnnx`, `SessionFromOnnxBytes`, `ModifyNetwork`, `ModifyGraph` |
| 64 | +- Removed the `to-json` tool which was used to convert Pickled data generated by Polygraphy 0.26.1 and older to JSON. |
| 65 | + Polygraphy 0.27.0 and later only support reading and writing data in JSON format. |
| 66 | +- Removed deprecated legacy submodule `polygraphy.util.misc` which was just an alias for `polygraphy.util`. |
| 67 | +
|
| 68 | +
|
6 | 69 | ## v0.31.1 (2021-07-16)
|
7 | 70 | ### Changed
|
8 | 71 | - Improved the quality of several examples and added information on how to load serialized TensorRT engines
|
|
0 commit comments