Version 2.23.0

aimetci released this 28 Jan 18:36

f92bf54

Bug fixes and Improvements
- ONNX
  - Disable per-channel quantization for ConvTranspose ops (9395e32)
  - New top level API for configuring parameter quantization type (a1c197d)
- Torch
  - Enable Torch Dynamo ONNX export (59e0125)
- Common
  - Enable per-channel matmul quantization in config files (7137849)
  - LLM quantization recipes in docs (6561f0e)
  - Fix CUDA discrepancies against CPU wheel (01e7422)

Assets 7