Skip to content

Version 2.23.0

Choose a tag to compare

@aimetci aimetci released this 28 Jan 18:36
  • Bug fixes and Improvements
    • ONNX

      • Disable per-channel quantization for ConvTranspose ops (9395e32)
      • New top level API for configuring parameter quantization type (a1c197d)
    • Torch

      • Enable Torch Dynamo ONNX export (59e0125)
    • Common

      • Enable per-channel matmul quantization in config files (7137849)
      • LLM quantization recipes in docs (6561f0e)
      • Fix CUDA discrepancies against CPU wheel (01e7422)