We are using CMSIS-DSP version 1.14.3 to generate optimized code through the MATLAB workflow, targeting ARM Cortex-A
family devices with a code replacement library (CRL). We have enabled the 'ARM_MATH_NEON' flag to use the NEON SIMD instruction set.
However, the output of the code generated with this flag does not match the simulation results.
When we instead use the default C implementation of the 'arm_biquad_cascade_df2T_f32()' function from CMSIS-DSP,
the results do match the simulation. Please let us know if there are any updates regarding this issue.
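
To make the mismatch easier to quantify, here is a minimal sketch of a plain-C reference for a Direct Form II Transposed biquad cascade, plus a helper that reports the largest absolute difference between two output buffers. Both function names ('ref_biquad_df2t_f32' and 'max_abs_diff_f32') are hypothetical and are not part of CMSIS-DSP. The sketch assumes the scalar CMSIS-DSP coefficient layout of five values per stage, {b0, b1, b2, a1, a2}, with two state values per stage. It is meant only as a debugging aid, not as the library's implementation.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical reference (not a CMSIS-DSP function): scalar DF2T biquad cascade.
 * coeffs: 5 floats per stage, ordered {b0, b1, b2, a1, a2}. The feedback
 *         terms are ADDED, i.e. y[n] = b0*x[n] + ... + a1*y[n-1] + a2*y[n-2].
 * state:  2 floats per stage {d1, d2}; zero them before the first call. */
void ref_biquad_df2t_f32(size_t num_stages, const float *coeffs, float *state,
                         const float *src, float *dst, size_t n)
{
    assert(num_stages > 0);
    for (size_t i = 0; i < n; ++i) {
        float x = src[i];
        for (size_t s = 0; s < num_stages; ++s) {
            const float *c = &coeffs[5 * s];
            float *d = &state[2 * s];
            float y = c[0] * x + d[0];          /* y  = b0*x + d1          */
            d[0] = c[1] * x + c[3] * y + d[1];  /* d1 = b1*x + a1*y + d2   */
            d[1] = c[2] * x + c[4] * y;         /* d2 = b2*x + a2*y        */
            x = y;                              /* feed the next stage     */
        }
        dst[i] = x;
    }
}

/* Hypothetical helper: largest absolute element-wise difference. */
float max_abs_diff_f32(const float *a, const float *b, size_t n)
{
    float worst = 0.0f;
    for (size_t i = 0; i < n; ++i) {
        float e = a[i] - b[i];
        if (e < 0.0f) {
            e = -e;
        }
        if (e > worst) {
            worst = e;
        }
    }
    return worst;
}
```

Running the same input through this reference and through the NEON build, then calling 'max_abs_diff_f32' on the two outputs, should show whether the difference is at rounding level (which can happen when SIMD code reorders floating-point operations) or is large enough to point to a setup problem such as the coefficient or state layout.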