Hi,
I listened to the demo audio at 3.00 kbps, and the ESC result doesn't seem as satisfactory as the DAC result. Could you provide a fair comparison in which the two models have a similar number of parameters?
For example, instead of reducing the model size by a factor of nine, could you compare the results at the same model size?