Q: KokoroWavSynthesizer - is it possible to know the boundaries of generated speech?

I am building a system where the user can interrupt TTS playback, so the system needs to know which part of the text was spoken and which was not. Right now I am using `KokoroWavSynthesizer.Synthesize` with its `OnProgress` callback. However, the callback provides nothing besides the generated audio. Does Kokoro provide this boundary information at a lower level? If so, can KokoroSharp expose it?