I am building a system where user can interrupt TTS. The system will need to know what was and what was not spoken. Right now I am using KokoroWavSynthesizer.Synthesize and its OnProgress callback. It, however, does not provide any information besides the generated audio. Does Kokoro provide this information at a lower level? Can KokoroSharp expose it?