Skip to content

Releases: uezo/aiavatarkit

v0.8.8

11 Feb 19:08
6d7e6ad

Choose a tag to compare

What's Changed

  • Add SileroStreamSpeechDetector with real-time speech segment recognition #254
  • Invoke on_speech_detected on max_duration #255
  • Add WebSocket-based streaming STT adapter with VAD support #256
  • Allow multiple callbacks for VAD events #257
  • Add 'voiced' VAD event #258
  • Add STT WebSocket example (client + server) #259
  • Update for v0.8.8🐙 #260

Full Changelog: v0.8.7...v0.8.8

v0.8.7

08 Feb 17:06
eeb5c65

Choose a tag to compare

What's Changed

  • Compile LANGUAGE_PATTERN for language parsing #247
  • Add local gateway mode for SpeechGatewaySpeechSynthesizer #248
  • Skip Kana conversion for non-Japanese text #249
  • Add add_local_gateway method for easier TTS gateway setup #250
  • Improve Admin Panel authentication and proxy support #251
  • Unify default SQLite database filename to aiavatar.db #252
  • Update for v0.8.7 #253

Full Changelog: v0.8.6...v0.8.7

v0.8.6

07 Feb 06:19
b76ac49

Choose a tag to compare

What's Changed

  • Add admin panel for monitoring, control, and evaluation #230
  • Add user create and delete functionality to admin panel #231
  • Add AdminPanel for dynamic adapter registration #232
  • Fix preroll buffer size calculation for accurate audio capture #233
  • Add final chunk to the response from control/perform API #234
  • Add pcm_to_wave conversion method #235
  • Include dynamic config fields for Admin Panel #236
  • Add error tracking to performance records and Admin Panel #237
  • Add Processing phase metrics and detailed tool call logging #238
  • Display tool call status in websocket UI #239
  • Fix SQLite metrics query for short time periods #240
  • Add configurable dummy components for STT/LLM/TTS #241
  • Add validate_request hook for filtering requests #242
  • Add early text validation for AzureStreamSpeechDetector #243
  • Add per-session speech speed control for personalized user experience #244
  • Fix voice_text returning None when wakeword is not matched #245
  • Update for v0.8.6 #246

Full Changelog: v0.8.5...v0.8.6

v0.8.5

31 Jan 07:27
8db12e3

Choose a tag to compare

What's Changed

  • Add centralized PostgreSQL pool provider and integration #217
  • Refactor CharacterService and add SQLite support #218
  • Add user management support to character module #219
  • Add callback hooks and local server adapter #220
  • Refactor local client to use AIAvatarLocalServer #221
  • Add bind_character for simplified character integration #222
  • Fix response stopping logic based on awake state #223
  • Fix empty system prompt being added to messages #224
  • Add on_error handler for LLM API error responses #225
  • Add Azure OpenAI client support in CharacterService #226
  • Log speech synthesis only in debug mode #227
  • Add customizable chat print hook to LLM base class #228
  • Update for v0.8.5 #229

Full Changelog: v0.8.4...v0.8.5

v0.8.4

17 Jan 18:46
87fd0e5

Choose a tag to compare

What's Changed

  • Add follow_redirects option to SpeechSynthesizer #210
  • Add MotionPNGTuber example #211
  • Add kana_map caching to AlphabetToKanaPreprocessor #212
  • Add GetDiaryTool for retrieving diary and schedule #213
  • Add batch activity range generation to service #214
  • Add block_barge_in to prevent interruption during critical responses #215
  • Update for v0.8.4 #216

Full Changelog: v0.8.3...v0.8.4

v0.8.3

04 Jan 18:55
7d4314c

Choose a tag to compare

What's Changed

  • Refactor system prompt handling to support async functions #203
  • Fix tool call result handling in MCPBase #204
  • Add async lock to Postgres pool initialization #205
  • Add character management feature #206
  • Add MemorySearchTool and update memory client usage #207
  • Add AudioConverter and async response parsing to TTS #208
  • Update for v0.8.3 #209

Full Changelog: v0.8.2...v0.8.3

v0.8.2

01 Jan 14:43
31909fb

Choose a tag to compare

What's Changed

  • Add invoke queue support for sequential request handling #201
  • Update for v0.8.2🗻🦅🍆 #202

Full Changelog: v0.8.1...v0.8.2

v0.8.1

31 Dec 17:46
05d2ce3

Choose a tag to compare

What's Changed

  • Remove Silero VAD dependency from AzureStreamSpeechDetector #194
  • Add allow_merge flag to request handling pipeline #195
  • Add camera integration and UI enhancements to WebSocket demo #196
  • Remove pgvector dependency and handle vector conversion manually #197
  • Add built-in language tag parsing in pipeline #198
  • Improve control tag handling in response splitting #199
  • 🥳 New year update for v0.8.1 🎍 #200

Full Changelog: v0.8.0...v0.8.1

v0.8.0

28 Dec 17:03
a71717a

Choose a tag to compare

What's Changed

  • Add README for WebSocket example #183
  • Stop AI speech immediately when user starts speaking🎄🎁 #185
  • Add AzureStreamSpeechDetector with STT #186
  • Refactor SileroSpeechDetector and improve tests #187
  • Refactor PostgreSQL adapters to use asyncpg #188
  • Notify client on request acceptance in pipeline #189
  • Add on_speech_detecting callback to AzureStreamSpeechDetector #190
  • Refactor microphone mute logic and response handling #191
  • Set SileroSpeechDetector as the default VAD #192
  • Update for v0.8.0 #193

Full Changelog: v0.7.21...v0.8.0

v0.7.21

21 Dec 10:17
3bdf11e

Choose a tag to compare

What's Changed

  • Add timestamp insertion feature #176
  • Add LINE Bot adapter with session management #177
  • Fix timestamp_inserted_at handling in session state managers #178
  • Refactor test assertions and update test messages #179
  • Add timestamp options to server adapters #180
  • Add LipSync and Blink to WebSocket example #181
  • Update for v0.7.21 #182

Full Changelog: v0.7.19...v0.7.21