NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
-
Updated
Dec 1, 2024 - Python
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
ICASSP 2022: "Text2Video: text-driven talking-head video synthesis with phonetic dictionary".
Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)
Reading list for research topics in Sound AI
[ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition".
This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition".
The repository provides links to collections of influential and interesting research papers from top AI conferences, with open-source code to promote reproducibility and provide detailed implementation insights beyond the scope of the article. Stay up to date with the latest advances in AI research!
ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription
ICASSP2017: End-to-end joint learning of natural language understanding and dialogue manager
This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.
This repository provides LaTeX templates for academic papers, you can select the appropriate template for your target conference or journal by switching branches. Each branch corresponds to a specific publication venue and follows its official formatting requirements.|本项目提供多种学术论文的 LaTeX 模板,可通过切换分支选择对应的会议或期刊模板。每个分支均针对特定投稿场景设计,并遵循相应的官方排版规范。
[ICASSP 2024] Official implementation of our paper "Contrastive Deep Nonnegative Matrix Factorization for Community Detection"
Face Recognition in real-world images [ICASSP 2017]
This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Multi-Aspect Attention (ICASSP 2023).
[ICASSP19] An Interaction-aware Attention Network for Speech Emotion Recognition in Spoken Dialogs
Official PyTorch implementation of A Quaternion-Valued Variational Autoencoder (QVAE).
SERAB: a multi-lingual benchmark for speech emotion recognition
[ICASSP 2025 Oral] ImageFlowNet: Forecasting Multiscale Image-Level Trajectories of Disease Progression with Irregularly-Sampled Longitudinal Medical Images
ICASSP 2019 official Latex template
Add a description, image, and links to the icassp topic page so that developers can more easily learn about it.
To associate your repository with the icassp topic, visit your repo's landing page and select "manage topics."