Lists (1)
Sort Name ascending (A-Z)
Stars
A list of tools, papers and code related to Fake Audio Detection.
Muzic: Music Understanding and Generation with Artificial Intelligence
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
🔊 Text-Prompted Generative Audio Model
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…
[T-IFS] Audio Multi-view Spoofing Detection Framework Based on Audio-Text-Emotion Correlations
Audio Codec Benchmark
[ACM MM'24] Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization
The official implementation of 'Proposal-based Multiple Instance Learning for Weakly-supervised Temporal Action Localization' (CVPR 2023)
Code release for ActionFormer (ECCV 2022)
[ACM MM] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
This repository is related to our Dataset and Detection code from the paper: AI-Synthesized Voice Detection Using Neural Vocoder Artifacts accepted in CVPR Workshop on Media Forensic 2023.
LibriVoc is a new open-source, large-scale dataset for vocoder artifact detection. LibriVoc is derived from the LibriTTS speech corpus, which is widely used in text-to- speech research. The LibriTT…
The official code of "RWKV-CLIP: A Robust Vision-Language Representation Learner"
The official repository of Dynamic-SUPERB.
A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".
Learning audio concepts from natural language supervision
🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps
This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.
Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".
Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval'
Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP