Skip to content
View ItzJuny's full-sized avatar
😇
😇

Block or report ItzJuny

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

A list of tools, papers and code related to Fake Audio Detection.

13 Updated Mar 20, 2024

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,436 431 Updated Jun 10, 2024
Python 1,358 182 Updated Feb 11, 2024

Unoffical implementation of Megatts2

Python 250 34 Updated Mar 23, 2024
JavaScript 2,263 764 Updated Jun 21, 2024

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Python 7,528 748 Updated Feb 11, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 35,104 4,109 Updated Aug 19, 2024

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…

C 796 125 Updated Jul 18, 2024

[T-IFS] Audio Multi-view Spoofing Detection Framework Based on Audio-Text-Emotion Correlations

Python 4 Updated Jul 31, 2024

Audio Codec Benchmark

Python 3 Updated Jun 11, 2024

[ACM MM'24] Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization

Python 8 Updated Aug 7, 2024

The official implementation of 'Proposal-based Multiple Instance Learning for Weakly-supervised Temporal Action Localization' (CVPR 2023)

Python 33 3 Updated Jun 1, 2023

Code release for ActionFormer (ECCV 2022)

Python 415 77 Updated Apr 11, 2024

[ACM MM] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset

61 1 Updated Mar 8, 2024

This repository is related to our Dataset and Detection code from the paper: AI-Synthesized Voice Detection Using Neural Vocoder Artifacts accepted in CVPR Workshop on Media Forensic 2023.

Python 85 9 Updated Aug 29, 2024

LibriVoc is a new open-source, large-scale dataset for vocoder artifact detection. LibriVoc is derived from the LibriTTS speech corpus, which is widely used in text-to- speech research. The LibriTT…

Rich Text Format 16 1 Updated Jan 24, 2023

The official code of "RWKV-CLIP: A Robust Vision-Language Representation Learner"

Python 75 4 Updated Jul 12, 2024

The official repository of Dynamic-SUPERB.

Python 143 90 Updated Aug 21, 2024

A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024

10 Updated Jul 27, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 24,423 3,195 Updated Jul 23, 2024
Python 3 Updated Jul 7, 2024

CLIP-like model evaluation

Jupyter Notebook 567 71 Updated Aug 16, 2024

This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".

Python 27 2 Updated May 16, 2024

Learning audio concepts from natural language supervision

Python 453 35 Updated May 27, 2024
Python 7 Updated Dec 28, 2023

🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps

Python 133 16 Updated Apr 23, 2024

This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.

Python 191 11 Updated Jul 25, 2024

Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".

Python 45 2 Updated Jul 22, 2022

Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval'

Python 41 5 Updated May 17, 2022

Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP

Python 322 27 Updated Feb 15, 2022
Next