ItzJuny

Follow

😇

JunyanWu ItzJuny

😇

Follow

I'll go with my rhythm.

5 followers · 18 following

09:35 (UTC 08:00)
https://orcid.org/0000-0003-2692-5928

Lists (1)

Sort

🚀 My stack

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

john852517791 / awesome-fake-audio-detection

A list of tools, papers and code related to Fake Audio Detection.

13 Updated Mar 20, 2024

microsoft / muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,436 431 Updated Jun 10, 2024

microsoft / NeuralSpeech

Python 1,358 182 Updated Feb 11, 2024

LSimon95 / megatts2

Unoffical implementation of Megatts2

Python 250 34 Updated Mar 23, 2024

nerfies / nerfies.github.io

JavaScript 2,263 764 Updated Jun 21, 2024

Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Python 7,528 748 Updated Feb 11, 2024

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 35,104 4,109 Updated Aug 19, 2024

yeyupiaoling / Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…

C 796 125 Updated Jul 18, 2024

ItzJuny / AMSDF

[T-IFS] Audio Multi-view Spoofing Detection Framework Based on Audio-Text-Emotion Correlations

Python 4 Updated Jul 31, 2024

roger-tseng / AudioDecBenchmark

Forked from hbwu-ntu/AudioDecBenchmark

Audio Codec Benchmark

Python 3 Updated Jun 11, 2024

ItzJuny / CFPRF

[ACM MM'24] Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization

Python 8 Updated Aug 7, 2024

RenHuan1999 / CVPR2023_P-MIL

The official implementation of 'Proposal-based Multiple Instance Learning for Weakly-supervised Temporal Action Localization' (CVPR 2023)

Python 33 3 Updated Jun 1, 2023

happyharrycn / actionformer_release

Code release for ActionFormer (ECCV 2022)

Python 415 77 Updated Apr 11, 2024

ControlNet / AV-Deepfake1M

[ACM MM] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset

61 1 Updated Mar 8, 2024

csun22 / Synthetic-Voice-Detection-Vocoder-Artifacts

This repository is related to our Dataset and Detection code from the paper: AI-Synthesized Voice Detection Using Neural Vocoder Artifacts accepted in CVPR Workshop on Media Forensic 2023.

Python 85 9 Updated Aug 29, 2024

csun22 / LibriVoc-Dataset

LibriVoc is a new open-source, large-scale dataset for vocoder artifact detection. LibriVoc is derived from the LibriTTS speech corpus, which is widely used in text-to- speech research. The LibriTT…

Rich Text Format 16 1 Updated Jan 24, 2023

deepglint / RWKV-CLIP

The official code of "RWKV-CLIP: A Robust Vision-Language Representation Learner"

Python 75 4 Updated Jul 12, 2024

dynamic-superb / dynamic-superb

The official repository of Dynamic-SUPERB.

Python 143 90 Updated Aug 21, 2024

roger-tseng / CodecFake

A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024

10 Updated Jul 27, 2024

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 24,423 3,195 Updated Jul 23, 2024

xqwang14 / SMS-Loss

Python 3 Updated Jul 7, 2024

LAION-AI / CLIP_benchmark

CLIP-like model evaluation

Jupyter Notebook 567 71 Updated Aug 16, 2024

xieyuankun / Codecfake

This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".

Python 27 2 Updated May 16, 2024

microsoft / CLAP

Learning audio concepts from natural language supervision

Python 453 35 Updated May 27, 2024

Ming-er / Audio-Free-P-Tuning

Python 7 Updated Dec 28, 2023

cdjkim / audiocaps

🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps

Python 133 16 Updated Apr 23, 2024

XinhaoMei / WavCaps

This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.

Python 191 11 Updated Jul 25, 2024

akoepke / audio-retrieval-benchmark

Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".

Python 45 2 Updated Jul 22, 2022

XinhaoMei / audio-text_retrieval

Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval'

Python 41 5 Updated May 17, 2022

descriptinc / lyrebird-wav2clip

Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP

Python 322 27 Updated Feb 15, 2022