-
UC San Diego
- USA/Taiwan
- hermandong.com
- @hermanhwdong
- in/hwdong
Highlights
- Pro
Starred repositories
Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors
This repository lists important papers on automated trailer generation.
Models and datasets for training deep learning automatic mixing models
Flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm
DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/
Collection of audio-focused loss functions in PyTorch
[ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation
[CVPR'23 Highlight] AutoAD: Movie Description in Context.
MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions
Community list of startups working with AI in audio and music technology
LaTeX class file for writing dissertations at UC San Diego
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Pytorch implementation of the CREPE pitch tracker
Differentiable audio signal processors in PyTorch
Python launcher of animated MIDI player by @cifkao & @magenta
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)
A simple library for Fréchet Audio Distance (FAD) calculation
ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023)
A dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.
🎵 Music notation engraving library for MEI with MusicXML and Humdrum support and various toolkits (JavaScript, Python)
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]