SeungHeon Doh seungheondoh

🌊

Ph.D Journey

Music Information Retrieval, Multimodal, Multimedia

224 followers · 181 following

Music and Audio Computing Lab
Daejeon, South Korea
https://seungheondoh.github.io/

Stars

yonghyunk1m / TowardsRobustTranscription

Towards Robust Transcription: Exploring Noise Injection Strategies for Training Data Augmentation

Jupyter Notebook · 4 stars · Updated Oct 22, 2024

sanderwood / clamp2

CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models

Python · 40 stars · 1 fork · Updated Oct 21, 2024

gzhu06 / Cacophony

Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986

Python · 34 stars · 4 forks · Updated Oct 13, 2024

seungheondoh / music-text-representation-pp

Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]

Python · 28 stars · 1 fork · Updated Oct 7, 2024

Hayeonbang / PIAST

A piano music dataset with Audio, Symbolic and Text labels

8 stars · Updated Sep 26, 2024

Jyonn / VQ4Rec

Awesome Papers Using Vector Quantization for Recommender Systems (VQ4Rec)

15 stars · 1 fork · Updated May 6, 2024

pnlong / PDMX

PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing

Python · 33 stars · 2 forks · Updated Oct 30, 2024

kyutai-labs / moshi

Python · 6,581 stars · 500 forks · Updated Oct 31, 2024

LTH14 / mar

PyTorch implementation of MAR DiffLoss https://arxiv.org/abs/2406.11838

Python · 959 stars · 50 forks · Updated Sep 27, 2024

mulab-mir / muchomusic

MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.

Jupyter Notebook · 22 stars · 1 fork · Updated Aug 9, 2024

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python · 12,173 stars · 1,020 forks · Updated Oct 30, 2024

noteflakes / awesome-music

Awesome Music Projects

1,864 stars · 109 forks · Updated Oct 6, 2024

yamathcy / ISMIR-2024-Papers

31 stars · 1 fork · Updated Oct 25, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

29,568 stars · 1,616 forks · Updated Aug 1, 2024

ZacharyNovack / Lead-AE

Official Repository of Unsupervised Lead Sheet Generation via Semantic Compression

Python · 15 stars · 1 fork · Updated Oct 23, 2023

gnobitab / InstaFlow

⚡ InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Python · 1,176 stars · 37 forks · Updated Jun 7, 2024

meta-llama / llama-recipes

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook · 14,524 stars · 2,120 forks · Updated Oct 31, 2024

GFNOrg / GFlowNet-EM

Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.

Jupyter Notebook · 38 stars · 2 forks · Updated Feb 9, 2024

llms-heart-mir / tutorial

TeX · 37 stars · Updated Jun 16, 2024

HilaManor / AudioEditingCode

Python · 137 stars · 22 forks · Updated Oct 13, 2024

slSeanWU / beats-conformer-bart-audio-captioner

PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation"

Jupyter Notebook · 30 stars · 1 fork · Updated Jan 6, 2024

gudgud96 / MR-MT3

MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage

Python · 36 stars · 2 forks · Updated Jul 12, 2024

seungheondoh / musical-word-embedding

Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]

Jupyter Notebook · 21 stars · Updated Apr 23, 2024

POZAlabs / MID-FiLD_code

[AAAI'24] Official dataset & demo code for MID-FiLD: MIDI Dataset for Fine-Level Dynamics

12 stars · Updated Mar 31, 2024

facebookresearch / clevr-dataset-gen

A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

Python · 586 stars · 206 forks · Updated Aug 30, 2021

sakemin / cog-musicgen-fine-tuner

Forked from replicate/cog-musicgen

This is a cog implementation of the fine-tuner for Meta's MusicGen

Python · 47 stars · 10 forks · Updated Apr 5, 2024

stas00 / ml-engineering

marlin-codes / Awesome-Hyperbolic-Representation-and-Deep-Learning

seungheondoh / lp-music-caps

minju0821 / musical_instrument_retrieval