ISMIR-2023-Papers

License: MIT


ISMIR 2023 Papers: A complete collection of influential and exciting research papers from the ISMIR 2023 conference. Explore the latest advances in music information retrieval. Code included. ⭐

ISMIR 2023

Total Papers Preprint Papers Papers with Open Code

👉 * This count includes repositories on GitHub, GitLab, Hugging Face, and distributions on PyPI, while excluding Web Page or GitHub Page links.


The PDF version of the ISMIR 2023 Conference Programme comprises a list of all accepted full papers, their presentation order, and the designated presentation times.


Other collections of the best AI conferences

❗ The conference table is kept up to date.

| Conference | Year |
|---|---|
| Computer Vision (CV): CVPR, ICCV | 2023 |
| Speech/Signal Processing (SP/SigProc): ICASSP, INTERSPEECH | 2023 |

Contributors



Contributions to improve the completeness of this list are greatly appreciated. If you come across any overlooked papers, please feel free to create pull requests, open issues or contact me via email. Your participation is crucial to making this repository even better.


| Title | Material |
|---|---|
| Analysing Physiological Data Collected During Music Listening: An Introduction | WEB Page |
| Introduction to Differentiable Audio Synthesizer Programming | WEB Page |
| Transformer-based Symbolic Music Generation: Fundamentals to Advanced Concepts, Stylistic Considerations, Conditioning Mechanisms and Large Language Models | Google Slide, Google Slide |
| Computer-Assisted Music-Making Systems: Taxonomy, Review, and Coding | |
| Learning with Music Signals: Technology Meets Education | |
| Kymatio: Deep Learning Meets Wavelet Theory for Music Signal Processing | WEB Page |

List of sessions

Session 1

Session Papers Preprint Papers Papers with Open Code

| Title | Repo | Paper |
|---|---|---|
| Exploring the Correspondence of Melodic Contour with Gesture in Raga Alap Singing | WEB Page | archives.ismir, Pdf |
| TriAD: Capturing Harmonics with 3D Convolutions | GitHub | archives.ismir |
| Data Collection in Music Generation Training Sets: A Critical Analysis | GitHub | archives.ismir |
| A Review of Validity and its Relationship to Music Information Research | GitHub | archives.ismir, arXiv |
| Segmentation and Analysis of Taniavartanam in Carnatic Music Concerts | | archives.ismir |
| Transfer Learning and Bias Correction with Pre-Trained Audio Embeddings | GitHub | archives.ismir, arXiv |
| Collaborative Song Dataset (CoSoD): An Annotated Dataset of Multi-Artist Collaborations in Popular Music | GitHub | archives.ismir, arXiv |
| Human-AI Music Creation: Understanding the Perceptions and Experiences of Music Creators for Ethical and Productive Collaboration | GitHub | archives.ismir |
| Impact of Time and Note Duration Tokenizations on Deep Learning Symbolic Music Modeling | GitHub | archives.ismir, arXiv |
| Musical Micro-Timing for Live Coding | GitHub | archives.ismir |
| A Few-Shot Neural Approach for Layout Analysis of Music Score Image | GitHub | archives.ismir |
| TapTamDrum: A Dataset for Dualized Drum Patterns | GitHub Page, GitHub | archives.ismir |
| Real-Time Percussive Technique Recognition and Embedding Learning for the Acoustic Guitar | GitHub | archives.ismir, arXiv |
| IteraTTA: An Interface for Exploring both Text Prompts and Audio Priors in Generating Music with Text-to-Audio Models | Demo | archives.ismir, arXiv |
| Similarity Evaluation of Violin Directivity Patterns for Musical Instrument Retrieval | | archives.ismir |
| Polyrhythmic Modelling of Non-Isochronous and Microtiming Patterns | | archives.ismir |

Session 2

Session Papers Preprint Papers Papers with Open Code

| Title | Repo | Paper |
|---|---|---|
| CLaMP: Contrastive Language-Music Pre-Training for Cross-Modal Symbolic Music Information Retrieval | GitHub Page, GitHub | archives.ismir, arXiv |
| Gender-Coded Sound: Analysing the Gendering of Music in Toy Commercials via Multi-Task Learning | GitHub | archives.ismir, ResearchGate |
| A Dataset and Baselines for Measuring and Predicting the Music Piece Memorability | | archives.ismir |
| Efficient Notation Assembly in Optical Music Recognition | | archives.ismir, Pdf |
| White Box Search over Audio Synthesizer Parameters | | archives.ismir |
| Decoding Drums, Instrumentals, Vocals, and Mixed Sources in Music using Human Brain Activity with fMRI | | archives.ismir |
| Dual Attention-based Multi-Scale Feature Fusion Approach for Dynamic Music Emotion Recognition | GitHub Page | archives.ismir |
| Automatic Piano Transcription with Hierarchical Frequency-Time Transformer | GitHub | archives.ismir, arXiv |
| High-Resolution Violin Transcription using Weak Labels | GitHub | archives.ismir, Pdf |
| Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls | GitHub Page, GitHub | archives.ismir, arXiv |
| The Coordinated Corpus of Popular Musics (CoCoPops): A Meta-Dataset of Melodic and Harmonic Transcriptions | GitHub | archives.ismir |
| Towards Computational Music Analysis for Music Therapy | | archives.ismir |
| Timbre Transfer using Image-to-Image Denoising Diffusion Implicit Models | GitHub Page, GitHub | archives.ismir, arXiv |
| Correlation of EEG Responses Reflects Structural Similarity of Choruses in Popular Music | | archives.ismir |
| Chromatic Chords in Theory and Practice | | archives.ismir |

Session 3

Session Papers Preprint Papers Papers with Open Code

| Title | Repo | Paper |
|---|---|---|
| BPS-Motif: A Dataset for Repeated Pattern Discovery of Polyphonic Symbolic Music | GitHub | archives.ismir |
| Weakly Supervised Multi-Pitch Estimation using Cross-Version Alignment | GitHub | archives.ismir |
| The Batik-Plays-Mozart Corpus: Linking Performance to Score to Musicological Annotations | GitHub | archives.ismir, arXiv |
| Mono-to-Stereo through Parametric Stereo Generation | | archives.ismir, arXiv |
| From West to East: Who Can Understand the Music of the Others Better? | | archives.ismir, ResearchGate |
| On the Performance of Optical Music Recognition in the Absence of Specific Training Data | | archives.ismir |
| Composer's Assistant: An Interactive Transformer for Multi-Track MIDI Infilling | GitHub | archives.ismir, arXiv |
| The FAV Corpus: An Audio Dataset of Favorite Pieces and Excerpts, with Formal Analyses and Music Theory Descriptors | WEB Page | archives.ismir |
| LyricWhiz: Robust Multilingual Lyrics Transcription by Whispering to ChatGPT | GitHub | archives.ismir, arXiv |
| Sounds Out of Pläce? Score Independent Detection of Conspicuous Mistake Regions in MIDI Piano Performances | WEB Page | archives.ismir, Pdf |
| VampNet: Music Generation via Masked Acoustic Token Modeling | WEB Page, GitHub | archives.ismir, arXiv |
| Expert and Novice Evaluations of Piano Performances: Criteria for Computer-Aided Feedback | | archives.ismir |
| Contrastive Learning for Cross-Modal Artist Retrieval | | archives.ismir, arXiv |
| Repetition-Structure Inference with Formal Prototypes | GitHub | archives.ismir |
| Algorithmic Harmonization of Tonal Melodies using Weighted Pitch Context Vectors | GitHub | archives.ismir |
| Text-to-Lyrics Generation with Image-based Semantics and Reduced Risk of Plagiarism | GitHub | archives.ismir |

Session 4

Session Papers Preprint Papers Papers with Open Code

| Title | Repo | Paper |
|---|---|---|
| LP-MusicCaps: LLM-based Pseudo Music Captioning | GitHub | archives.ismir, arXiv |
| A Repetition-based Triplet Mining Approach for Music Segmentation | | archives.ismir, HAL Science |
| Predicting Music Hierarchies with a Graph-based Neural Decoder | GitHub | archives.ismir, arXiv |
| Stabilizing Training with Soft Dynamic Time Warping: A Case Study for Pitch Class Estimation with Weakly Aligned Targets | GitHub | archives.ismir, arXiv |
| Finding Tori: Self-Supervised Learning for Analyzing Korean Folk Song | GitHub Page, GitHub | archives.ismir, arXiv |
| Singer Identity Representation Learning using Self-Supervised Techniques | GitHub | archives.ismir, HAL Science |
| On the Effectiveness of Speech Self-Supervised Learning for Music | | archives.ismir, arXiv |
| Transformer-based Beat Tracking with Low-Resolution Encoder and High-Resolution Decoder | | archives.ismir |
| Adding Descriptors to Melodies Improves Pattern Matching: A Study on Slovenian Folk Songs | | archives.ismir |
| How Control and Transparency for Users Could Improve Artist Fairness in Music Recommender Systems | | archives.ismir, Pdf |
| Towards a New Interface for Music Listening: A User Experience Study on YouTube | | archives.ismir, arXiv |
| FiloBass: A Dataset and Corpus based Study of Jazz Basslines | GitHub Page | archives.ismir |
| Comparing Texture in Piano Scores | | archives.ismir |
| Introducing DiMCAT for Processing and Analyzing Notated Music on a Very Large Scale | GitHub | archives.ismir |
| Sequence-to-Sequence Network Training Methods for Automatic Guitar Transcription with Tokenized Outputs | GitHub | archives.ismir |

Session 5

Session Papers Preprint Papers Papers with Open Code

| Title | Repo | Paper |
|---|---|---|
| PESTO: Pitch Estimation with Self-Supervised Transposition-Equivariant Objective | GitHub | archives.ismir, arXiv |
| The Games we Play: Exploring the Impact of ISMIR on Musicology | | archives.ismir |
| Carnatic Singing Voice Separation using Cold Diffusion on Training Data with Bleeding | GitHub | archives.ismir, Pdf |
| Unveiling the Impact of Musical Factors in Judging a Song on First Listen: Insights from a User Survey | GitHub | archives.ismir |
| Towards Building a Phylogeny of Gregorian Chant Melodies | | archives.ismir |
| Audio Embeddings as Teachers for Music Classification | GitHub | archives.ismir, arXiv |
| ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control | GitHub | archives.ismir |
| Roman Numeral Analysis with Graph Neural Networks: Onset-Wise Predictions from Note-Wise Features | GitHub | archives.ismir, arXiv |
| Semi-Automated Music Catalog Curation using Audio and Metadata | | archives.ismir |
| Crowd's Performance on Temporal Activity Detection of Musical Instruments in Polyphonic Music | | archives.ismir |
| MoisesDB: A Dataset for Source Separation Beyond 4 Stems | GitHub | archives.ismir, arXiv |
| Music as Flow: A Formal Representation of Hierarchical Processes in Music | | archives.ismir |
| Online Symbolic Music Alignment with Offline Reinforcement Learning | GitHub | archives.ismir |
| InverSynthII: Sound Matching via Self-Supervised Synthesizer-Proxy and Inference-Time Finetuning | GitHub | archives.ismir |
| A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-by-Humming Task | GitHub | archives.ismir |
| Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction | GitHub | archives.ismir, arXiv |

Session 6

Session Papers Preprint Papers Papers with Open Code

| Title | Repo | Paper |
|---|---|---|
| Singing Voice Synthesis using Differentiable LPC and Glottalflow Inspired Wavetables | GitHub Page, GitHub | archives.ismir, arXiv |
| Harmonic Analysis with Neural Semi-CRF | GitHub | archives.ismir |
| A Dataset and Baseline for Automated Assessment of Timbre Quality in Trumpet Sound | GitHub, Dataset | archives.ismir |
| Visual Overviews for Sheet Music Structure | | archives.ismir, arXiv |
| Passage Summarization with Recurrent Models for Audio – Sheet Music Retrieval | | archives.ismir, arXiv |
| Predicting Performance Difficulty from Piano Sheet Music Images | GitHub | archives.ismir, arXiv |
| Self-Refining of Pseudo Labels for Music Source Separation with Noisy Labeled Data | | archives.ismir, arXiv |
| Quantifying the Ease of Playing Song Chords on the Guitar | | archives.ismir |
| FlexDTW: Dynamic Time Warping with Flexible Boundary Conditions | | archives.ismir |
| Modeling Bends in Popular Music Guitar Tablatures | | archives.ismir, arXiv |
| Self-Similarity-based and Novelty-based Loss for Music Structure Analysis | GitHub | archives.ismir, arXiv |
| Modeling Harmonic Similarity for Jazz using Cooccurrence Vectors and the Membrane Area | | archives.ismir, Pdf |
| SingStyle111: A Multilingual Singing Dataset with Style Transfer | GitHub Page | archives.ismir |
| A Computational Evaluation Framework for Singable Lyric Translation | | archives.ismir, arXiv |
| Chorus-Playlist: Exploring the Impact of Listening to Only Choruses in a Playlist | GitHub | archives.ismir |

Session 7

Session Papers Preprint Papers Papers with Open Code

| Title | Repo | Paper |
|---|---|---|
| Supporting Musicological Investigations with Information Retrieval Tools: An Iterative Approach to Data Collection | | archives.ismir, Pdf |
| Optimizing Feature Extraction for Symbolic Music | WEB Page, GitHub | archives.ismir, arXiv |
| Exploring Sampling Techniques for Generating Melodies with a Transformer Language Model | | archives.ismir, arXiv |
| Measuring the Eurovision Song Contest: A Living Dataset for Real-World MIR | GitHub | archives.ismir |
| Efficient Supervised Training of Audio Transformers for Music Representation Learning | GitHub | archives.ismir, arXiv |
| A Cross-Version Approach to Audio Representation Learning for Orchestral Music | GitHub | archives.ismir |
| Music Source Separation with MLP Mixing of Time, Frequency, and Channel | | archives.ismir |
| Symbolic Music Representations for Classification Tasks: A Systematic Evaluation | GitHub | archives.ismir, arXiv |
| The Music Meta Ontology: A Flexible Semantic Model for the Interoperability of Music Metadata | GitHub | archives.ismir, arXiv |
| Polar Manhattan Displacement: Measuring Tonal Distances between Chords based on Intervallic Content | | archives.ismir, Pdf |

Late-Breaking Demo

Session Papers Preprint Papers Papers with Open Code

| Title | Repo | Paper |
|---|---|---|
| Virtuoso Strings: A Dataset of String Ensemble Recordings and Onset Annotations for Timing Analysis | GitHub Page, GitHub | archives.ismir |
| Music Scope Pad | | archives.ismir |
| The Chordinator: Chord Progression Modeling and Generation using Transformers | | archives.ismir |
| Cosine Contours: A Case Study with Melodies from Irish Traditional Dance Music | | archives.ismir |
| AutoOsu: Audio-Aware Action Generation for Rhythm Games | GitHub Page, GitHub | archives.ismir |
| Can MusicGen Create Training Data for MIR Tasks? | | archives.ismir |
| Automatic Production of Acoustic Piano Transcription Data | GitHub | archives.ismir |
| MemoVision: A Tool for Feature Selection and Visualization of Performance Data | GitHub | archives.ismir |
| The Hi-Audio Online Platform for Distributed Music Crowdsourcing Database Collection | | archives.ismir |
| On the use of Synthesized Datasets and Transformer Adaptors for Musical Instrument Recognition | | archives.ismir |
| What Can Go Wrong When Conducting Beat Tracking Experiments | GitHub | archives.ismir |
| Bridging Audio and Symbolic Piano Data through a Web-based Annotation Interface | Demo | archives.ismir |
| Towards Differentiable Piano Synthesis based on Physical Modeling | WEB Page | archives.ismir |
| DJ StructFreak: Automatic DJ System Built with Music Structure Embeddings | | archives.ismir |
| Generating Folk-Like Music in ABC-Notation with Masked Language Models | GitHub | archives.ismir |
| AQUATK: An Audio Quality Assessment Toolkit | GitHub | archives.ismir |
| Encoding Performance Data in MEI with the Automatic Music Performance Analysis and Comparison Toolkit (AMPACT) | | archives.ismir |
| Retrieval Augmented Generation of Symbolic Music with LLMs | GitHub | archives.ismir |
| The Biased Journey of MSD_AUDIO.ZIP | | archives.ismir, arXiv |
| FMAK: A Dataset of Key and Mode Annotations for the Free Music Archive – Extended Abstract | | archives.ismir |
| Orchestral Texture Classification with Convolution | GitHub | archives.ismir |
| Beat-Aligned Spectrogram-to-Sequence Generation of Rhythm-Game Charts | GitHub | archives.ismir |
| Total Variation in Popular Rap Vocals from 2009-2023: Extension of the Analysis by Georgieva, Ripollés & McFee | GitHub Page | archives.ismir |
| Towards Automated Estimation of Values from Song Lyrics: A Data Collection Protocol | | archives.ismir |
| Track Role Prediction of Single-Instrumental Sequences | | archives.ismir |
| Demo of a Smart Musical Instrument-based Real Time Pattern Detection System | | archives.ismir |
| JamALT: A Formatting-Aware Lyrics Transcription Benchmark | | archives.ismir |
| STraDa: A Singer Traits Dataset | | archives.ismir |
| Interpretable Modular Representation Learning for Full-Band Accompaniment Arrangement | | archives.ismir |
| Improving Embeddings in Harmony Transformer | | archives.ismir |
| Singable and Controllable Neural Lyric Translation: A Late-Breaking Showcase | | archives.ismir |
| A Noise Augmentation Pipeline for Realistic Query-by-Example Simulation | GitHub | archives.ismir |
| LARS: An Open-Source VST3 Plugin for Deep Drums Demixing with Pretrained Models | GitHub | archives.ismir |
| Music Visualization using Chironomie | | archives.ismir |
| Automatic Transcription of Multi-Instrumental Songs: Integrating Demixing, Harmonic Dilated Convolution, and Joint Beat Tracking | | archives.ismir |
| Mel-Band RoFormer for Music Source Separation | | archives.ismir, arXiv |
| The VocalNotes Dataset | | archives.ismir |
| From Taps to Drums: Audio Percussion Style Transfer | | archives.ismir |
| JAMMIN-GPT: Text-based Improvisation using LLMs in Ableton Live | | archives.ismir |
| Visual Guitar Tab Comparison | GitHub | archives.ismir |
| Optimizing the Mridangam Stroke Transcription Pipeline: Addressing Key Challenges | | archives.ismir |
