An End to End Model for Automatic Music Generation: Combining Deep Raw and Symbolic Audio Networks

Jean-Pierre Briot, Aluna Carina da Silva Chear, Rachel Manzelli, Vijay Thakkar, Ali Siahkamari, Áudio Bruto, Modelos Simbólicos, Ruído E Desestruturado, Modelos de Áudio Bruto

semanticscholar(2018)

Abstract
We develop an approach that combines two types of music generation models: symbolic models and raw audio models. Symbolic models typically operate at the note level and can capture long-term dependencies, but they lack the expressive richness and nuance of performed music. Raw audio models train directly on audio waveforms and can produce expressive music; however, they typically lack structure and long-term dependencies. We describe a work-in-progress model that trains a raw audio model based on the recently proposed WaveNet architecture but incorporates the notes of the composition as a secondary input to the network. To generate novel compositions, we use an LSTM network whose output feeds into the raw audio model, yielding an end-to-end model that generates raw audio while combining the strengths of both approaches. We describe initial results of our approach, which we believe show considerable promise for structured music generation.
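
One way to picture the "notes as a secondary input" idea is to expand note-level events into a per-sample signal that lines up with the waveform, so that each audio sample has a pitch label a raw audio model could be conditioned on. The sketch below is only an illustration under stated assumptions: the abstract does not specify the note encoding, so the `(MIDI pitch, duration in seconds)` tuple format, the 16 kHz sample rate, the one-hot encoding, and the helper names `notes_to_conditioning` and `one_hot` are all hypothetical.

```python
from typing import List, Tuple

# Hypothetical note format (an assumption, not taken from the paper):
# (MIDI pitch number, duration in seconds).
Note = Tuple[int, float]


def notes_to_conditioning(notes: List[Note], sample_rate: int = 16000) -> List[int]:
    """Expand note-level events into one pitch label per audio sample."""
    conditioning: List[int] = []
    for pitch, duration in notes:
        # Number of waveform samples this note spans at the given rate.
        n_samples = round(duration * sample_rate)
        conditioning.extend([pitch] * n_samples)
    return conditioning


def one_hot(pitch: int, n_pitches: int = 128) -> List[float]:
    """Encode a pitch label as a one-hot vector over the MIDI pitch range."""
    vec = [0.0] * n_pitches
    vec[pitch] = 1.0
    return vec


# A short symbolic melody, standing in for the output of a note-level model.
melody: List[Note] = [(60, 0.5), (64, 0.25), (67, 0.25)]

# Per-sample conditioning signal, aligned with one second of audio at 16 kHz.
cond = notes_to_conditioning(melody, sample_rate=16000)
```

In a full system, a signal like `cond` (or its one-hot form) would accompany the waveform during training, and at generation time the melody would come from the symbolic model rather than being written by hand.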