Visualyre: multimodal album art generation for independent musicians

Pers. Ubiquitous Comput. (2023)

Abstract
Album art often reflects the trends and themes of the songs in a given collection, and even the identities of the musicians who produced it. It therefore plays a central role in shaping a potential listener’s first impression of the work. As a result, musicians strive to find suitable images, but those with limited financial resources or design skills may struggle to do so. Here, we report the development of Visualyre, a deep learning–based application that generates album art images from users’ song lyrics and audio files. The tool relies on generative adversarial network models to generate images from textual input (lyrics) and on style transfer models to adjust each image according to the mood of the audio. We also report the results of a user study in which 35 amateur and independent musicians tested the system. The results suggest that Visualyre was generally well received and largely effective in its intended purpose: providing musicians with a resource for generating their own album art.
Keywords
Deep learning, Image generation, Style transfer, Lyrics, Mood detection