Skip to main content

New best story on Hacker News: Show HN: Sonauto – A more controllable AI music creator

Show HN: Sonauto – A more controllable AI music creator
441 by zaptrem | 231 comments on Hacker News.
Hey HN, My cofounder and I trained an AI music generation model and after a month of testing we're launching 1.0 today. Ours is interesting because it's a latent diffusion model instead of a language model, which makes it more controllable: https://sonauto.ai/ Others do music generation by training a Vector Quantized Variational Autoencoder like Descript Audio Codec ( https://ift.tt/lDz3dZt ) to turn music into tokens, then training an LLM on those tokens. Instead, we ripped the tokenization part off and replaced it with a normal variational autoencoder bottleneck (along with some other important changes to enable insane compression ratios). This gave us a nice, normally distributed latent space on which to train a diffusion transformer (like Sora). Our diffusion model is also particularly interesting because it is the first audio diffusion model to generate coherent lyrics! We like diffusion models for music generation because they have some interesting properties that make controlling them easier (so you can make your own music instead of just taking what the machine gives you). For example, we have a rhythm control mode where you can upload your own percussion line or set a BPM. Very soon you'll also be able to generate proper variations of an uploaded or previously generated song (e.g., you could even sing into Voice Memos for a minute and upload that!). @Musicians of HN, try uploading your songs and using Rhythm Control/let us know what you think! Our goal is to enable more of you, not replace you. For example, we turned this drum line ( https://ift.tt/NFmswVX ) into this full song ( https://ift.tt/trmC0pb skip to 1:05 if impatient) or this other song I like better ( https://ift.tt/a9WOqdw - we accidentally compressed it with AAC instead of Opus which hurt quality, though) We also like diffusion models because while they're expensive to train, they're cheap to serve. We built our own efficient inference infrastructure instead of using those expensive inference as a service startups that are all the rage. That's why we're making generations on our site free and unlimited for as long as possible. We'd love to answer your questions. Let us know what you think of our first model! https://sonauto.ai/

Comments

Popular posts from this blog

New best story on Hacker News: Launch HN: Electric Air (YC W23) – Heat pump sold directly to homeowners

Launch HN: Electric Air (YC W23) – Heat pump sold directly to homeowners 571 by cmui | 527 comments on Hacker News. Hi HN! I’m Chris Mui, founder of Electric Air ( https://electricair.io ). We’re building a residential heat pump system. This will be an all-electric replacement for your home’s furnace and air conditioner that enables more centrally ducted installs, manages your indoor air quality, and saves you money on monthly energy bills. We also streamline purchase, finance and install by selling directly to homeowners. You can place a preorder today at https://electricair.io . Heat pumps work by using refrigerant and a compressor to move energy against a temperature gradient. If you put 1 kWh of energy into a heat pump, you get 3-5 kWh of heating in your home. But this isn’t breaking the laws of physics because heat pumps don’t make heat, they move it around. The extra 2-4kWh gets absorbed from the outdoors, even when it is cold outside. The low pressure refrigerant in the outdo...