You just have to enter a prompt, and you get a full song...
Honestly, that's not fair for musicians who want to compose their own music and make a living from it.
That said, if you take samples from AI-generated audio, you can use those chunks to create new songs, not far from traditional sampling, actually.
So, about 10 months ago, I decided to build an AI sampler, multitracks, multi-outputs, to generate loops while playing and improvising with my synths.
Just simple 6-second loops. On the fly. Synced with the BPM of my DAW.
It's not real-time, it takes around 20 seconds between hitting generate and the sound loading in the VST.
I added 8 tracks, 4 pages per track, 8 sequences per page with 64 steps each.
Waveform editing. Beat repeat.
During development, I'd play with the instrument for an hour or more, discover new bugs, realize I needed new features, then go back to my laptop to keep coding. It was genuinely fun to develop and play in parallel.
In September 2025, I was accepted to showcase the VST at the AES (Audio Engineering Society) conference in London.
In June 2025, Bedroom Producers Blog wrote an article about it, and the next day, a wave of GitHub stars came in. Along with a recurring piece of feedback: "your project is huge but the installation is a mess."
That's when I realized that a lot of musicians don't have a GPU at home and aren't developers.
So I had an idea: what if I built a GPU provider network where monthly subscription revenue was shared equally between providers, every month?
I opened GitHub Discussions to talk about it. Nobody answered.
So I decided to build a SaaS with monthly subscriptions, using fal.ai for generation. And then some people were almost insulting me for "selling my soul to the devil."
So, I proposed something collaborative, got no response. Then when I proposed a practical solution (not perfect, but I need to pay rent like everyone else), people almost attacked me for it.
Cool.
So here we are. I built the GPU provider network anyway.
Why? Because I hate the idea of being dependent on AWS and the like.
Part of the central server still relies on fal.ai for LLM calls, but I want to find other solutions in the future. Collaborative ones.
Here's how it works:
There's a central server acting as a router. When a user requests a loop, it's sent to the central server, which pings all available GPU providers. The request is forwarded to a random free provider, which generates the audio and returns it to the user.
I spent a lot of time securing the network, random proof-of-work tests, canary testing, code integrity checks. Everything is documented in the GitHub repo.
Subscription revenue is split equally between all active providers. 15% goes to the central server (infrastructure costs), the rest is shared equally. All financial data is public, anonymized, but public. Every prompt hash and audio hash is also public. Why? Trust. All code is open source. Finances too.
Currently, there are 267 free users. 0 paying subscribers.
Yes, that's not a lot. And yes, if new providers join now, they won't make money right away. That's a real risk. I won't pretend otherwise.
So why take that risk?
Because to me it's obvious: own the network. Be part of a group that says f**k off to AWS and every other compute monopoly, and builds something outside their grip.
Yes, I'm probably dreaming. Maybe you think I've been smoking too much. Maybe you project a lot onto someone you've never met, from the comfort of your chair, saying things you'd never say face to face.
That's fine. I'm open to criticism.
But the code is there. The network is there. If you want to be part of it:
https://github.com/innermost47/obsidian-neural-provider
