KVR Audio

guitarzan · Post by **guitarzan** » Sun Mar 01, 2026 12:21 am

I’ve been using regular chat sessions to sculpt the prompts lately because, in the case of Gemini/Lyria anyway, the collaborative crosstalk pollutes the generative music output in a Create Music session.

The music gen modeler (Lyria in this case) apparently polls the chat in a Create Music session even before the command to generate is issued.

Gemini didn’t even know that until our session where it described our situation as analogous to pulling on a specific thread that causes the whole sweater to unravel, then one of the subsequent generations produced a Pat Boone style crooner singing about his sweater unraveling (though we were constantly prompting for searing electric guitar bop instrumental music, etc.)

So the suggestion I made is that there needs to be a Collaboration Mode that disconnects the chat from Lyria altogether. Until that happens I switch to a regular Chat session to collaborate and then Gemini provides its rendition of our prompt in a block of plain text that I manually copy and paste into a fresh Create Music session.

So normally you basically have one Chat LLM (Gemini in this case) who translates the user’s prompt and passes it on to another LLM that runs the algorithmic composer (Lyria in this case), the generated file is passed to the user, but neither the Chat LLM nor the generative LLM have any real feedback on the generated file except via the human user’s subsequent prompts. I can’t believe it even works at all. The composition generative model is a black box, it produces no metadata concerning the actual composition of the music at all.

Michael L · Post by **Michael L** » Sun Mar 01, 2026 2:02 am

"Thought is the enemy of flow"
Vinnie Colaiuta

Michael L · Post by **Michael L** » Sun Mar 01, 2026 6:29 am

BONES wrote: Sat Feb 28, 2026 11:42 am From Craig:

Thanks, Craig!
I have a much better idea of what the workflow looks & feels like.

Tiles · Post by **Tiles** » Sun Mar 01, 2026 7:07 am

BONES wrote: Sat Feb 28, 2026 10:40 pm ...
Another good general tip when using AI is once you get something you're happy with, ask the AI what prompt would have led to that solution in the first place.

Indeed. A very good tip. In fact i go sometimes the other way around like you describe here. There are LLM's that can analyze Images. So i first search for example for an image that fits the style i want to achieve, and then let this LLM analyze the image. I think ChatGPT can do this natively with songs too.

jancivil · Post by **jancivil** » Sun Mar 08, 2026 7:23 pm

Michael L wrote: Sun Mar 01, 2026 2:02 am "Thought is the enemy of flow"
Vinnie Colaiuta

"You can't think and play at the same time." - Sonny Rollins.

How to get AI to think outside of your box?