Fire your singers folks, Vocaloid 5 is here!
- KVRAF
- 3338 posts since 6 Aug, 2009
then the vocaloid-&-string machine song i made for you may not make you happy....jancivil wrote:I hate string machines.
-
- KVRist
- 283 posts since 6 Aug, 2017
I don't understand why we don't have a proper sample based vocal library yet. There are only 44 phonemes (vocal articulations, essentially) in the English language. I must be missing something, because I figure all you'd have to do is record each of those phonemes across say a 4 octave range and allow for midi cc or keyswitching between them (figuring out a good midi map would probably be the hardest part). That's 2112 samples for 1x rr.
Then maybe you'd want 4 dynamic layers, something like soft, normal, emotional, and gritty, and of course you'd want rr too. That puts you at 33792 samples for 4x rr at each of the four dynamic layers.
What am I missing here?
Then maybe you'd want 4 dynamic layers, something like soft, normal, emotional, and gritty, and of course you'd want rr too. That puts you at 33792 samples for 4x rr at each of the four dynamic layers.
What am I missing here?
- KVRAF
- 6980 posts since 28 Dec, 2015 from Atlantis Island
How much is this in Camels?Aloysius wrote:They seem to have a bartering system in place:
Vocaloid5 Standard, Upgrade Edition 15,000 hen.
Vocaloid5 Premium, Upgrade Edition 24,000 hen.
That's a lot of hens.
Alternatively Bavarian Cows?
https://sonograyn.bandcamp.com/music Experimental Ambient
https://martinjuenke.bandcamp.com/music Alternative Instrumental
https://martinjuenke.bandcamp.com/music Alternative Instrumental
- KVRAF
- 21196 posts since 8 Oct, 2014
A lot. The nuances of the human voice are...well, let's just say it is the most complex instrument on the planet. If it were as simple as you say, somebody would have done it by now. But you see, it's not just a matter of making the words and notes. There are inflections. Numerous inflections. There is breath singing, falsetto singing, growling, light vibrato, strong vibrato, accent...DrMEM wrote:I don't understand why we don't have a proper sample based vocal library yet. There are only 44 phonemes (vocal articulations, essentially) in the English language. I must be missing something, because I figure all you'd have to do is record each of those phonemes across say a 4 octave range and allow for midi cc or keyswitching between them (figuring out a good midi map would probably be the hardest part). That's 2112 samples for 1x rr.
Then maybe you'd want 4 dynamic layers, something like soft, normal, emotional, and gritty, and of course you'd want rr too. That puts you at 33792 samples for 4x rr at each of the four dynamic layers.
What am I missing here?
Do I really need to go on?
- KVRAF
- 4432 posts since 15 Nov, 2006 from Hell
you're missing Realitone librariesDrMEM wrote:I don't understand why we don't have a proper sample based vocal library yet. There are only 44 phonemes (vocal articulations, essentially) in the English language. I must be missing something, because I figure all you'd have to do is record each of those phonemes across say a 4 octave range and allow for midi cc or keyswitching between them (figuring out a good midi map would probably be the hardest part). That's 2112 samples for 1x rr.
Then maybe you'd want 4 dynamic layers, something like soft, normal, emotional, and gritty, and of course you'd want rr too. That puts you at 33792 samples for 4x rr at each of the four dynamic layers.
What am I missing here?
I don't know what to write here that won't be censored, as I can only speak in profanity.
- KVRAF
- 21196 posts since 8 Oct, 2014
I have them but they're very limited. They're great for background vocals but that's about it. No way to do lead vocals with them unless it's for a very simple song that has the words built into the program. And the dictionary is sparse, to say the least.Burillo wrote:you're missing Realitone librariesDrMEM wrote:I don't understand why we don't have a proper sample based vocal library yet. There are only 44 phonemes (vocal articulations, essentially) in the English language. I must be missing something, because I figure all you'd have to do is record each of those phonemes across say a 4 octave range and allow for midi cc or keyswitching between them (figuring out a good midi map would probably be the hardest part). That's 2112 samples for 1x rr.
Then maybe you'd want 4 dynamic layers, something like soft, normal, emotional, and gritty, and of course you'd want rr too. That puts you at 33792 samples for 4x rr at each of the four dynamic layers.
What am I missing here?
-
- KVRian
- 535 posts since 10 Apr, 2011
Just in case any marketing guys from Vocaloid were reading.
I would get it, but, I would like to try a demo first!
Last but not least, I would like to watch a video-tutorial where it's explained how to work with it inside a DAW, Cubase preferably.
Please, do it.
Thank you.
I would get it, but, I would like to try a demo first!
Last but not least, I would like to watch a video-tutorial where it's explained how to work with it inside a DAW, Cubase preferably.
Please, do it.
Thank you.
-
- KVRist
- 283 posts since 6 Aug, 2017
Script the legato and vibrato. You can do 4x rr for any articulation you can think of at a cost of 8448 samples (or fewer for a smaller singing range). That's what makes it feasible. It'd still be a ton of work, but it's not prohibitive.wagtunes wrote:A lot. The nuances of the human voice are...well, let's just say it is the most complex instrument on the planet. If it were as simple as you say, somebody would have done it by now. But you see, it's not just a matter of making the words and notes. There are inflections. Numerous inflections. There is breath singing, falsetto singing, growling, light vibrato, strong vibrato, accent...DrMEM wrote:I don't understand why we don't have a proper sample based vocal library yet. There are only 44 phonemes (vocal articulations, essentially) in the English language. I must be missing something, because I figure all you'd have to do is record each of those phonemes across say a 4 octave range and allow for midi cc or keyswitching between them (figuring out a good midi map would probably be the hardest part). That's 2112 samples for 1x rr.
Then maybe you'd want 4 dynamic layers, something like soft, normal, emotional, and gritty, and of course you'd want rr too. That puts you at 33792 samples for 4x rr at each of the four dynamic layers.
What am I missing here?
Do I really need to go on?
- Banned
- 1583 posts since 19 Aug, 2011
The term "string machine" is by no means meant to mean Vienna-type strings and the like.wagtunes wrote:The snobbery in this place is laughable, not to mention the hypocrisy.
The other day, one of our moderators (Kevvvv) was looking for a string machine. And all these suggestions came up, including what I think is one of the best ones out today, Arturia's Solina V.
Even though these things sound NOTHING like real strings. In fact, it's laughable to even consider these as a replacement for the real thing. But we use them. And we're not only content with them but we LOVE THEM.
And then, we have all our sample libraries that we use in place of the REAL thing. We love them too, even though some of them aren't all that great and even the best ones fall way short of the real thing in terms of sound and expressiveness.
And then, even with vocals, we use vocoders and other vocal FX processors that turn a human voice into a Cher clone because, by gosh, we LOVE THAT TOO.
But Vocaloid? We can't hurl enough insults at it. We can't find words vile enough to say about it.
What a bunch of hypocrites.
Cats are intended to teach us that not everything in nature has a function | http://soundcloud.com/bmoorebeats
- KVRAF
- 21196 posts since 8 Oct, 2014
Not prohibitive? That's debatable. But I'm not going to argue the issue any further because, trust me, nobody is going to try to make a convincing singing engine on samples.DrMEM wrote:Script the legato and vibrato. You can do 4x rr for any articulation you can think of at a cost of 8448 samples (or fewer for a smaller singing range). That's what makes it feasible. It'd still be a ton of work, but it's not prohibitive.wagtunes wrote:A lot. The nuances of the human voice are...well, let's just say it is the most complex instrument on the planet. If it were as simple as you say, somebody would have done it by now. But you see, it's not just a matter of making the words and notes. There are inflections. Numerous inflections. There is breath singing, falsetto singing, growling, light vibrato, strong vibrato, accent...DrMEM wrote:I don't understand why we don't have a proper sample based vocal library yet. There are only 44 phonemes (vocal articulations, essentially) in the English language. I must be missing something, because I figure all you'd have to do is record each of those phonemes across say a 4 octave range and allow for midi cc or keyswitching between them (figuring out a good midi map would probably be the hardest part). That's 2112 samples for 1x rr.
Then maybe you'd want 4 dynamic layers, something like soft, normal, emotional, and gritty, and of course you'd want rr too. That puts you at 33792 samples for 4x rr at each of the four dynamic layers.
What am I missing here?
Do I really need to go on?
- KVRAF
- 21196 posts since 8 Oct, 2014
LMAO. A Vocaloid that sang like me? Hell, even I wouldn't buy it.deastman wrote:If it can make me sing like Wags, I’m in!
- KVRAF
- 7397 posts since 20 Jul, 2004 from Clearwater
Try this. It sounds better than Vocaloid to me.DrMEM wrote:I don't understand why we don't have a proper sample based vocal library yet. There are only 44 phonemes (vocal articulations, essentially) in the English language. I must be missing something, because I figure all you'd have to do is record each of those phonemes across say a 4 octave range and allow for midi cc or keyswitching between them (figuring out a good midi map would probably be the hardest part). That's 2112 samples for 1x rr.
Then maybe you'd want 4 dynamic layers, something like soft, normal, emotional, and gritty, and of course you'd want rr too. That puts you at 33792 samples for 4x rr at each of the four dynamic layers.
What am I missing here?
https://www.youtube.com/watch?v=goDHaTz62Fs&t=1337s
You are currently reading my signature.