KVR Audio is the Internet's number one news and information resource for open standard audio plugins. We report new releases, product announcements and product updates (major and minor) for all VST Plugins, DirectX Plugins and Audio Units Plugins. We manage a fully searchable audio plugin database (updated daily), and offer many free member services including user reviews, product update notifications and a very active discussion forum. We also host official support forums for many plugin developers plus the official Receptor support forum.
Plug-in Database: Virtual
Instruments, Effects & Hosts
Banks & Patches
Download & Upload
Plug-in Ratings
by KVR Members
Wiki: Tutorials,
Audio Lexicon, ...
Listen to Music
by KVR Members
Search
KVR

Google Powered Search:

in new window

KVR Powered Plug-in Search:

AuthorTopic: How do I convert speech into musical notes..
theplanb
Posted: 24th June 2004 01:20
Hi everyone.
I'm new here and I'm so glad I found this website 'cause I was really having trouble until now.

Ok, I'm not into making musics as such.
I'm actually learning English and doing some experiment on linguistics stuff especially on prosody, you know like intonation and stress in a language.

Ok, I'm simulating a situation where a baby listens to speech sound in surroundings while the baby doesn't recognise what the words actually mean.

Babies at first listen to the rhythm of the language and familiarise with them and I want to do this as a part of English training.

Basically, I need to modify a speech file so I can listen to the rhythm of English language (intonation, stress, high and low pitch) and rhythm only. And I shouldn't be able to make out what the speak is actually saying.

Say, I have an audio file which contains a native English speaker's speech, I need to modify the audio file so that I can hear the speed, pause, the pitch and all of that except the actual subtle differences in consonants or even vowels.
In other words, I shouldn't be able to make out A from B from C.

So, if a speaker says "how do you do", it should to me sound like "ouyouuu~~" or somthing like that ^^;
Actually, it doesn't matter how it would sound as long as I can hear the rhythm and pitch correctly.

I guess it's like converting speech file into a musical notes as all the sound remains intact except meanings.

Can someone suggest how I should go about doing this.
I haven't actually decided which (free) host program I should be using as I'm a total newbie in this.
I'm not even sure I'm in the right forum!

Please help..
Vegetarian
Posted: 24th June 2004 01:36
I'm guessing what you might want is a transient modulator ... I know Sony Oxford made a plugin that does that (called "Transient Modulator", no less), but I don't know of any free ones. You should be able to remove consonant sounds with that.
Karsten Viese
Posted: 24th June 2004 01:38
What you could do is to get a software (you can find a few free online) which convert audio to midi (If you dont know what midi is do a search).

This will give you a midi file which can be played back in windows as a piano (or whatever instrument you like).

Im not sure if its what you are looking for.

Do a google search for "audio to midi" for the software.

Hope it helps you out somewhat Smile
cron
Posted: 24th June 2004 01:38
Wow, that sounds extremely difficult. I'll have a play around with some free effects later and see if I can come up with anything, though someone will probably beat me to it.
Yossarian
Posted: 24th June 2004 01:41
Interesting application. You might want to check out a plug-in called Kantos from www.antarestech.com . It’s a voice-controlled synth that can extract pitch, dynamics, harmonic content and formant information from an audio file. There are a number of mp3 examples at the website.

If you don’t already have it you should also have a look at Praat ( www.praat.org ) which is a program for analysis, synthesis and manipulation of speech.

/Yoss
007
Posted: 24th June 2004 01:46
What you can do is download a trial version of Audiomulch http://www.audiomulch.com/

Not really easy although...
With sound in you can open your file. As sound effect you can open DigiGrunge or DLGranulator.
With sound out you can save your file.
007
Posted: 24th June 2004 01:55
Or this:
http://hem.passagen.se/rasmuse/Granny.htm

It is a standalone free granulator.
But you have to choose the right knob to tweak...
rokkon
Posted: 24th June 2004 03:02
maybe it will be enough to simply lowpass filter the speech with a steep filter to get rid of the formants and all other spectrum-based information?
bugs
Posted: 24th June 2004 05:02
You might start with Spechstimme notation--it was developed by Arnold Schoenberg for "Pierrot Lunaire" around 1900. It would be probably serve you well.

Do a Goggle search on SPECHSTIMME.
bugs
Posted: 24th June 2004 05:02
You might start with Spechstimme notation--it was developed by Arnold Schoenberg for "Pierrot Lunaire" around 1900. It would be probably serve you well.

Do a Goggle search on SPECHSTIMME.
bugs
Posted: 24th June 2004 05:03
You might start with Spechstimme notation--it was developed by Arnold Schoenberg for "Pierrot Lunaire" around 1900. It would be probably serve you well.

Do a Goggle search on SPECHSTIMME.
bugs
Posted: 24th June 2004 05:05
Sorry about the multiple posts. Site wasn't responding normally.
Meffy
Posted: 24th June 2004 05:37
bugs wrote:
Do a Goggle search on SPECHSTIMME.


Actually, "Sprechstimme." Wink Interesting search, bugs. Thanks for mentioning this.

Personally, I agree with those who suggest using granular resynthesis to scramble the incoming audio. Some granular setups are very flexible, so you could choose just how strongly to alter the sound.

You have a fascinating research project -- best of luck!

Meffy
bugs
Posted: 24th June 2004 07:41
Sorry for the misspelling--its was early here.
Meffy
Posted: 24th June 2004 08:17
bugs wrote:
Sorry for the misspelling--its was early here.


Ah, just playing. I knew what you meant, and that you knew your stuff too. Over here it's either early or late, depending on time-scale and criteria. Wink

Meffy
deastman
Posted: 24th June 2004 08:31
What about using Melodyne to map out the pitch and rhythm of the speech, and then impose that on musical tones?

http://www.celemony.com/
nuffink
Posted: 24th June 2004 08:39
Radical formant shifting might render apeach* unintelligible while retaining overall pitch and timing.
I can't think of such an effect but others may.

*edit: I should correct this to speech but it made me smile.
ArneyS
Posted: 24th June 2004 08:59
Melodyne works like a charm, you will not find anything that works that good with such simplicity.
theplanb
Posted: 24th June 2004 15:32
Wow.. this is overwhelming. Smile
I'm trying out your sugggestions one by one and it's taking time since I know nothing.. and have no idea which knob to turn. Shocked

So far, I haven't achieved much but I'm sure I'll get there in time and I'll let you guys know how it went. In the mean time, well thank you. Wink
Dause
Posted: 25th June 2004 02:36
> Melodyne works like a charm, you will not find anything that works that good with such simplicity.

I second that. Especially with the new Version 2.5 it has become even easier.
You simply make your voice recording then let melodyne do an analysis. Finally You load a VST synthesizer into melodynes Mixer window and send the midi from the speech track to the synthesizer track.
It's a piece of cake. Yummy!
Forum topics in the archive are read only. New posts should be made in the main KVR Forums.
Disclaimer:
All communications made available as part of this forum and any opinions, advice, statements, views or other information expressed in this forum are solely provided by, and the responsibility of, the person posting such communication and not of kvraudio.com (unless kvraudio.com is specifically identified as the author of the communication).