Rockbox mail archive
Subject: Re: Make voices script
From: Peter Herngaard <pethern_at_email.dk>
Date: Wed, 18 Oct 2006 00:43:11 +0200

Hello Hamish

Sorry, I forgot to explain that you have to place the two lines within the subroutine MakeVoiceFile. As for the possible values of rate and volume, refer to the SAPI5 documentation (sapi.chm).

---
The Microsoft.com Speech website
Microsoft Speech SDK
SAPI 5.1

TTS Engine Characteristics

Engines use the three characteristics of Volume, Pitch, and Rate to partially define speech traits. At the application level, setting these values is simple; you need only set them to a given number. However, implementation of these traits is more complex for the engine.

Volume
At the application level, volume is a number from zero to 100 where 100 is the maximum value for a voice. It is a linear progression and a value of 50 represents half of the loudest permitted. The increments should be the range divided by 100.

Pitch adjustment
The value can range from -10 to +10. A value of zero sets a voice to speak at its default pitch. A value of -10 sets a voice to speak at three-fourths of its default pitch. A value of +10 sets a voice to speak at four-thirds of its default pitch. Each increment between -10 and +10 is logarithmically distributed such that incrementing or decrementing by 1 is multiplying or dividing the pitch by the 24th root of 2 (about 1.03). Values outside of the -10 and +10 range will be passed to an engine. However, SAPI 5-compliant engines may not support such extremes and may clip the pitch to the maximum or minimum the engine supports. Values of -24 and +24 must lower and raise pitch by 1 octave respectively. All incrementing or decrementing by 1 must multiply or divide the pitch by the 24th root of 2.

Rate adjustment
The value can range from -10 to +10. A value of zero sets a voice to speak at its default rate. A value of -10 sets a voice to speak at one-third of its default rate. A value of +10 sets a voice to speak at three times its default rate.
Each increment between -10 and +10 is logarithmically distributed such that incrementing or decrementing by 1 is multiplying or dividing the rate by the 10th root of 3 (about 1.1). Values more extreme than -10 and +10 will be passed to an engine. However, SAPI 5-compliant engines may not support such extremes and may clip the rate to the maximum or minimum rate the engine supports.
---

If you are interested, I can mail you my own makevoices.vbs.

Kind regards
Peter Herngaard

----- Original Message -----
From: "Hamish Mackenzie" <hamish_at_mackenzieoz.com>
To: "Rockbox" <rockbox_at_cool.haxx.se>
Sent: Tuesday, October 17, 2006 10:45 PM
Subject: Re: Make voices script

> Hi Peter
>
> Thanks for this info but I have a couple of questions. Firstly, where in
> the script would we place these two lines? And secondly, what are the
> ranges that can be used for these two items where you have put value in
> the lines of code you sent?
>
> Cheers
>
> Hamish
> ----- Original Message -----
> From: "Peter Herngaard" <pethern_at_email.dk>
> To: "Rockbox" <rockbox_at_cool.haxx.se>
> Sent: Tuesday, October 17, 2006 12:17 PM
> Subject: Re: Make voices script
>
>
>> Hello Scott
>>
>> You can adjust the volume and speed properties of the generated voice by
>> inserting the following lines in makevoices.vbs:
>>
>> ospvoice.rate = (value)
>> ospvoice.volume = (value)
>>
>> Please note that not all voices support all properties. But I have
>> Scansoft Solo RealSpeak Nanna from Nextup.com on my computer and it
>> works.
>>
>> For Sapi4 the method is slightly different but similar to that described
>> above.
>> You should download sapi.chm from Microsoft (SAPI5):
>>
>> http://www.microsoft.com/speech/download/old/sapi5.asp
>>
>> Kind regards
>> Peter Herngaard
>>
>> ----- Original Message -----
>> From: "ScottLearned" <SLearned_at_cox.net>
>> To: "Rockbox" <rockbox_at_cool.haxx.se>
>> Sent: Monday, October 16, 2006 10:14 PM
>> Subject: Re: Make voices script
>>
>>
>>>
>>> Hamish Mackenzie wrote:
>>>
>>> Hi All
>>> I have now managed to make voice files using the Australian scansoft TTS
>>> but have one further question. Is there somewhere in the script that
>>> controls the speed and volume of the clips that are made, as I can't seem
>>> to find it? I know there is a place to change the speed in the speech
>>> area of the control panel, and if this in turn changes the speed of the
>>> voice then that sorts out one problem, but where do I adjust the volume?
>>> I know this can be achieved by modifying the script in the other vbs
>>> file that is used to voice folder and file names, but I can't seem to
>>> find it in the script to create the english.voice file.
>>>
>>> thanks as always
>>>
>>> Hamish
>>>
>>> Hi Hamish,
>>> Yes, speed of the voice file is changed in the control panel under
>>> speech. If you want to change the volume, you would change it in the
>>> makevoices.vbs script.
>>>
>>> snip-snip
>>>
>>>
>>>
>>> Scale the input volume by <factor>. The factor 0.6 is recommended. Higher
>>> values make the voice UI louder, lower values make it more quiet. Don't
>>> use values > 1, or the voice will probably get distorted.
>>>
>>> For instance, I changed the volume to 0.7 for my VF.
>>>
>>> MakeVoiceFile "english", "ScanSoft Karen_Full_22kHz", SPSF_22kHz16BitMono,
>>> 190, "-V 5 -B 64 --resample 22 --scale 0.7 --vbr-new -t -S"
>>>
>>> You can find out more about building and tweaking voice files at:
>>>
>>> http://www.rockbox.org/twiki/bin/view/Main/VoiceBuilding
>>>
>>> HtH
>>>
>>> Scott
>>>
>>>
>>
>>
>

Received on 2006-10-18
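
For anyone choosing values for ospvoice.rate and ospvoice.volume, the scales quoted from the SAPI 5.1 documentation in this thread can be worked out numerically. The following is only a sketch of that arithmetic in Python, not part of makevoices.vbs; the function names are made up for illustration, and the result says nothing about how a particular engine clips or rounds the values it is given.

```python
def sapi_rate_multiplier(rate: int) -> float:
    """Speed factor for a SAPI 5 rate value.

    Each step multiplies the rate by the 10th root of 3,
    so +10 is three times and -10 is one third of the default.
    """
    return 3.0 ** (rate / 10.0)


def sapi_pitch_multiplier(pitch: int) -> float:
    """Pitch factor for a SAPI 5 pitch value.

    Each step multiplies the pitch by the 24th root of 2,
    so +24 is one octave up and -24 is one octave down.
    """
    return 2.0 ** (pitch / 24.0)


def sapi_volume_fraction(volume: int) -> float:
    """Linear fraction of the loudest permitted level (0 to 100 scale)."""
    if not 0 <= volume <= 100:
        raise ValueError("SAPI volume is defined from 0 to 100")
    return volume / 100.0


if __name__ == "__main__":
    # Print the speed factor for a few rate values in the documented range.
    for value in (-10, -5, 0, 5, 10):
        print(f"rate {value:+d}: {sapi_rate_multiplier(value):.2f} x default")
```

Note that pitch +10 by this formula comes out slightly above four-thirds; the documentation's "four-thirds" and "three-fourths" figures are rounded.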