Rockbox.org home
release
dev builds
extras
themes manual
wiki
device status forums
mailing lists
IRC bugs
patches
dev guide
translations



Rockbox mail archive

Subject: Re: Make voices script

Re: Make voices script

From: Peter Herngaard <pethern_at_email.dk>
Date: Wed, 18 Oct 2006 00:43:11 +0200

Hello Hamish

Sorry, I forgot to explain that you have to place the two lines within the
subroutine MakeVoiceFile.
As to the possible values assigned to tate and volume refer to the SAPI5
documentation (sapi.chm).

---
TTS Engine Characteristics
The Microsoft.com Speech website
Microsoft Speech SDK
SAPI 5.1
TTS Engine Characteristics
Engines use the three characteristics of Volume, Pitch, and Rate to 
partially define speech traits. At the application level, setting these 
values is simple;
you need only set them to a given number. However, implementation of these 
traits is more complex for the engine.
Volume
At the application level, volume is a number from zero to 100 where 100 is 
the maximum value for a voice. It is a linear progression and a value of 50 
represents
half of the loudest permitted. The increments should be the range divided by 
100.
Pitch adjustment
The value can range from -10 to +10. A value of zero sets a voice to speak 
at its default pitch. A value of -10 sets a voice to speak at three-fourths 
of
its default pitch. A value of +10 sets a voice to speak at four-thirds of 
its default pitch. Each increment between -10 and +10 is logarithmically 
distributed
such that incrementing or decrementing by 1 is multiplying or dividing the 
pitch by the 24th root of 2 (about 1.03). Values outside of the -10 and +10
range will be passed to an engine. However, SAPI 5-compliant engines may not 
support such extremes and may clip the pitch to the maximum or minimum the
engine supports. Values of -24 and +24 must lower and raise pitch by 1 
octave respectively. All incrementing or decrementing by 1 must multiply or 
divide
the pitch by the 24th root of 2.
Rate adjustment
The value can range from -10 to +10. A value of zero sets a voice to speak 
at its default rate. A value of -10 sets a voice to speak at one-third of 
its
default rate. A value of +10 sets a voice to speak at three times its 
default rate. Each increment between -10 and +10 is logarithmically 
distributed such
that incrementing or decrementing by 1 is multiplying or dividing the rate 
by the 10th root of 3 (about 1.1). Values more extreme than -10 and +10 will
be passed to an engine. However, SAPI 5-compliant engines may not support 
such extremes and may clip the rate to the maximum or minimum rate the 
engine
supports.
---
If you are interested, I can mail you my own makevoices.vbs.
Kind regards
Peter Herngaard
----- Original Message ----- 
From: "Hamish Mackenzie" <hamish_at_mackenzieoz.com>
To: "Rockbox" <rockbox_at_cool.haxx.se>
Sent: Tuesday, October 17, 2006 10:45 PM
Subject: Re: Make voices script
> Hi Peter
>
> Thanks for this info but I have a couple of questions.  firstly where in 
> the script would be place these two lines? and secondly what are the 
> ranges that can be used for these two items where you have put value in 
> the lines of code you sent.
>
> Cheers
>
> Hamish
> ----- Original Message ----- 
> From: "Peter Herngaard" <pethern_at_email.dk>
> To: "Rockbox" <rockbox_at_cool.haxx.se>
> Sent: Tuesday, October 17, 2006 12:17 PM
> Subject: Re: Make voices script
>
>
>> Hello Scott
>>
>> You can adjust the volume and speed properties of the generated voice by 
>> inserting the following lines in makevoices.vbs:
>>
>> ospvoice.rate = (value))
>> ospvoice.volume = (value)
>>
>> Please note that not all voices support all properties. But I have 
>> Scansoft Solo RealSpeak Nanna from Nextup.com on my computer and it 
>> works.
>>
>> For Sapi4 the method is slightly different but similar to that described 
>> above.
>> You should download sapi.chm from Microsoft (SAPI5):
>>
>> http://www.microsoft.com/speech/download/old/sapi5.asp
>>
>> Kind regards
>> Peter Herngaard
>>
>> ----- Original Message ----- 
>> From: "ScottLearned" <SLearned_at_cox.net>
>> To: "Rockbox" <rockbox_at_cool.haxx.se>
>> Sent: Monday, October 16, 2006 10:14 PM
>> Subject: Re: Make voices script
>>
>>
>>>
>>> Hamish Mackenzie wrote:
>>>
>>> Hi All
>>> I have now managed to make voice files using the Australian scansoft TTS 
>>> but
>>> have one further question.  Is there somewhere in the script that 
>>> controls
>>> the speed and volume of the clips that are made as I cant seem to find 
>>> it.
>>> I know there is a place to change the speed in the speech area of the
>>> control panel and if this in turn changes the speed of the voice then 
>>> that
>>> sorts out one problem but where do I adjust the volume?  I know this can 
>>> be
>>> achieved by modifying the script in the other vbs file that is used to 
>>> voice
>>> folder and file names but I cant seem to find it in the script to create 
>>> the
>>> english.voice file.
>>>
>>> thanks as always
>>>
>>> Hamish
>>>
>>> Hi Hamish,
>>> Yes, speed of the voice file is changed in the control panel under 
>>> speech.
>>> If you want to change the volume, you would change it in the 
>>> makevoices.vbs
>>> script.
>>>
>>> snip-snip
>>>
>>>
>>>
>>> Scale the input volume by <factor. The factor 0.6 is recommended. Higher
>>> values make the voice UI louder, lower values make it more quiet. Don't 
>>> use
>>> values1, or the voice will probably get distorted.
>>>
>>> For instance, I changed the  volume to 07 for my VF.
>>>
>>> MakeVoiceFile "english", "ScanSoft Karen_Full_22kHz", 
>>> SPSF_22kHz16BitMono,
>>> 190, "-V 5 -B 64 --resample 22 --scale 0.7 --vbr-new -t -S"
>>>
>>> You can find out more about building and tweaking voice files at:
>>>
>>> http://www.rockbox.org/twiki/bin/view/Main/VoiceBuilding
>>>
>>> HtH
>>>
>>> Scott
>>>
>>>
>>
>>
>
> 
Received on 2006-10-18

Page template was last modified "Tue Sep 7 00:00:02 2021" The Rockbox Crew -- Privacy Policy