dev builds
themes manual
device status forums
mailing lists
IRC bugs
dev guide

Rockbox mail archive

Subject: Re: voice file generation (Re: english.voice using AT&T

Re: voice file generation (Re: english.voice using AT&T

From: ruiner <>
Date: Fri, 2 Apr 2004 22:27:32 -0500

Wow, when I send from the correct e-mail address I'll bet this'll make it to
the list!

>Probably you mean "Resume" being spoken like curriculum vita?

That and it seems AT&T's TTS engine likes "mono" on it's own but "mono
right" and "left" turn into "Moe no"

>No real changes to the language file, it's not getting outdated so quickly.

>What parameters have you used to encode it? I had a look inside, half of
>file seems to be padding zeros. In general, you should use VBR to have more
>efficient coding. And because of a bug in changing from one clip to another
>it is currently better to disable the bit reservoir, Lame
parameter --nores.

>I have made a new program to generate the voice clips, see my tool
>The new one is "lang2wav", which generates the whole bunch of speech clips
>as .wav without a 3rd party program like TextAloud. Input is the .lang
>it uses the default voice as configured in the Control Panel. After running
>it, you can batch-encode (maybe first batch-trim) the lot and run
>to make the final language file. Result should be not larger than 1.5 MB,
>tune the encoder parameters. So the process is pretty simple now, and
>lang2wav is way faster than TextAloud.

Well it was actually about 15 minutes after I discovered your lang2wav
program that I put my first try together. I couldn't seem to get TextAloud
to work, though still thanks to it I have the AT&T voices. I encoded it with
your "make voicefont.cmd" in the same location, so unless it's depreciated
(and it does include vbr and nores) I can't explain why there were odd
results. Also in my more extensive testing (had to get to work the morning
i released!) I've noticed it sounds rather horrible!

>We're still waiting for a volunteer to do something similar under Linux
>e.g. the "Festival" TTS program, hint, wink. This could then be part of the
>automated build.

are the AT&T voices available under linux? after having them installed I'm
really quite spoiled!

Well I've updated the file on my webspace and at the very least it sounds
*much* better than my first attempt. (that url again is )

Received on 2004-04-03

Page template was last modified "Tue Sep 7 00:00:02 2021" The Rockbox Crew -- Privacy Policy