Rockbox mail archive
Subject: Re: roadmap to blind Rockboxing (need an audio amateur/speaker)
From: Rocker <rocker_at_shaw.ca>
Date: Wed, 25 Feb 2004 21:00:13 -0700
Hey, keep the ideas coming, sir! Someone is bound to get excited. Especially
----- Original Message -----
From: "Touillaud Nicolas" <Nicolas.Touillaud_at_fr.thalesgroup.com>
To: "'Rockbox development'" <rockbox_at_cool.haxx.se>
Sent: Wednesday, February 25, 2004 3:14 AM
Subject: RE: roadmap to blind Rockboxing (need an audio amateur/speaker)
Just a thought...
In the future, will it be possible to add some kind of "folder.mp3" to the
root of a directory ?
The idea is to scan each subdir for such a file and build a byteswapped
indexed file for each dir, a variation of the mechanism used for the menu.
Loading the file while browsing the HD should not be a problem since the
drive has already spun up.
The process can be time-consuming, so a realistic option is a tool, running
on the PC, that does the job on the whole Archos directory.
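The scanning step of such a PC-side tool could look roughly like the sketch below. It only locates the clips; building the per-directory indexed file is left out. The function name `find_folder_clips` and the case-insensitive match are assumptions made for illustration, not anything that exists in Rockbox.

```python
import os

def find_folder_clips(root, clip_name="folder.mp3"):
    """Map each directory under root to the path of its clip, if it has one.

    Hypothetical helper: the name and the case-insensitive match are
    assumptions for this sketch.
    """
    wanted = clip_name.lower()
    found = {}
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if name.lower() == wanted:
                found[dirpath] = os.path.join(dirpath, name)
                break  # one clip per directory is enough
    return found
```

Running it on the mount point of the player's disk would give the list of directories that already have a spoken name, which a later step could turn into index files.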
Another idea (I promise I'll stop after this one) may be to generate the
"folder.mp3" using a PC text-to-speech engine, so the whole process can be
automated (assuming an open source engine exists and people agree that 1:
it is useful and 2: this remains practical considering PC-related problems
such as which OS for the tools, etc.).
I'm sorry if I talk too much on this, but I feel this dev is really a good
From: [IDC]Dragon [mailto:idc-dragon_at_gmx.de]
Sent: Wednesday, February 25, 2004 09:24
To: rockbox_at_cool.haxx.se
Subject: roadmap to blind Rockboxing (need an audio amateur/speaker)
There have been some understandable requests for blindly operating Rockbox.
This is of course useful to handicapped people, but also in a car.
As a developer, I have the following picture in mind:
Speech synthesis is not an easy option, way too complex, since we already
can play mp3 audio. Recently I've added playing from memory, mainly for my
video plugin, but also with an eye on UI support. My suggestion would be to
record little mp3 clips for the language IDs, then have some script tie them all
together to one big (bitswapped) file which includes an index. This could
be a part of the build process. This file has to fit into the mp3 buffer,
about 1.6 MByte on regular 2 MB boxes. Currently we have 335 entries, if
each takes a one second clip that means we can use up to 40 kBit/s. More if we
take into account that not all language IDs are needed. Voice is mono by nature,
or 16 kHz should be sufficient.
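The bit rate budget above can be checked with a few lines. This is only the arithmetic from the paragraph, assuming "1.6 MByte" means binary megabytes and ignoring the space the index itself takes; the function name is made up for the example.

```python
def max_bitrate_kbit(buffer_bytes, entries, clip_seconds=1.0):
    """Average bit rate (kbit/s) at which `entries` clips of
    `clip_seconds` each exactly fill `buffer_bytes`."""
    bytes_per_clip = buffer_bytes / entries
    return bytes_per_clip * 8 / clip_seconds / 1000

# Figures from the mail; binary megabytes are an assumption.
BUFFER_BYTES = 1.6 * 1024 * 1024
ENTRIES = 335

print(round(max_bitrate_kbit(BUFFER_BYTES, ENTRIES), 1))
```

With these inputs the result lands at roughly 40 kbit/s, matching the figure quoted. Fewer entries or shorter clips raise the budget proportionally.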
OK, once the big file is there, it needs to be loaded into the mp3 buffer on
startup and every time you stop your music, because the normal playback
trashes it. Loading this takes less than a second (excluding spinup) with my
recent speedups. We can have the UI talking only if we're not playing (or
recording), this is the limitation.
I would implement the changes to the menu/screen code to play what's under
the cursor, if somebody else does the authoring of the clips. I am no audio
amateur (don't even have something that deserves the name microphone), and
don't want to hear _me_ talking out of your box. This needs some stamina and
reproducibility; some more clips will be needed as Rockbox evolves.
Some help on the scripting would also be appreciated.
Any takers, can we form a team?
We need nicely trimmed clips of what's in "english.lang". I suggest naming
the clips like the LANG_xx ID, e.g. "LANG_DELETE.mp3". This makes it easy
for a script. They should not contain any ID tags, don't waste space on this. I
suggest keeping the original .wav's, so we can re-encode with a sample rate
and bit rate that just fits. The clips should have a little fade in/out, to
avoid clicks, but no silence at the beginning or end, to avoid latencies. Maybe there
is some kind of push-to-talk tool that records and does the trimming and
fading more or less automatically?
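With clips named after their LANG_xx IDs, the script that ties them into one indexed file could be sketched as follows. The file layout here (a big-endian entry count, then an offset/size table, then the clip data) is purely an assumption for illustration, as is reading "bitswapped" as reversing the bit order within each byte. The list of IDs would come from "english.lang"; parsing that file is not shown.

```python
import os
import struct

# Maps every byte value to its bit-reversed counterpart
# (assumed meaning of "bitswapped").
BITSWAP = bytes(int(f"{b:08b}"[::-1], 2) for b in range(256))

def pack_clips(clip_dir, lang_ids, bitswap=True):
    """Build one blob: entry count, (offset, size) table, clip data.

    Hypothetical layout, not an actual Rockbox file format.
    IDs without a clip get an (0, 0) table entry.
    """
    header_size = 4 + 8 * len(lang_ids)
    table = []
    payload = bytearray()
    for lang_id in lang_ids:
        path = os.path.join(clip_dir, lang_id + ".mp3")
        if not os.path.exists(path):
            table.append((0, 0))  # no clip recorded for this ID yet
            continue
        with open(path, "rb") as f:
            data = f.read()
        if bitswap:
            data = data.translate(BITSWAP)
        # Offsets are counted from the start of the blob.
        table.append((header_size + len(payload), len(data)))
        payload += data
    blob = struct.pack(">I", len(lang_ids))
    for offset, size in table:
        blob += struct.pack(">II", offset, size)
    return blob + bytes(payload)
```

A build step could call `pack_clips` with the clip directory and the ordered ID list, write the result to disk, and fail if the blob is larger than the mp3 buffer.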
Plus, this would be a good time to merge the "classic" talkbox patch, if we
want this in the main cvs.
And I have in mind to delay the file opening for recording until the buffer
runs full for the first time. This delays the disk spinup and makes the
internal mic useable for clips that are shorter than the buffer.
(Phew, when will I ever complete all my other stuff: the JPEG viewer, the
all-in-one video converter tool, my car stereo integration, remote control
_______________________________________________
http://cool.haxx.se/mailman/listinfo/rockbox
Received on 2004-02-26