release
dev builds
extras
themes manual
wiki
device status forums
mailing lists
IRC bugs
patches
dev guide



Search | Go
Wiki > Main > SummerOfCode > SummerOfCode2007 > TextToSpeech

Text to Speech (TTS) on Rockbox

Text to Speech (TTS) on Rockbox

This project is a part of Google Summer of Code 2007. It is initiated by MarcoenHirschberg and taken on by OzgurOzturk.

We considered porting one of two open source TTS engines to Rockbox. One is eSpeak, a formant-based extremely light TTS engine producing less-natural/more-robotic sounding speech. Other is Flite, a port of Festival for mobile and embedded systems. It has a larger footprint, around 2.5 MB, since it uses prerecorded diphones to synthesize the speech from text, but the output sounds much more human-like.

When it works, you will be able to load simple text files of hundreds of books (for example from Project Gutenberg) to only a few megabytes of your player and listen to them.

Planned for this Summer

  1. Successful port of Flite as a plug-in with the ability to synthesize simple text files
  2. Including Flite to the core
    Making option available to make Flite read file and directory names and ID3 tags

Suggestions for Later (Please add your suggestions)

  1. Adding filters for PDF HTML and other formats so that Flite can read them (maybe in collaboration with proposed DocumentViewer project)

The Problems

The voice files are included into the program as data structures, this requires large memory when compiling Flite and hence will cause the same for Rockbox when Flite is incorporated into the core. (We might decide to change this and have seperate voice files. TBD)

Progress

I have recently started working on the plug-in. Have successfully compiled Flite on my new set-up and my first (HelloWorld) plug-in.

I have received a few e-mails asking about progress. Still I couldn't finish the plug-in yet. I am sincerely sorry for that. I am still excited to work in this project, and I want to continue contributing to Rockbox even after Google Summer of Code ends.

The problem is, I am not very experience in Linux programming, so progress was slower and especially in the last two weeks I had to concentrate on other stuff. I have my dissertation defense this Thursday (August 9th), and there are so much paperwork etc needs to be done. Also I am moving to Stanford for a postdoctoral researcher position, just two days after my dissertation, this Saturday, so I was busy with packing up and eliminating junk, arranging new home and car, etc. etc. etc.

But be assured that, even after Google Summer of Code finishes, I want to continue on this project. Though, I still hope there will be something working before GSoC finishes. Spoken language systems is one of my interest areas, and I want to increase my strength in that field, so I want to even contibute to FLITE itself in future... I also liked the Rockbox firmware a lot and I became a user and proselitizer of it and want to contribute to it as well.

Thanks for your patience -- OzgurOzturk - 06 Aug 2007
r6 - 27 Sep 2007 - 00:16:30 - NicolasPennequin
Copyright by the contributing authors.