English Speech Files

jaiger-20061231 - Phoneme 8
User: jaiger
Date: 1/2/2007 11:17 am
Views: 3879
Rating: 24

 

Readme

 Speaker Characteristics:

Gender: male;
Age range: adult;
Pronunciation dialect: General American English;

Recording Information:

Microphone: Logitech Precision PC Gaming Headset
Audio Card: intel8x0 chipset, builtin IBM Thinkpad T23
Audio Recording Software: Audacity 1.2.3
O/S: Debian GNU/Linux sarge

File Info:

File type: wav;
Sampling rate: 48kHz;
Sample rate format: 16bit;
Number of channels: 1;

License

 Copyright (C) 2006  Joseph L Phillips

These files are free software; you can redistribute them and/or
modify them under the terms of the GNU General Public License
as published by the Free Software Foundation; either version 2
of the License, or (at your option) any later version.

These files are distributed in the hope that they will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.

Prompts

vf8-01 I do not blame you for anything; remember that
vf8-02 If you mean to insinuate -- Brentwood began hotly
vf8-03 The woman in you is only incidental, accidental, and irrelevant
vf8-04 There was no forecasting this strange girl's processes
vf8-05 But what they want with your toothbrush is more than I can imagine
vf8-06 Give them their choice between a fine or an official whipping
vf8-07 Keep an eye on him
vf8-08 Those are my oysters, he said at last
vf8-09 They are not regular oyster pirates, Nicholas continued
vf8-10 One by one the boys were captured
vf8-11 The weeks had gone by, and no overt acts had been attempted
vf8-12 Here, in the midmorning, the first casualty occurred
vf8-13 They were deep in the primeval forest
vf8-14 He had been foiled in his attempt to escape
vf8-15 And twenty men could hold it with spears and arrows
vf8-16 Bassett was a fastidious man
vf8-17 There's a big English general right now whose name is Roberts
vf8-18 This tacit promise of continued acquaintance gave Saxon a little joy-thrill
vf8-19 I tell you I am disgusted with this adventure tomfoolery and rot
vf8-20 From my earliest recollection my sleep was a period of terror
vf8-21 But all my dreams violated this law
vf8-22 It is very plausible to such people, a most convincing hypothesis
vf8-23 But they make the mistake of ignoring their own duality
vf8-24 I graduated last of my class
vf8-25 They had no fixed values, to be altered by adjectives and adverbs
vf8-26 He was pressing beyond the limits of his vocabulary
vf8-27 Very early in my life, I separated from my mother
vf8-28 His infernal chattering worries me even now as I think of it
vf8-29 White Leghorns, said Mrs Mortimer
vf8-30 Massage under tension, was the cryptic reply
vf8-31 Therefore, hurrah for the game
vf8-32 It lived in perpetual apprehension of that quarter of the compass
vf8-33 Broken-Tooth yelled with fright and pain
vf8-34 Thus was momentum gained in the Younger World
vf8-35 Saxon waited, for she knew a fresh idea had struck Billy
vf8-36 We had been chased by them ourselves, more than once
vf8-37 He was a wise hyena
vf8-38 Production is doubling and quadrupling upon itself
vf8-39 And the Edinburgh Evening News says, with editorial gloom
vf8-40 With my strength I slammed it full into Red-Eye's face

--- (Edited on 1/ 2/2007 12:17:58 [GMT-0500] by jaiger) ---

jaiger-20061231-vf8.tar.gz

Notice: many prompts in "English Speech Files" were adapted from the prompt files contained in the CMU_ARCTIC speech synthesis database, which were in turn derived from out-of-copyright texts from Project Gutenberg, by the FestVox project at the Language Technologies Institute at Carnegie Mellon University.

Re: jaiger-20061231 - Phoneme 8
User: kmaclean
Date: 1/2/2007 9:16 pm
Views: 171
Rating: 23

Hi Joe,

Thanks again for the submissions. 

It may be a little while before I can get to your audio. I'm currently working on automating the Acoustic Model build scripts so that the models are rebuilt daily - basically, you submit your audio, it gets approved, and then it gets merged into the VoxForge Acoustic Model nightly.

I've got a beta of the Acoustic Model build portion running nightly now. I'm currently working on automating two things: the validation of user submissions (ClamAV check, validation of prompts against the dictionary, and validation of audio against prompts), and the commits to the Subversion Trunk repository (which is basically finished - it just needs to run nightly).
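As an illustration only, the "prompts validation against dictionary" step could be sketched along the following lines. This is not the actual VoxForge script: the function name `missing_words`, the assumption that each prompt line is an ID followed by the text, and the assumption that the dictionary is a plain collection of upper-case words are all hypothetical. Hyphenated words are split into their parts here, which may or may not match how the real dictionary treats them.

```python
import re

# Letters and apostrophes count as word characters; everything else
# (hyphens, commas, dashes) separates words. This is an assumption.
WORD_RE = re.compile(r"[A-Za-z']+")


def missing_words(prompt_lines, dictionary_words):
    """Map each prompt ID to the words not found in the dictionary.

    prompt_lines: iterable of lines such as "vf8-07 Keep an eye on him".
    dictionary_words: iterable of words (compared case-insensitively).
    Prompts whose words are all known are left out of the result.
    """
    known = {w.upper() for w in dictionary_words}
    missing = {}
    for line in prompt_lines:
        parts = line.split(None, 1)
        if len(parts) < 2:
            continue  # skip blank lines or lines with no prompt text
        prompt_id, text = parts
        unknown = [w.upper() for w in WORD_RE.findall(text)
                   if w.upper() not in known]
        if unknown:
            missing[prompt_id] = unknown
    return missing
```

A submission would pass this check when the returned dictionary is empty; otherwise the listed words would need to be added to the pronunciation dictionary or the prompt corrected.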

all the best,

Ken 

 

--- (Edited on 1/ 2/2007 10:16 pm [GMT-0500] by kmaclean) ---



Re: nightly jobs and suggestions
User: jaiger
Date: 1/3/2007 10:12 am
Views: 256
Rating: 19

Ken,

I'm in no hurry.  I'll submit whatever I can whenever I can and you update whenever you can.

I like automation.

As part of your nightly job, perhaps you can also collect metrics on the archive such as:

- total audio data, in seconds and MB

- the total compared with some goal, e.g. "we're at 10% of our 100-hour goal"

- similar audio submission metrics by user, e.g. "jaiger submitted 1 hour of audio, or 1% of the 100-hour goal"

Collecting and publishing the metrics might spur submissions for those of us with competitive personalities and at least show us where we are relative to our project goals.
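A minimal sketch of how such metrics might be collected, assuming the archive is a directory tree of uncompressed WAV files. The function names, the 100-hour default (taken from the example above), and the convention that each submission directory is named like "jaiger-20061231-vf8" with the user name before the first hyphen are assumptions for illustration, not a description of the real archive layout.

```python
import os
import wave

GOAL_HOURS = 100  # hypothetical goal, borrowed from the example in the post


def wav_stats(path):
    """Return (duration_in_seconds, size_in_bytes) for one WAV file."""
    with wave.open(path, "rb") as w:
        seconds = w.getnframes() / float(w.getframerate())
    return seconds, os.path.getsize(path)


def archive_metrics(root, goal_hours=GOAL_HOURS):
    """Total seconds, megabytes and percent of goal for all WAVs under root."""
    total_seconds = 0.0
    total_bytes = 0
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            if name.lower().endswith(".wav"):
                seconds, size = wav_stats(os.path.join(dirpath, name))
                total_seconds += seconds
                total_bytes += size
    return {
        "seconds": total_seconds,
        "megabytes": total_bytes / (1024.0 * 1024.0),
        "percent_of_goal": 100.0 * total_seconds / (goal_hours * 3600.0),
    }


def seconds_by_user(root):
    """Total seconds per user, assuming top-level dirs named 'user-date-...'."""
    totals = {}
    for dirpath, _dirs, files in os.walk(root):
        rel = os.path.relpath(dirpath, root)
        if rel == os.curdir:
            continue  # files directly under root belong to no submission
        user = rel.split(os.sep)[0].split("-")[0]
        for name in files:
            if name.lower().endswith(".wav"):
                seconds, _size = wav_stats(os.path.join(dirpath, name))
                totals[user] = totals.get(user, 0.0) + seconds
    return totals
```

A nightly job could call `archive_metrics("/path/to/archive")` and publish the resulting numbers; the path is a placeholder.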

To make future programming easier, you might also (at submission time) create an XML file containing the License, prompts and README data.  The XML file could also hold other data, such as the calculated time metrics above, or MD5 hashes of the audio files so that a downloaded file can be checked for corruption.  This would make it easier for future scripts to manipulate the data - say for import into a DB or for other queries.
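A rough sketch of what generating such an XML file might look like. The element and attribute names (`submission`, `license`, `readme`, `prompts`, `files`, `md5`) and the function names are made up for illustration; no such schema exists in the project as far as this thread shows.

```python
import hashlib
import os
import xml.etree.ElementTree as ET


def md5_of_file(path, chunk_size=65536):
    """Return the hex MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(submission_dir, license_text, readme_text, prompts):
    """Build an XML string describing one submission directory.

    prompts: iterable of (prompt_id, text) pairs.
    Every .wav file directly inside submission_dir gets a <file>
    element carrying its name and MD5 hash.
    """
    name = os.path.basename(os.path.normpath(submission_dir))
    root = ET.Element("submission", name=name)
    ET.SubElement(root, "license").text = license_text
    ET.SubElement(root, "readme").text = readme_text

    prompts_el = ET.SubElement(root, "prompts")
    for prompt_id, text in prompts:
        ET.SubElement(prompts_el, "prompt", id=prompt_id).text = text

    files_el = ET.SubElement(root, "files")
    for filename in sorted(os.listdir(submission_dir)):
        path = os.path.join(submission_dir, filename)
        if os.path.isfile(path) and filename.lower().endswith(".wav"):
            ET.SubElement(files_el, "file", name=filename,
                          md5=md5_of_file(path))

    return ET.tostring(root, encoding="unicode")
```

Someone who downloads the archive could recompute each file's MD5 and compare it with the value in the manifest to detect a corrupt download.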

 

-joe 

--- (Edited on 1/ 3/2007 11:12:21 [GMT-0500] by jaiger) ---



Re: nightly jobs and suggestions
User: kmaclean
Date: 1/3/2007 12:07 pm
Views: 493
Rating: 7

Hi Joe,

Thanks for the feedback - added ticket #127 to track your suggestions.

I agree, the site needs clearly stated goals and the ability to show progress towards those goals.

Ken 

--- (Edited on 1/ 3/2007 1:07 pm [GMT-0500] by kmaclean) ---


