Prompts transcriptions - voxforge.org

French

Flat

Prompts transcriptions

User: samuel buffet
Date: 9/10/2008 2:13 pm

Views: 15500
Rating: 17

Hi Daniël and Ken,

I start another thread for that :

>When you have a bigger amount of prompts you have to take the monophones and triphones (explained here: http://www.voxforge.org/home/docs/faq/faq/what-is-the-different-between-a-monophone-and-a-triphone) into account.

So I imagine next step is to do the transcription of the prompts into phonemes and triphones, right ?

That's a huge work but I know a GPL Phonetizer for French, is there any pb to use it to start the job ?

Phonemes to triphones will be easy If I have understood correctly what you want.

What phonetic alphabet do you need ?

Samuel,

Re: Prompts transcriptions

User: dano
Date: 9/11/2008 2:59 am

Views: 159
Rating: 18

Hi Samuel,

I meant that your prompts must have a good coverage for the often used phonemes in your language.

Although you can make a pronunciation dictionary if there is no good one for your language now.

Daniël

Re: Prompts transcriptions

User: samuel buffet
Date: 9/11/2008 3:05 pm

Views: 159
Rating: 19

Evening Daniël,

Okay I get you point about the coverage issue .

So actually I've done a quick transcription of prompts with espeak.

The transcription is not that bad, but I need to do some manual modifications.

Btw, without those modifications I have those stats for the first set of prompts I have pushed in the Listen section.

39 phonemes generated by espeak qty from 1 to 3290

6883 triphones qty from 1 to 240

But I think it's not correct because right now pause is considered as phoneme.

Do we have to consider pauses as a phoneme to built triphones or not ?

Thanks,

Samuel,

Re: Prompts transcriptions

User: dano
Date: 9/11/2008 3:24 pm

Views: 366
Rating: 19

It should.

look at the translate example

t+r t-r+@ r-@+n @-n+s n-s+l s-l+e l-e+t e-t

you see, tr and tr@ are different triphones, you can see it as <pause>tr , tr@ etc.

Daniël

Re: Prompts transcriptions

User: samuel buffet
Date: 9/12/2008 1:13 am

Views: 239
Rating: 15

Hi there,

So to be perfectl sure I've understood, let's take an example with a pause between 2 words :

sentense : Alors, quand tout semble perdu, un homme se dresse.

phonemes : a l O R _ k a~ t u s a~ b l p E R d y _ 9~ n2 O m s @ d R E s

triphones : a+l a-l+O l-O+R O-R+_ R-_+k _-k+a~ k-a~+t a~-t+u t-u+s u-s+a~ s-a~+b a~-b+l b-l+p l-p+E p-E+R E-R+d R-d+y d-y+_ y-_+9~ _-9~+n2 9~-n2+O n2-O+m O-m+s m-s+@ s-@+d @-d+R d-R+E R-E+s E-s

Is that good ?

Samuel,

Re: Prompts transcriptions

User: kmaclean
Date: 9/12/2008 12:48 pm

Views: 186
Rating: 16

HI Samuel & Daniël,

>Do we have to consider pauses as a phoneme to built triphones or not ?

I'm not sure I understand why you are trying to create your own triphones... Triphones are generated automatically from your corpus using the acoustic model creation tools - see Step 9 of the VoxForge tutorial.

What you need is a pronunciation dictionary (see LIUM Tools - you need to check licensing though), some French prompt recordings (the French Speech Submission app is live - so you can submit speech that way, or record your own...), and just follow the VoxForge tutorial to create your own monophone acoustic model (Step 1 to Step 8) for HTK/Julius.

In order to create an HTK/Julius triphone-based acoustic model, you'll need a tree.hed script. I've never created one. For more information on how to create a tree.hed file for a new language, see the following links:

nsh's overview of how to create clustered triphone "questions" for Sphinx and HTK for new languages
Ticket #153 - htk error on step 10, and a related thread in the forums
my post on this Thread on "Error when compiling model" where I discuss Creating clustered triphone "questions"


Username	Password