knowledge-database (beta)

Current group: comp.speech.users

HTK - FOM Calculation for word spotting performance

From:Holger
Subject:HTK - FOM Calculation for word spotting performance
Date:13 Dec 2004 08:49:31 -0800
Hi all,
I am trying to find out how the FOM (figure of merit) reported by the
HResults tool of HTK is calculated.
The FOM is calculated as follows, where it is assumed that the total
duration of the test speech is T hours. For each word, all of the spots
are ranked in score order. The percentage of true hits p_i found before
the i'th false alarm is then calculated for i = 1 ... N+1, where N is the
first integer >= 10T - 0.5.
The figure of merit is then defined as

FOM = 1/(10*T) * ( p_1 + p_2 + ... + p_N + a*p_(N+1) )

where a = 10T - N is a factor that interpolates to 10 false alarms per
hour.

The problem I have is shown by the following example:
My recorded speech signal (60 minutes long) contains 12 keyword
occurrences (ground truth).
The spotter finds 10 of these words, but also produces 4 false alarms.

How do I calculate the FOM then? The problem is that there is no 5th
false alarm, so for i > 4 I cannot calculate p_i anymore.
I hope somebody can help.

Thanks a lot.
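
A minimal sketch of the calculation described above, in Python, for readers who want to try the numbers. It assumes N is the smallest integer >= 10T - 0.5 (the form I believe the HTK Book uses). For the case raised in the question, where the spotter produces fewer than N+1 false alarms, it assumes the missing p_i are set to the final fraction of true hits found. That padding rule is an assumption for illustration only and has not been checked against the HResults source. The function name, argument names, and the ranking used in the example are all hypothetical.

```python
import math


def figure_of_merit(spots, n_true, hours):
    """FOM for one keyword (or a pooled set of keywords).

    spots  : list of (score, is_hit) pairs, one per putative spot
    n_true : number of reference (ground-truth) occurrences
    hours  : total duration T of the test speech, in hours
    """
    n = math.ceil(10 * hours - 0.5)   # N: smallest integer >= 10T - 0.5
    a = 10 * hours - n                # interpolation weight for p_(N+1)

    hits = 0
    p = []                            # p[i-1] = fraction of true hits before the i'th false alarm
    for _score, is_hit in sorted(spots, key=lambda s: s[0], reverse=True):
        if is_hit:
            hits += 1
        else:
            p.append(hits / n_true)
            if len(p) == n + 1:       # p_1 .. p_(N+1) are all that is needed
                break

    # ASSUMPTION: if the spotter ran out of false alarms, every remaining
    # p_i is taken to be the final hit fraction.
    p.extend([hits / n_true] * (n + 1 - len(p)))

    return (sum(p[:n]) + a * p[n]) / (10 * hours)


# Hypothetical ranking for the example in the post: 10 hits (H) and
# 4 false alarms (F) in descending score order, 12 true occurrences, T = 1 h.
order = "HHHFHHFHHHFHHF"
spots = [(1.0 - 0.01 * rank, ch == "H") for rank, ch in enumerate(order)]
print(round(100 * figure_of_merit(spots, n_true=12, hours=1.0), 2))
```

With this made-up ranking, p_1..p_4 are 3/12, 5/12, 8/12 and 10/12, the padded p_5..p_10 are all 10/12, and the script prints 71.67. The actual value depends on where the false alarms fall in the score ranking, so the counts alone (10 hits, 4 false alarms) do not determine the FOM.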
From:James Salsman
Subject:Re: HTK - FOM Calculation for word spotting performance
Date:Tue, 14 Dec 2004 05:21:56 GMT
advances with great rapidity and threatens freedom at
many different points at the same time (crowding, rules and
regulations, increasing dependence of individuals on large
organizations, propaganda and other psychological techniques, genetic
engineering, invasion of privacy through surveillance devices and
computers, etc.). To hold back any ONE of the threats to freedom would
require a long and difficult social struggle. Those who want to protect
freedom are overwhelmed by the sheer number of new attacks and the
rapidity with which they develop, hence they become apathetic and no
longer resist. To fight each of the threats separately would be
futile. Success can be hoped for only by fighting the technological
system as a whole; but that is revolution, not reform.

131. Technicians (we use this term in its broad sense to describe all
those who perform a specialized task that requires training) tend to
be so involved in their work (their surrogate activity) that when a
conflict arises between their technical work and freedom, they almost
always decide in favor of their technical work. This is obvious in the
case of scientists, but it also appears elsewhere: Educators,
humanitarian groups, conservation organizations do not hesitate to use
propaganda or other psychological techniques to help them achieve
their laudable ends. Corporations and government agencies, when they
find it useful, do not hesitate to collect information about
individuals without regard to their privacy. Law enforcement agencies
are frequently inconvenienced by the constitutional rights of suspects
and often of completely innocent persons, and they do whatever they
can do legally (or sometimes illegally) to restrict or circumvent
those rights. Most of these educators, government officials and law
officers believe in freedom, privacy and constitutional rights, but
when these conflict with their work, they usually feel tha
From:James Salsman
Subject:Re: HTK - FOM Calculation for word spotting performance
Date:Tue, 14 Dec 2004 03:35:05 GMT
Holger,

You want to ask your question on the comp.speech.research
newsgroup, where I'm crossposting it for you, or better yet
on the HTK-Users email list:
http://htk.eng.cam.ac.uk/mailing/subscribe_mail.shtml

Sincerely,
James


Copyright © 2006 knowledge-database   -   All rights reserved