Re: Implementation details of -write-ngrams?

From: Andreas Stolcke <stolcke at ADDRESS HIDDEN>
Date: Sat, 21 Oct 2006 09:38:49 PDT

In message <20061021075440.GA3056 at ADDRESS HIDDENaldomain> you wrote:
> Hello,
>
> Where can I find a detailed description of the algorithm for computing
> n-gram counts (-write-ngrams) in SRILM? Thanks!

The concept of posterior ngram counts is explained in section 3.3.2
of the paper

A. O. Hatch, B. Peskin, and A. Stolcke (2005), Improved Phonetic
Speaker Recognition Using Lattice Decoding, Proc. IEEE ICASSP,
Philadelphia, vol. 1, pp. 169-172.
http://www.speech.sri.com/cgi-bin/run-distill?papers/icassp2005-spkr-phonelats.ps.gz

(where you have to replace "phone" with "word" since the default is to
compute word ngrams).  Note this is not a new concept.

The algorithm is a forward-backward computation with on-the-fly lattice
expansion.  For further details you'll have to read the source code.

Andreas
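
To illustrate the idea of posterior n-gram counts via forward-backward, here is a minimal sketch in Python. It is not SRILM's implementation: the function name, the lattice representation (words on nodes, unnormalized weights on edges, nodes given in topological order) and the toy lattice are all assumptions made for this example. It also leaves out the on-the-fly lattice expansion mentioned above, so it only handles unigrams and bigrams; higher orders would need the lattice expanded so that each node carries its n-gram history.

```python
from collections import defaultdict


def posterior_ngram_counts(nodes, edges, start, end):
    """Expected unigram and bigram counts over all paths of a lattice.

    nodes: dict node_id -> word (None for non-word nodes such as start/end),
           assumed to be listed in topological order.
    edges: dict (from_node, to_node) -> non-negative weight.
    This is a hypothetical helper for illustration, not an SRILM API.
    """
    order = list(nodes)
    succ = defaultdict(list)
    pred = defaultdict(list)
    for (u, v), weight in edges.items():
        succ[u].append((v, weight))
        pred[v].append((u, weight))

    # Forward pass: alpha[n] = total weight of partial paths start -> n.
    alpha = defaultdict(float)
    alpha[start] = 1.0
    for v in order:
        for u, weight in pred[v]:
            alpha[v] += alpha[u] * weight

    # Backward pass: beta[n] = total weight of partial paths n -> end.
    beta = defaultdict(float)
    beta[end] = 1.0
    for u in reversed(order):
        for v, weight in succ[u]:
            beta[u] += weight * beta[v]

    total = alpha[end]  # weight of all complete paths (the normalizer)

    # A unigram's expected count sums the node posteriors for that word.
    unigrams = defaultdict(float)
    for n, word in nodes.items():
        if word is not None:
            unigrams[(word,)] += alpha[n] * beta[n] / total

    # A bigram's expected count sums the edge posteriors for that word pair.
    bigrams = defaultdict(float)
    for (u, v), weight in edges.items():
        if nodes[u] is not None and nodes[v] is not None:
            bigrams[(nodes[u], nodes[v])] += alpha[u] * weight * beta[v] / total

    return dict(unigrams), dict(bigrams)


# Toy lattice with two paths, "a c" (weight 3) and "b c" (weight 1).
nodes = {0: None, 1: "a", 2: "b", 3: "c", 4: None}
edges = {(0, 1): 3.0, (0, 2): 1.0, (1, 3): 1.0, (2, 3): 1.0, (3, 4): 1.0}
unigrams, bigrams = posterior_ngram_counts(nodes, edges, start=0, end=4)
print(unigrams)
print(bigrams)
```

In the toy lattice the path through "a" carries three quarters of the total weight, so "a" and the bigram "a c" each get an expected count of 0.75, while "c" lies on every path and gets 1.0.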
