speech recognition - Trying to find better models for CMU Sphinx

I'm writing a program to transcribe audio using CMU Sphinx. I'm not happy with the transcription quality, so I'm looking for a better model, but I don't really understand the differences between the models available. There are the models bundled in the sphinx4-data jar, and there are the ones on this page, https://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/US%20English/, but I can't tell how they differ or which files I should use.

There are three pieces: the acoustic model, the dictionary, and the language model.
I'd like my program to be as general as possible, i.e., to be able to transcribe any speech (English, to start with). What are the best models to use?

Question from: https://stackoverflow.com/questions/65904192/trying-to-find-better-models-for-cmu-sphinx

