We’re using RapidEars/Rejecto/OpenEars to do live speech recognition on prepared text a user is reading. We’ve got it working fine for 4-5 minute long speeches, however we’ve noticed that after some time of speaking (anywhere between 6-15 minutes users have reported) OpenEars stops working and a whole bunch oe_assertions() start to fail like these:
failed oe_assertion `hmm_frame(&hmm_re_combelleautechnologiespromptsmart->hmm_re_combelleautechnologiespromptsmart) >= frame_idx_re_combelleautechnologiespromptsmart’
failed oe_assertion `hmm_frame(&hmm_re_combelleautechnologiespromptsmart->hmm_re_combelleautechnologiespromptsmart) == frame_idx_re_combelleautechnologiespromptsmart’
failed oe_assertion `frame_idx_re_combelleautechnologiespromptsmart == bpe.frame_re_combelleautechnologiespromptsmart’
One thing I suspect might be the culprit is that whenever these assertions start to fail, the hypothesis being delivered to our rapidEars recognition delegate is very long () .
I noticed that it happens more often when there are no pauses in the speech (which I guess explains why the hypothesis is so long).
i.) Any idea what might be causing the assertions to start to fail?
ii.) Are there options on rapidears or openears to tell it to not continue to generate such a long hypothesis in the absence of pauses by the speaker?
]]>