Forum Replies Created
-
AuthorPosts
-
billyloParticipant
Let me give it a shot after the X’mas break. We have chosen to disable voice recognition function for now until we have a better user experience. Thanks.
billyloParticipantHow’s your next update progressing? Could really use your guidance here. Some users continue to struggle with road noise. Billy.
billyloParticipantHi Halle, any luck with listening to/testing these audio files and offer some guidance on next steps? thx,
billyloParticipantAny luck with tracking this one down, Halle?
billyloParticipantYou are quick!
I should have added some more context to my question. In general, openears works really well even with road noises. This specific production scenario has puzzled me for weeks.
1. An example successful recognition (with significant road noise, iPhone6 plus). This consistently works.
2. An unsuccessful one (not sure what caused the issue. This user has no luck when she is on the road, iPhone6). This consistently has trouble.If you can shed some light on this, it would be great.
thx,
billyloParticipantI see. Does [stop] actually stop it? I am thinking of adding a support feature to my app so that they can upload sample audio to the cloud if my app does not behave correctly.
billyloParticipantGreat. I did have trouble getting the wavWasSavedAtLocation delegate method triggered. SaveThatWave worked, just the delegate method not triggered somehow. Could use your guidance there. Billy.
- (void) wavWasSavedAtLocation:(NSString *)location { NSLog(@"WAV was saved at the path %@", location); }
billyloParticipantHi, Here is a minimal test case that demonstrates my two issues (duplicated calls to delegate methods and zero scores). It’s basically the sampleApp modified to support RapidEars and these three lines added to initialization.
[[OEPocketsphinxController sharedInstance] setFinalizeHypothesis:NO]; [[OEPocketsphinxController sharedInstance] setReturnSegments:YES]; [[OEPocketsphinxController sharedInstance] setReturnSegmentTimes:YES];
and I used this to capture the score and start/end times.
- (void) rapidEarsDidDetectLiveSpeechAsWordArray: (NSArray *) words scoreArray: (NSArray *) scores startTimeArray: (NSArray *) startTimes endTimeArray: (NSArray *) endTimes { NSNumber *scoreNumber, *startTime, *endTime; for (int i = 0; i< words.count; i++) { scoreNumber = scores[i]; startTime = startTimes[i]; endTime = endTimes[i]; NSLog(@"word = %@, score = %.1f, startTime = %.1f, endTime = %.1f", words[i], [scoreNumber floatValue], [startTime floatValue], [endTime floatValue]); } //output score }
The console output looks like this:
2015-08-31 19:26:27.019 OpenEarsSampleApp[681:135806] Local callback: Pocketsphinx has detected speech. 2015-08-31 19:26:29.141 OpenEarsSampleApp[681:135806] word = LEFT, score = 0.0, startTime = 1.9, endTime = 2.2 2015-08-31 19:26:29.268 OpenEarsSampleApp[681:135806] word = LEFT, score = 0.0, startTime = 1.9, endTime = 2.3 2015-08-31 19:26:29.404 OpenEarsSampleApp[681:135806] word = LEFT, score = 0.0, startTime = 1.9, endTime = 2.3 2015-08-31 19:26:29.529 OpenEarsSampleApp[681:135806] word = LEFT, score = 0.0, startTime = 1.9, endTime = 2.3 2015-08-31 19:26:29.657 OpenEarsSampleApp[681:135806] word = LEFT, score = 0.0, startTime = 1.9, endTime = 2.3 2015-08-31 19:26:29.785 OpenEarsSampleApp[681:135806] Local callback: Pocketsphinx has detected a second of silence, concluding an utterance. 2015-08-31 19:26:30.296 OpenEarsSampleApp[681:135806] Local callback: Pocketsphinx has detected speech. 2015-08-31 19:26:30.432 OpenEarsSampleApp[681:135806] word = RIGHT, score = 0.0, startTime = 0.0, endTime = 0.2 2015-08-31 19:26:30.552 OpenEarsSampleApp[681:135806] word = RIGHT, score = 0.0, startTime = 0.0, endTime = 0.2 2015-08-31 19:26:30.684 OpenEarsSampleApp[681:135806] word = RIGHT, score = 0.0, startTime = 0.0, endTime = 0.3 2015-08-31 19:26:30.814 OpenEarsSampleApp[681:135806] word = RIGHT, score = 0.0, startTime = 0.0, endTime = 0.2 2015-08-31 19:26:30.936 OpenEarsSampleApp[681:135806] word = RIGHT, score = 0.0, startTime = 0.0, endTime = 0.2 2015-08-31 19:26:31.066 OpenEarsSampleApp[681:135806] word = RIGHT, score = 0.0, startTime = 0.0, endTime = 0.2 2015-08-31 19:26:31.197 OpenEarsSampleApp[681:135806] word = RIGHT, score = 0.0, startTime = 0.0, endTime = 0.2 2015-08-31 19:
thanks.
billyloParticipantyes. the hypothesis are the correct ones.
—
While I have you, I am getting a lot of duplicated calls to the rapidEars delegate method for the same audio trigger (just one word). Sometimes, the delegate method is called 20+ times. I have set
[[OEPocketsphinxController sharedInstance] setReturnDuplicatePartials:NO];
In fact, after a while, RapidEars would stop picking up words. (this is probably a different thread.)
billyloParticipantNSNumber *scoreNumber = scores[i]; int score = [scoreNumber intValue];
I also used the debugger to view the scoreArray and it shows an array of zeroes.
any hints?
-
AuthorPosts