- This topic has 7 replies, 3 voices, and was last updated 10 years, 9 months ago by Halle Winkler.
-
AuthorPosts
-
June 28, 2013 at 7:10 pm #1017568doogie001Participant
I have been developing with the openears framework and I need to be able to “parrot back” or re-itereate what the user said when openears was listening.
Looking through the code, I am not sure how to accomplish this. Any help is much appreciated.
Thank You
June 28, 2013 at 7:22 pm #1017570Halle WinklerPolitepixHi doogie,
Which part of the process are you having trouble with?
June 28, 2013 at 7:56 pm #1017571doogie001ParticipantAfter the user speaks and openears processes, I want to be able to get at what the user has said. This will be like siri in the repect that it writes out or speaks exactly what the user said before it continues to process the users request. It seems though that looking at the code and several logs produced that it only will understand what you have placed in the Language model generator.
So unless I have the “phrase” or words that the user will speak, I don’t think I can get at all that the user has said. This may be the “nature of the beast” for offline speech recognition but I just want to make sure this is the case.
For example: I want to say “This is a test can you understand what I am saying”. But my Language model generator is initialized with the words @”cat”, @”mouse”. @”dog”, I don’t think I’ll be able to get at what the user intially said. Is this a correct assesment or am I missing someting here.
In my testing I was looking at the coentents of words from the method – (void) rapidEarsDidDetectFinishedSpeechAsWordArray:(NSArray *)words andScoreArray:(NSArray *)scores {
}
Thanks for your helpJune 28, 2013 at 8:14 pm #1017572Halle WinklerPolitepixHi doogie,
That’s correct, it’s always necessary to create a language model or grammar containing the words that can be recognized. The interesting thing about it is that this isn’t actually a property of offline recognition — even Google Voice Search and Siri have to use pre-defined language models and grammar.
The difference is just that they are being run on enormous server farms and then the models are shared across many user sessions simultaneously, so it is possible for their language and acoustic models to be so large and exist in so much memory that they can create the illusion of detecting “anything”, even though at some point in time a (very big) language model like the output of LanguageModelGenerator was made. Since we’re just running on a phone, which is like a sliver of the available memory and cpu power of just a single server, we have to be very frugal and efficient with what is possible to recognize so it has to be constrained vocabularies which are in some way specific to the task at hand.
Luckily you can swap between vocabularies very quickly with OpenEars, or even generate them dynamically using LanguageModelGenerator based on the needs of the moment, so the usual approach for offline recognition is to have vocabularies which change based on the mode of the app.
June 28, 2013 at 8:20 pm #1017573doogie001ParticipantThank you for your quick and detailed reply.
June 28, 2013 at 8:41 pm #1017574Halle WinklerPolitepixNo problem! Feel free to follow up if you have questions about the specifics of setting up vocabularies or switching between them.
July 12, 2013 at 1:25 pm #1017651RubyParticipantHi i am using openears framework.i have downloaded the openears link there i came to know how to use openear framework.my problem is when i am in same viewcontroller it is recognise my voice and giving response but when i change my view i mean i go to previous view and again came back to same view then the application is crashing and reason for crash is please turn on [OpenEarsLogging startOpenEarsLogging] plese say me the solution for this
Thank you
July 12, 2013 at 1:32 pm #1017652Halle WinklerPolitepixWelcome Ruby,
Can you read the instructions in this post and show me the logging output that you get as a result?
https://www.politepix.com/forums/topic/install-issues-and-their-solutions/
-
AuthorPosts
- You must be logged in to reply to this topic.