Tagged: delay stopping recognition loop
- This topic has 17 replies, 2 voices, and was last updated 9 years, 11 months ago by Halle Winkler.
-
AuthorPosts
-
May 6, 2014 at 5:32 pm #1021128HarzParticipant
I’m developing speech recognition application based on OpenEars and RapidEars. It works fine on iPhone 5 and 5s, but there is some bug on iPhone 4: OpenEars doesn’t stop recognition loop immediately when stopListening method is called. It’s delay correlates with quantity of recognized words in current loop and can be up to 1 minute and more… So, how can I stop recognition loop immediately (or at least during several seconds)? Below is openears log. Thanks in advice.
2014-05-06 18:11:37.404 MyApp[2221:907] Stop button pressed
time delta 1399389097.435529 partial result fwdtree
partial: <s>: …[recognized words]… :(0000001)
INFO: file_omitted(0): cmn_prior_update: from < 12.80 -1.94 -0.17 0.24 -0.35 -0.17 -0.14 -0.17 -0.31 -0.08 -0.17 -0.13 -0.13 >
INFO: file_omitted(0): cmn_prior_update: to < 12.93 -2.03 -0.19 0.26 -0.36 -0.15 -0.15 -0.13 -0.34 -0.06 -0.16 -0.14 -0.10 >
INFO: file_omitted(0): idx 1380 is final (1380)
INFO: file_omitted(0): fwdtree: marking arc buffer final
INFO: file_omitted(0): fwdtree: allocated 4096 arcs (48 KiB)
INFO: file_omitted(0): fwdtree: allocated 512 start frame entries (2 KiB)
INFO: file_omitted(0): Waiting for 1 consumers to finish
INFO: file_omitted(0): idx 1380 is final (1380)
time delta 1399389118.439060 end 0000001
2014-05-06 18:11:58.439 MyApp[2221:907] Status: Detected end of speech, Code: 4
INFO: file_omitted(0): fwdflat canceled
INFO: file_omitted(0): fwdtree canceled
INFO: file_omitted(0): TOTAL fwdtree 48.961 wall 2.135 xRT
INFO: file_omitted(0): TOTAL fwdflat 77.410 wall 3.376 xRT
2014-05-06 18:11:58.605 MyApp[2221:907] Status: It has stopped listening, Code: 3May 6, 2014 at 5:41 pm #1021129Halle WinklerPolitepixWelcome,
Are both the frameworks up to the current version? OpenEars is 1.7 and RapidEars is 1.65.
May 6, 2014 at 6:15 pm #1021130HarzParticipantYes, OpenEars version is 1.7 (with minor changes in the code) and RapidEars is 1.65 (Licensed).
May 6, 2014 at 6:18 pm #1021131Halle WinklerPolitepixWhat are the changes to OpenEars?
May 6, 2014 at 6:22 pm #1021132HarzParticipantModified method pathToModel to enable model updates via Wi-Fi.
May 6, 2014 at 6:32 pm #1021133Halle WinklerPolitepixOK, can you show the full logging output of both OpenEarsLogging and verbosePocketsphinx from the beginning to the end of the app session? It’s fine to omit anything that isn’t logging that OpenEars creates and to omit specific vocabulary as long as the vocabulary count isn’t changed.
May 6, 2014 at 7:32 pm #1021140HarzParticipantMay 6, 2014 at 7:37 pm #1021141HarzParticipant2014-05-06 21:07:08.962 MyApp[4651:60b] otherApps (
“com.apple.mobilesafari”,
“com.youmi.FileMasterEN”
)
2014-05-06 21:07:17.950 MyApp[4651:60b] ‘Open Ears’ has been Initialized
2014-05-06 21:07:17.952 MyApp[4651:60b] Starting OpenEars logging for OpenEars version 1.7 on 32-bit device: iPhone running iOS version: 7.000000
2014-05-06 21:07:37.318 MyApp[4651:60b] Bundle Name : Model
2014-05-06 21:07:37.320 MyApp[4651:60b] OE Ext : Model Path is /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle
2014-05-06 21:07:37.321 MyApp[4651:60b] OE Ext : Model Path is /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle
2014-05-06 21:07:37.325 MyApp[4651:60b] Accuracy set to 20.
2014-05-06 21:07:37.327 MyApp[4651:60b] OE Ext : Model Path is /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle
2014-05-06 21:07:37.336 MyApp[4651:60b] User gave mic permission for this app.
2014-05-06 21:07:37.338 MyApp[4651:60b] setting sample rate to 16000
2014-05-06 21:07:37.339 MyApp[4651:60b] The audio session has never been initialized so we will do that now.
2014-05-06 21:07:37.339 MyApp[4651:60b] Checking and resetting all audio session settings.
2014-05-06 21:07:37.340 MyApp[4651:60b] audioCategory is incorrect, we will change it.
2014-05-06 21:07:37.341 MyApp[4651:60b] audioCategory is now on the correct setting of kAudioSessionCategory_PlayAndRecord.
2014-05-06 21:07:37.342 MyApp[4651:60b] bluetoothInput is incorrect, we will change it.
2014-05-06 21:07:37.343 MyApp[4651:60b] bluetooth input is now on the correct setting of 1.
2014-05-06 21:07:37.344 MyApp[4651:60b] Output Device: ReceiverAndMicrophone.
2014-05-06 21:07:37.345 MyApp[4651:60b] categoryDefaultToSpeaker is incorrect, we will change it.
2014-05-06 21:07:37.346 MyApp[4651:60b] CategoryDefaultToSpeaker is now on the correct setting of 1.
2014-05-06 21:07:37.347 MyApp[4651:60b] preferredBufferSize is incorrect, we will change it.
2014-05-06 21:07:37.348 MyApp[4651:60b] PreferredBufferSize is now on the correct setting of 0.128000.
2014-05-06 21:07:37.349 MyApp[4651:60b] preferredSampleRateCheck is incorrect, we will change it.
2014-05-06 21:07:37.350 MyApp[4651:60b] preferred hardware sample rate is now on the correct setting of 16000.000000.
2014-05-06 21:07:37.385 MyApp[4651:60b] AudioSessionManager startAudioSession has reached the end of the initialization.
2014-05-06 21:07:37.386 MyApp[4651:60b] Exiting startAudioSession.
2014-05-06 21:07:37.394 MyApp[4651:650b] setSecondsOfSilence value of 0.000000 was too large or too small or was NULL, using default of 0.700000.
2014-05-06 21:07:37.400 MyApp[4651:600f] Audio route has changed for the following reason:
2014-05-06 21:07:37.401 MyApp[4651:600f] There has been a change of category
2014-05-06 21:07:37.402 MyApp[4651:600f] The previous audio route was Speaker
2014-05-06 21:07:37.403 MyApp[4651:600f] This is not a case in which OpenEars performs a route change voluntarily. At the close of this function, the audio route is SpeakerAndMicrophone
2014-05-06 21:07:37.409 MyApp[4651:650b] Project has these words or phrases in its dictionary:
[deleted]
2014-05-06 21:07:37.411 MyApp[4651:650b] Recognition loop has started
INFO: file_omitted(0): Parsing command line:
\
-lm /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle/base.lm.dmp \
-dict /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle/base.dic \
-hmm /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle \
-lw 6.5 \
-samprate 16000 \
-rawlogdir /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Library/CachesCurrent configuration:
[NAME] [DEFLT] [VALUE]
-adchdr 0 0
-adcin no no
-agc none none
-agcthresh 2.0 2.000000e+00
-align
-alpha 0.97 9.700000e-01
-arcdumpdir
-ascale 20.0 2.000000e+01
-aw 1 1
-backtrace no no
-beam 1e-48 1.000000e-48
-bestpath yes yes
-bestpathlw 9.5 9.500000e+00
-bghist no no
-cepdir
-cepext .mfc .mfc
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-compallsen no no
-ctl
-ctlcount -1 -1
-ctlincr 1 1
-ctloffset 0 0
-debug 0
-dict /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle/base.dic
-dictcase no no
-dither no no
-doublebw no no
-ds 1 1
-fdict
-feat 1s_c_d_dd 1s_c_d_dd
-featparams
-fillprob 1e-8 1.000000e-08
-frate 100 100
-fsg
-fsgusealtpron yes yes
-fsgusefiller yes yes
-fwdflat yes yes
-fwdflatbeam 1e-64 1.000000e-64
-fwdflatlw 8.5 8.500000e+00
-fwdflatsfwin 25 25
-fwdflatwbeam 7e-29 7.000000e-29
-fwdtree yes yes
-fwdtreelm
-hmm /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle
-hyp
-hypprefix
-input_endian little little
-jsgf
-kdmaxbbi -1 -1
-kdmaxdepth 0 0
-kdtree
-latsize 512 512
-lda
-ldadim 0 0
-lextreedump 0 0
-lifter 0 0
-lm /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle/base.lm.dmp
-lmctl
-lmname default default
-logbase 1.0001 1.000100e+00
-logfn
-logspec no no
-lowerf 133.33334 1.333333e+02
-lpbeam 1e-40 1.000000e-40
-lponlybeam 7e-29 7.000000e-29
-lw 6.5 6.500000e+00
-maxhmmpf -1 -1
-maxnewoov 20 20
-maxsilfr -1 -1
-maxwpf 50 50
-mdef
-mean
-mfclogdir
-min_endfr 0 0
-mixw
-mixwfloor 0.0000001 1.000000e-07
-mllr
-mmap yes yes
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-nwpen 1.0 1.000000e+00
-pbeam 1e-48 1.000000e-48
-pip 1.0 1.000000e+00
-pl_beam 1e-10 1.000000e-10
-pl_pbeam 1e-5 1.000000e-05
-pl_window 0 0
-rawlogdir /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Library/Caches
-remove_dc no no
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-sendump
-senlogdir
-senmgau
-silprob 0.005 5.000000e-03
-smoothspec no no
-svspec
-tmat
-tmatfloor 0.0001 1.000000e-04
-topn 4 4
-topn_beam 0 0
-toprule
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+03
-usewdphones no no
-uw 1.0 1.000000e+00
-var
-varfloor 0.0001 1.000000e-04
-varnorm no no
-verbose no no
-vm
-warp_params
-warp_type inverse_linear inverse_linear
-wbeam 7e-29 7.000000e-29
-wip 0.65 6.500000e-01
-wlen 0.025625 2.562500e-02INFO: file_omitted(0): Parsing command line:
\
-nfilt 40 \
-lowerf 133.3334 \
-upperf 6855.4976 \
-feat s2_4x \
-agc none \
-cmn current \
-varnorm no \
-fwdflat yesCurrent configuration:
[NAME] [DEFLT] [VALUE]
-agc none none
-agcthresh 2.0 2.000000e+00
-alpha 0.97 9.700000e-01
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-dither no no
-doublebw no no
-feat 1s_c_d_dd s2_4x
-frate 100 100
-input_endian little little
-lda
-ldadim 0 0
-lifter 0 0
-logspec no no
-lowerf 133.33334 1.333334e+02
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-remove_dc no no
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-smoothspec no no
-svspec
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+03
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wlen 0.025625 2.562500e-02INFO: file_omitted(0): Parsed model-specific feature parameters from /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle/feat.params
INFO: file_omitted(0): Reading HMM transition probability matrices: /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle/transition_matrices
INFO: file_omitted(0): Loading senones from dump file /var/mobile/Applications/E43C3D68-1E23-4C8A-96FD-917CCE0F8B8E/Documents/Model.bundle/sendump
INFO: file_omitted(0): BEGIN FILE FORMAT DESCRIPTION
INFO: file_omitted(0): Rows: 64, Columns: 6147
INFO: file_omitted(0): Using memory-mapped I/O for senones
INFO: file_omitted(0): Building PID tables for dictionary
INFO: file_omitted(0): Allocating 49^3 * 2 bytes (229 KiB) for word-initial triphones
INFO: file_omitted(0): Allocated 29008 bytes (28 KiB) for word-final triphones
INFO: file_omitted(0): Allocated 29008 bytes (28 KiB) for single-phone word triphones
INFO: file_omitted(0): No \data\ mark in LM file
INFO: file_omitted(0): Number of word mappings: 2522
INFO: file_omitted(0): Number of word mappings: 2522
INFO: file_omitted(0): Initialized arc buffer ‘fwdtree’, each arc occupies 12 bytes
2014-05-06 21:07:38.845 MyApp[4651:650b] Starting openAudioDevice on the device.
2014-05-06 21:07:38.846 MyApp[4651:650b] Audio unit wrapper successfully created.
2014-05-06 21:07:38.854 MyApp[4651:650b] Set audio route to SpeakerAndMicrophone
2014-05-06 21:07:38.855 MyApp[4651:650b] Restoring SmartCMN value of 11.306060
2014-05-06 21:07:38.856 MyApp[4651:650b] Checking and resetting all audio session settings.
2014-05-06 21:07:38.858 MyApp[4651:650b] audioCategory is correct, we will leave it as it is.
2014-05-06 21:07:38.858 MyApp[4651:650b] bluetoothInput is correct, we will leave it as it is.
2014-05-06 21:07:38.859 MyApp[4651:650b] Output Device: SpeakerAndMicrophone.
2014-05-06 21:07:38.860 MyApp[4651:650b] categoryDefaultToSpeaker is correct, we will leave it as it is.
2014-05-06 21:07:38.861 MyApp[4651:650b] preferredBufferSize is correct, we will leave it as it is.
2014-05-06 21:07:38.862 MyApp[4651:650b] preferredSampleRateCheck is correct, we will leave it as it is.
2014-05-06 21:07:38.863 MyApp[4651:650b] Setting the variables for the device and starting it.
2014-05-06 21:07:38.863 MyApp[4651:650b] Looping through ringbuffer sections and pre-allocating them.
2014-05-06 21:07:39.359 MyApp[4651:650b] Started audio output unit.
2014-05-06 21:07:39.361 MyApp[4651:650b] Calibration has started
2014-05-06 21:07:41.566 MyApp[4651:650b] Calibration has completed
2014-05-06 21:07:41.568 MyApp[4651:650b] Listening.
2014-05-06 21:07:49.319 MyApp[4651:650b] Speech detected…
INFO: file_omitted(0): arc_buffer_producer_start_utt
time delta 1399399669.323025 start 0000000
INFO: file_omitted(0): arc_buffer_consumer_start_utt
time delta 1399399669.324462 start 0000000
time delta 1399399669.324843 partial result fwdtree
partial: <s>:(0000000)
time delta 1399399669.341870 partial result fwdtree_[deleted partial recognized results]_
2014-05-06 21:08:04.284 MyApp[4651:60b] In ContinuousAudioUnit
INFO: file_omitted(0): cmn_prior_update: from < 10.43 -1.69 -0.17 0.14 -0.38 -0.18 0.06 -0.16 -0.21 -0.02 -0.05 -0.10 -0.06 >
INFO: file_omitted(0): cmn_prior_update: to < 10.42 -1.69 -0.17 0.14 -0.38 -0.18 0.06 -0.16 -0.21 -0.02 -0.05 -0.10 -0.06 >
INFO: file_omitted(0): idx 1404 is final (1404)
INFO: file_omitted(0): fwdtree: marking arc buffer final
INFO: file_omitted(0): fwdtree: allocated 4096 arcs (48 KiB)
INFO: file_omitted(0): fwdtree: allocated 512 start frame entries (2 KiB)
INFO: file_omitted(0): Waiting for 1 consumers to finish
INFO: file_omitted(0): idx 1404 is final (1404)
2014-05-06 21:08:04.959 MyApp[4651:650b] Stopping audio unit.
2014-05-06 21:08:05.091 MyApp[4651:650b] Audio Output Unit stopped, cleaning up variable states.
2014-05-06 21:08:05.092 MyApp[4651:650b] This device is not recording, so first we will set its recording status to 0
2014-05-06 21:08:05.093 MyApp[4651:650b] The audio unit is running so we are going to dispose of its instance
INFO: file_omitted(0): fwdtree canceled
INFO: file_omitted(0): TOTAL fwdtree 6.994 wall 0.498 xRT
INFO: file_omitted(0): fwdflat canceled
INFO: file_omitted(0): TOTAL fwdflat 27.945 wall 1.990 xRT
2014-05-06 21:08:05.108 MyApp[4651:650b] No longer listening.May 6, 2014 at 9:11 pm #1021142Halle WinklerPolitepixSent you an email – get in touch when you have a moment so I can assist you with this.
May 7, 2014 at 10:29 am #1021148Halle WinklerPolitepixThanks for getting in touch! OK, so, I’ve taken some time to look at your logging output and I think what I’m seeing is that this isn’t using one of the two supported 8khz acoustic models that ship with RapidEars, is that correct? It looks like this is probably one of the 16khz Voxforge models if I’m not mistaken, which is why the performance is going to be slow with RapidEars. Can you let me know if there are any other modifications before we look into if there are any ways to improve the performance for the slower 16khz models on the old devices? A few of the other Sphinx runtime arguments look a bit unexpected to me so it would be good to know if any other changes have been made so that the troubleshooting time can be put to the most efficient use.
May 7, 2014 at 10:52 am #1021150HarzParticipantThanks a lot Halle. I’m using 16kHz semicontinous model generated from wavs and text corpus. There are no other modifications in OpenEars except pathToModel method.
Does it make sense to downsample model to 8kHz without losing an acuracy?May 7, 2014 at 11:10 am #1021152Halle WinklerPolitepixAh OK, you are using your own model, very cool. It is possible that the 16khz isn’t the only issue but it is definitely going to interact with any other issues on the older devices because that’s a lot more analysis to do.
Probably it’s best to start out by showing the logging from the iPhone 4 from an incident when it is showing this issue so I can get a look at which stage in the listening loop the behavior is happening in, and then we can look at possible causes and maybe find a way to fix or improve the behavior (I can’t guarantee there is a fix for this since it isn’t one of the models I ship, but I’m happy to work with you to try to improve things). It’s fine removing the vocabulary from the logs, but do let me know the reported size of the vocabulary so I can get a sense of what we’re trying to do here.
May 7, 2014 at 11:14 am #1021153Halle WinklerPolitepixBTW, let me suggest that you create a replicable case for yourself using PocketsphinxController’s pathToTestFile property which will let you use prerecorded audio in place of live audio in the live recognition loop. That way it will be easy for you to verify whether any changes help and you’ll also have some replicable data for me in case you want to send me a test case. It also just makes development a lot easier when you don’t have to provide live input yourself while working on a bug.
May 7, 2014 at 11:25 am #1021154HarzParticipantThank you for advice. I will implement it soon.
Currently our vocabulary contains 2800 words including (2) and (3) variants for some words. But we tested application on about 8000 words vocabulary and 20M language model (lm.dmp) It worked fast enough with RapidEars on iPhones 5/5s.
May 7, 2014 at 11:47 am #1021155Halle WinklerPolitepixOK, I think this is probably going to be the root of the issue – RapidEars is only designed for use with the shipped acoustic models and with the normal OpenEars-required language model maximum size of around 200-500 words depending. I can imagine your specs working some of the time on a 5S, but it’s too much overhead for an iPhone 4 to support. I think you’re just seeing a delayed search time on a slow device for a 16khz model doing large vocabulary recognition – speed considerations are the reason that even stock OpenEars doesn’t use 16khz models and big vocabularies, and RapidEars is more sensitive to performance considerations since it has to do much more.
If you’re finding the performance acceptable on devices later than the iPhone 4, I think what I would do is to make a downsampled version of your acoustic model, ship it alongside your 16khz model, and have the iPhone 4 and earlier fall back to using the 8khz model. This might help your issue sufficiently, but it also might be necessary for you to have the iPhone 4 fall back to a smaller vocabulary as well in order to achieve normal performance. Those would be my starting suggestions so that you can preserve your ideal performance for your users whose devices can support it without giving the older phones a job to do that they aren’t up for.
May 7, 2014 at 3:03 pm #1021156Halle WinklerPolitepixI thought of another way you can fall back to some faster iPhone 4 performance along with the other two suggestions. In the file PocketsphinxRunConfig.h there is this sequence:
#if __LP64__ #define kBEAM @"1e-96" // -beam Beam width applied to every frame in Viterbi search (smaller values mean wider beam), defaults to 1e-48 #else #define kBEAM @"1e-66" // -beam Beam width applied to every frame in Viterbi search (smaller values mean wider beam), defaults to 1e-48 #endif
You can set the value 1e-66 to a larger value such as 1e-48 for faster searches on older devices. I would leave it as-is for the devices that are already performing satisfactorily.
May 7, 2014 at 3:24 pm #1021158HarzParticipantThanks a lot for your help. We appreciate your advice and suggestions. Soon we’ll regenerate our model to 8kHz and test performance again. In any way it will be nice to realize possibility to stop delayed recognition, because it’s don’t take any sense, but spend too much time. Here is log from iPhone 4. As you can see, delay between sending stopListenning and actually it’s stopping is about 3 minutes.
2014-05-07 16:36:59.904 MyApp[2864:907] ‘Open Ears’ has been Initialized
2014-05-07 16:36:59.924 MyApp[2864:907] Starting OpenEars logging for OpenEars version 1.7 on 32-bit device: iPhone running iOS version: 6.100000
2014-05-07 16:37:01.531 MyApp[2864:907] Bundle Name : Model
2014-05-07 16:37:01.534 MyApp[2864:907] OE Ext : Model Path is /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle
2014-05-07 16:37:01.537 MyApp[2864:907] OE Ext : Model Path is /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle
2014-05-07 16:37:01.563 MyApp[2864:907] Accuracy set to 10.
2014-05-07 16:37:01.566 MyApp[2864:907] OE Ext : Model Path is /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle
2014-05-07 16:37:01.573 MyApp[2864:907] setting sample rate to 16000
2014-05-07 16:37:01.584 MyApp[2864:907] The audio session has never been initialized so we will do that now.
2014-05-07 16:37:01.585 MyApp[2864:907] Checking and resetting all audio session settings.
2014-05-07 16:37:01.588 MyApp[2864:907] audioCategory is incorrect, we will change it.
2014-05-07 16:37:01.590 MyApp[2864:907] audioCategory is now on the correct setting of kAudioSessionCategory_PlayAndRecord.
2014-05-07 16:37:01.592 MyApp[2864:907] bluetoothInput is incorrect, we will change it.
2014-05-07 16:37:01.593 MyApp[2864:907] bluetooth input is now on the correct setting of 1.
2014-05-07 16:37:01.596 MyApp[2864:907] Output Device: ReceiverAndMicrophone.
2014-05-07 16:37:01.601 MyApp[2864:907] categoryDefaultToSpeaker is incorrect, we will change it.
2014-05-07 16:37:01.603 MyApp[2864:907] CategoryDefaultToSpeaker is now on the correct setting of 1.
2014-05-07 16:37:01.605 MyApp[2864:907] preferredBufferSize is incorrect, we will change it.
2014-05-07 16:37:01.607 MyApp[2864:907] PreferredBufferSize is now on the correct setting of 0.128000.
2014-05-07 16:37:01.609 MyApp[2864:907] preferredSampleRateCheck is incorrect, we will change it.
2014-05-07 16:37:01.651 MyApp[2864:907] preferred hardware sample rate is now on the correct setting of 16000.000000.
2014-05-07 16:37:01.756 MyApp[2864:907] AudioSessionManager startAudioSession has reached the end of the initialization.
2014-05-07 16:37:01.758 MyApp[2864:907] Exiting startAudioSession.
2014-05-07 16:37:01.772 MyApp[2864:5c03] setSecondsOfSilence value of 0.000000 was too large or too small or was NULL, using default of 0.700000.
2014-05-07 16:37:01.813 MyApp[2864:5c03] Project has these words or phrases in its dictionary:
[del]
…and 2775 more.
2014-05-07 16:37:01.819 MyApp[2864:5c03] Recognition loop has started
2014-05-07 16:37:01.884 MyApp[2864:907] Audio route has changed for the following reason:
2014-05-07 16:37:01.885 MyApp[2864:907] There has been a change of category
2014-05-07 16:37:01.887 MyApp[2864:907] The previous audio route was Speaker
2014-05-07 16:37:01.900 MyApp[2864:907] This is not a case in which OpenEars performs a route change voluntarily. At the close of this function, the audio route is SpeakerAndMicrophone
INFO: file_omitted(0): Parsing command line:
\
-lm /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle/base.lm.dmp \
-dict /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle/base.dic \
-hmm /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle \
-lw 6.5 \
-samprate 16000 \
-rawlogdir /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Library/CachesCurrent configuration:
[NAME] [DEFLT] [VALUE]
-adchdr 0 0
-adcin no no
-agc none none
-agcthresh 2.0 2.000000e+00
-align
-alpha 0.97 9.700000e-01
-arcdumpdir
-ascale 20.0 2.000000e+01
-aw 1 1
-backtrace no no
-beam 1e-48 1.000000e-48
-bestpath yes yes
-bestpathlw 9.5 9.500000e+00
-bghist no no
-cepdir
-cepext .mfc .mfc
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-compallsen no no
-ctl
-ctlcount -1 -1
-ctlincr 1 1
-ctloffset 0 0
-debug 0
-dict /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle/base.dic
-dictcase no no
-dither no no
-doublebw no no
-ds 1 1
-fdict
-feat 1s_c_d_dd 1s_c_d_dd
-featparams
-fillprob 1e-8 1.000000e-08
-frate 100 100
-fsg
-fsgusealtpron yes yes
-fsgusefiller yes yes
-fwdflat yes yes
-fwdflatbeam 1e-64 1.000000e-64
-fwdflatlw 8.5 8.500000e+00
-fwdflatsfwin 25 25
-fwdflatwbeam 7e-29 7.000000e-29
-fwdtree yes yes
-fwdtreelm
-hmm /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle
-hyp
-hypprefix
-input_endian little little
-jsgf
-kdmaxbbi -1 -1
-kdmaxdepth 0 0
-kdtree
-latsize 512 512
-lda
-ldadim 0 0
-lextreedump 0 0
-lifter 0 0
-lm /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle/base.lm.dmp
-lmctl
-lmname default default
-logbase 1.0001 1.000100e+00
-logfn
-logspec no no
-lowerf 133.33334 1.333333e+02
-lpbeam 1e-40 1.000000e-40
-lponlybeam 7e-29 7.000000e-29
-lw 6.5 6.500000e+00
-maxhmmpf -1 -1
-maxnewoov 20 20
-maxsilfr -1 -1
-maxwpf 50 50
-mdef
-mean
-mfclogdir
-min_endfr 0 0
-mixw
-mixwfloor 0.0000001 1.000000e-07
-mllr
-mmap yes yes
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-nwpen 1.0 1.000000e+00
-pbeam 1e-48 1.000000e-48
-pip 1.0 1.000000e+00
-pl_beam 1e-10 1.000000e-10
-pl_pbeam 1e-5 1.000000e-05
-pl_window 0 0
-rawlogdir /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Library/Caches
-remove_dc no no
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-sendump
-senlogdir
-senmgau
-silprob 0.005 5.000000e-03
-smoothspec no no
-svspec
-tmat
-tmatfloor 0.0001 1.000000e-04
-topn 4 4
-topn_beam 0 0
-toprule
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+03
-usewdphones no no
-uw 1.0 1.000000e+00
-var
-varfloor 0.0001 1.000000e-04
-varnorm no no
-verbose no no
-vm
-warp_params
-warp_type inverse_linear inverse_linear
-wbeam 7e-29 7.000000e-29
-wip 0.65 6.500000e-01
-wlen 0.025625 2.562500e-02INFO: file_omitted(0): Parsing command line:
\
-nfilt 40 \
-lowerf 133.3334 \
-upperf 6855.4976 \
-feat s2_4x \
-agc none \
-cmn current \
-varnorm no \
-fwdflat yesCurrent configuration:
[NAME] [DEFLT] [VALUE]
-agc none none
-agcthresh 2.0 2.000000e+00
-alpha 0.97 9.700000e-01
-ceplen 13 13
-cmn current current
-cmninit 8.0 8.0
-dither no no
-doublebw no no
-feat 1s_c_d_dd s2_4x
-frate 100 100
-input_endian little little
-lda
-ldadim 0 0
-lifter 0 0
-logspec no no
-lowerf 133.33334 1.333334e+02
-ncep 13 13
-nfft 512 512
-nfilt 40 40
-remove_dc no no
-round_filters yes yes
-samprate 16000 1.600000e+04
-seed -1 -1
-smoothspec no no
-svspec
-transform legacy legacy
-unit_area yes yes
-upperf 6855.4976 6.855498e+03
-varnorm no no
-verbose no no
-warp_params
-warp_type inverse_linear inverse_linear
-wlen 0.025625 2.562500e-02INFO: file_omitted(0): Parsed model-specific feature parameters from /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle/feat.params
INFO: file_omitted(0): Reading HMM transition probability matrices: /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle/transition_matrices
INFO: file_omitted(0): Loading senones from dump file /var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/Model.bundle/sendump
INFO: file_omitted(0): BEGIN FILE FORMAT DESCRIPTION
INFO: file_omitted(0): Rows: 64, Columns: 6147
INFO: file_omitted(0): Using memory-mapped I/O for senones
INFO: file_omitted(0): Building PID tables for dictionary
INFO: file_omitted(0): Allocating 49^3 * 2 bytes (229 KiB) for word-initial triphones
INFO: file_omitted(0): Allocated 29008 bytes (28 KiB) for word-final triphones
INFO: file_omitted(0): Allocated 29008 bytes (28 KiB) for single-phone word triphones
INFO: file_omitted(0): No \data\ mark in LM file
INFO: file_omitted(0): Number of word mappings: 2522
INFO: file_omitted(0): Number of word mappings: 2522
INFO: file_omitted(0): Initialized arc buffer ‘fwdtree’, each arc occupies 12 bytes
2014-05-07 16:37:05.653 MyApp[2864:5c03] Warning: There is a request to use pathToTestFile but there isn’t a usable file at the location given (“/var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/test.wav”) so the live microphone will be used instead.
2014-05-07 16:37:05.662 MyApp[2864:5c03] Starting openAudioDevice on the device.
2014-05-07 16:37:05.666 MyApp[2864:5c03] Audio unit wrapper successfully created.
2014-05-07 16:37:05.678 MyApp[2864:5c03] Set audio route to SpeakerAndMicrophone
2014-05-07 16:37:05.683 MyApp[2864:5c03] Checking and resetting all audio session settings.
2014-05-07 16:37:05.687 MyApp[2864:5c03] audioCategory is correct, we will leave it as it is.
2014-05-07 16:37:05.692 MyApp[2864:5c03] bluetoothInput is correct, we will leave it as it is.
2014-05-07 16:37:05.696 MyApp[2864:5c03] Output Device: SpeakerAndMicrophone.
2014-05-07 16:37:05.700 MyApp[2864:5c03] categoryDefaultToSpeaker is correct, we will leave it as it is.
2014-05-07 16:37:05.704 MyApp[2864:5c03] preferredBufferSize is correct, we will leave it as it is.
2014-05-07 16:37:05.709 MyApp[2864:5c03] preferredSampleRateCheck is correct, we will leave it as it is.
2014-05-07 16:37:05.713 MyApp[2864:5c03] Setting the variables for the device and starting it.
2014-05-07 16:37:05.716 MyApp[2864:5c03] Looping through ringbuffer sections and pre-allocating them.
2014-05-07 16:37:06.316 MyApp[2864:5c03] Started audio output unit.
2014-05-07 16:37:06.319 MyApp[2864:5c03] Calibration has started
2014-05-07 16:37:08.528 MyApp[2864:5c03] Calibration has completed
2014-05-07 16:37:08.532 MyApp[2864:5c03] Listening.
2014-05-07 16:37:09.573 MyApp[2864:5c03] Speech detected…
INFO: file_omitted(0): arc_buffer_producer_start_utt
time delta 1399469829.599150 start 0000000
time delta 1399469829.602701 partial result fwdtree
partial: <s>:(0000000)
INFO: file_omitted(0): arc_buffer_consumer_start_utt
time delta 1399469829.609724 start 0000000
time delta 1399469829.633936 partial result fwdtree
[del]2014-05-07 16:38:29.691 MyApp[2864:907] In ContinuousAudioUnit
INFO: file_omitted(0): cmn_prior_update: from < 14.39 -2.20 -0.33 0.33 -0.42 -0.08 -0.21 -0.08 -0.34 -0.05 -0.17 -0.12 -0.09 >
INFO: file_omitted(0): cmn_prior_update: to < 14.23 -2.12 -0.30 0.33 -0.37 -0.12 -0.21 -0.08 -0.34 -0.06 -0.18 -0.13 -0.08 >
[Here is delay!]
INFO: file_omitted(0): idx 8005 is final (8005)
INFO: file_omitted(0): fwdtree: marking arc buffer final
INFO: file_omitted(0): fwdtree: allocated 8192 arcs (96 KiB)
INFO: file_omitted(0): fwdtree: allocated 512 start frame entries (2 KiB)
INFO: file_omitted(0): Waiting for 1 consumers to finish
INFO: file_omitted(0): idx 8005 is final (8005)
2014-05-07 16:41:41.845 MyApp[2864:5c03] Stopping audio unit.
time delta 1399470101.847927 end 0000000
2014-05-07 16:41:41.913 MyApp[2864:5c03] Audio Output Unit stopped, cleaning up variable states.
2014-05-07 16:41:41.916 MyApp[2864:5c03] This device is not recording, so first we will set its recording status to 0
2014-05-07 16:41:41.918 MyApp[2864:5c03] The audio unit is running so we are going to dispose of its instance
INFO: file_omitted(0): fwdtree canceled
INFO: file_omitted(0): TOTAL fwdtree 270.115 wall 3.374 xRT
INFO: file_omitted(0): fwdflat canceled
INFO: file_omitted(0): TOTAL fwdflat 244.569 wall 3.055 xRT
2014-05-07 16:41:41.957 MyApp[2864:5c03] No longer listening.May 7, 2014 at 3:35 pm #1021159Halle WinklerPolitepixHi,
There’s a bit too much missing from the logging above. Can you email me the complete logging output from the beginning to the end with nothing removed? There also needs to be some NSLogging for the method you use to call stopListening so some programmatic indication of the fact that it was called appears inline in the log with a datestamp, since I can’t get any information out of the logs with the sequence deleted in which stopListening was called.
BTW, did you see that pathToTestFile isn’t working in the case above? Here is the error:
Warning: There is a request to use pathToTestFile but there isn’t a usable file at the location given (“/var/mobile/Applications/A786BFDD-68F9-4FAF-A535-47561B580771/Documents/test.wav”) so the live microphone will be used instead.
-
AuthorPosts
- You must be logged in to reply to this topic.