-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Sonus in the browser #28
Comments
@evancohen My setup is a rpi with a chrome opened (for SpeechRecognition, so i have unlimited cloud speech api keys). In my opinion, sonus is ok, but processing speech still needs to be done in cloud for the moment. |
I like that approach, a few suggestions for you: First, you can record multiple audio streams on the Pi with Second, if you are getting false positives I recommend playing with the recognition sensitivity. Also, short activation phrases tend to be more prone to false positives, so you could also try something like "hey Jarvis". Anyhow...
At the end of the day, if you're super dedicated to it being free you end up having to do a little extra legwork. What I'd like to see is a snowboy keyword spotter that will run in the browser. Then I could write a simple wrapper to make it and In the meantime I'm going to see if I can write a keyword spotter that will work in the browser, then then use |
@evancohen You suggested me to use JsSpeechRecognizer and now I see that I made a huge mistake disregarding it, I assumed it was also related somehow to pocketsphinx. But now I came back to it, and see that they have just what I need - keyword that you can train yourself, without any real recognition and phoneme stuff, that's not really needed for a simple task of recognizing just 1 or even a few keywords. Will see how that works. Hopefull Ill be able to switch between it and chrome's speech recognition. Thanks! |
@ghost I had the same idea. Did you change your implementation meanwhile? Do you have some code snippets which you would like to share? |
But isn't there a chance that you loose some of the audio chunk? Let's assume this is a timeline in x axis
In case snowboy needs some (X) time to realize that the hotword was spoken and the user continues speaking (maybe very fast) then the browser will start to listen to late and would only get the |
@timaschew I've got an experimental implementation that uses a ring buffer for audio on the Right now I'm in Cambodia with some rather limited resources, but I'd like to help you get Sonus working. Can you file a separate issue with some repro steps to where you're stuck? |
Ah nice.
Actually, I didn't get stuck, it were just some concerns I had. |
@timaschew happy to answer any questions (or concerns) you have! I am traveling at the moment, so I can't promise I'll respond instantly, but I will get back to you eventually 😄 |
Continuing our discussion from TalAter/annyang#100
The text was updated successfully, but these errors were encountered: