08-23-2017, 03:37 PM
(This post was last modified: 09-12-2017, 08:42 AM by Dean Roddey.)
[So, though promising, in more ways that were eventually realized over the course of this thread, technical limitations of this system means that it is of no use to us in terms of a replacement for the speech recognition in something like CQC Voice. See the post below for details:
http://www.charmedquark.com/vb_forum/sho...#pid148359]
So Mike Potts has been after me for a while to look into Amazon's chatbot system. I read through some of the docs today. It does offer an interesting option. The sort version of it is:
It wouldn't be configurable like the Echo stuff is. If you want that, use the Echo as is. This would be for CQC specific, predefined commands like CQC Voice does, but with the Echo's high quality speech recognition. You'd still want to use an array mic like the Kinect or something of that sort. But we wouldn't be limited by the quality of speech recognition available on PCs at this time. I don't think you could use the Echo itself as the mic. If you want to do that, just use the existing Echo stuff probably.
I dunno. I'm iffy on it. It clearly has the huge advantage of massive processing power for audio to text conversion. But, if it was done, I think it would have to replace CQC Voice, since I just can't see continuing to support three different schemes. It's hard enough as it is. So it sort of comes down to, do we believe that we can get CQC Voice up another couple notches in speech recognition quality. If so, I'd argue for keeping it instead. If not, and folks will always be sort of underwhelmed by CQC Voice's local processing capabilities, then maybe it should be replaced and we just accept that cloud based processing is a necessity at least for the time being.
So I'm thinking let's see what I can do on the CQC Voice front with accepting multiple microphones for multiple, parallel speech recognition engines and taking the best of each. If that still isn't solid enough to get people excited, them maybe we dump it for something based on this chat bot technology.
But, I'm open to opinions. Is the security and privacy of all local more important to you than likely better speech recognition?
http://www.charmedquark.com/vb_forum/sho...#pid148359]
So Mike Potts has been after me for a while to look into Amazon's chatbot system. I read through some of the docs today. It does offer an interesting option. The sort version of it is:
- It is essentially the intent system that the Echo uses, but we get to ship our own audio off to the servers, instead of it having to go through the Echo.
- It has an API that would let me build up the intents and utterances and such, so that I could do something like CQC Voice does, i.e. create an easy to use system based on the auto-gen data, and update it any time you change the auto-gen configuration.
- It inherently is conversation oriented, unlike the Echo, so it could work like CQC Voice does on that front, asking for information not initially provided.
- And of course it's cloud based like the Echo, not purely local like CQC Voice.
It wouldn't be configurable like the Echo stuff is. If you want that, use the Echo as is. This would be for CQC specific, predefined commands like CQC Voice does, but with the Echo's high quality speech recognition. You'd still want to use an array mic like the Kinect or something of that sort. But we wouldn't be limited by the quality of speech recognition available on PCs at this time. I don't think you could use the Echo itself as the mic. If you want to do that, just use the existing Echo stuff probably.
I dunno. I'm iffy on it. It clearly has the huge advantage of massive processing power for audio to text conversion. But, if it was done, I think it would have to replace CQC Voice, since I just can't see continuing to support three different schemes. It's hard enough as it is. So it sort of comes down to, do we believe that we can get CQC Voice up another couple notches in speech recognition quality. If so, I'd argue for keeping it instead. If not, and folks will always be sort of underwhelmed by CQC Voice's local processing capabilities, then maybe it should be replaced and we just accept that cloud based processing is a necessity at least for the time being.
So I'm thinking let's see what I can do on the CQC Voice front with accepting multiple microphones for multiple, parallel speech recognition engines and taking the best of each. If that still isn't solid enough to get people excited, them maybe we dump it for something based on this chat bot technology.
But, I'm open to opinions. Is the security and privacy of all local more important to you than likely better speech recognition?
Dean Roddey
Explorans limites defectum
Explorans limites defectum