Hey all,
As I mentioned in another thread, I've started working on a speech recognition system for MP using MS Speech 5.1.
Currently it does the following:
When you start MP, the only thing it listens for is the keyword: "my computer".
Once you say the keyword, it switches to recognizing the following commands:
-move up
-move down
-move left
-move right
-previous menu
-home
-exit
-my music
-my videos
-settings
-select item
etc. (many words are not tested yet)
Phrases such as "my music" will take you to that window from whatever screen you're on.
If you don't say anything it understands within 5 seconds, it reverts to listening only for the keyword.
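For anyone who wants to picture the keyword/timeout behaviour described above, here is a rough Python sketch of the logic. It is not the plugin's code and does not call MS Speech at all; it only models which phrases are accepted at a given moment. The class name VoiceGate, the hear() method, and the injectable clock are made up for illustration, and I'm assuming the 5-second window restarts each time a command is understood.

```python
import time

KEYWORD = "my computer"
COMMANDS = {
    "move up", "move down", "move left", "move right",
    "previous menu", "home", "exit",
    "my music", "my videos", "settings", "select item",
}
TIMEOUT_SECONDS = 5.0


class VoiceGate:
    """Hypothetical model of the keyword -> command -> timeout cycle."""

    def __init__(self, clock=time.monotonic):
        self._clock = clock          # injectable so the timeout can be tested
        self._active_since = None    # None means only the keyword is accepted

    def _expired(self):
        if self._active_since is None:
            return True
        return self._clock() - self._active_since >= TIMEOUT_SECONDS

    def hear(self, phrase):
        """Return the command to execute, or None if the phrase is ignored."""
        phrase = phrase.strip().lower()
        if self._expired():
            # Keyword-only mode: the keyword opens the command window,
            # anything else is dropped.
            self._active_since = None
            if phrase == KEYWORD:
                self._active_since = self._clock()
            return None
        if phrase in COMMANDS:
            # Assumption: an understood command restarts the 5-second window.
            self._active_since = self._clock()
            return phrase
        return None
```

Something like gate = VoiceGate() followed by gate.hear("my computer") and then gate.hear("my music") would return "my music", as long as the second phrase arrives within 5 seconds.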
My next steps are:
-further explore MP's interface and add the corresponding voice commands
-add modifiers (e.g. "move down three")
-integrate playlist/video/music selection by artist, genre, etc.
-multiple languages (this may depend on the SDK, though)
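As a rough idea of how the "move down three" modifier could work once the recognizer hands back the phrase as text, here is a small Python sketch. Nothing here is implemented in the plugin yet; parse_move, the NUMBER_WORDS table, and the choice to default to a single step are all assumptions for illustration only.

```python
# Hypothetical lookup of spoken counts; extend as needed.
NUMBER_WORDS = {
    "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
}
MOVES = {"move up", "move down", "move left", "move right"}


def parse_move(phrase):
    """Split a phrase like 'move down three' into (command, repeat count).

    Returns None when the phrase is not a move command or the modifier
    is not a known number word.
    """
    words = phrase.strip().lower().split()
    if len(words) < 2:
        return None
    base = " ".join(words[:2])
    if base not in MOVES:
        return None
    if len(words) == 2:
        return base, 1  # assumption: no modifier means a single step
    if len(words) == 3 and words[2] in NUMBER_WORDS:
        return base, NUMBER_WORDS[words[2]]
    return None
```

The caller would then just repeat the base command the returned number of times, so "move down three" becomes three "move down" actions.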
What I could use:
-General feedback
-beta tester(s)
-collaborators
I'm holding off on a large public beta for now, as I'm looking for an easier way to distribute the MS Speech engine; most people don't need most of the SDK.
That's about it for now. I'll use this thread to update my progress, so please feel free to post or PM me.