This content originally appeared on DEV Community and was authored by Raman
Introduction
The extension gives you the ability to control Chrome by only using you voice. Once activated, you can open your favourite websites, add bookmarks and search Google using just some easy commands (like "Chrome, open Netflix") and a microphone. While in these 24 hours we only incorporated the most useful and intuitive commands, the implementation can be easily extended to provide full functionality.
My Deepgram Use-Case
The heart of speech recognition is the Deepgram API. When the chrome extension window is opened, we prompt the user to give access to their microphone. For the rest of the session, the microphone listens in the background and the generated transcript is sent to the back-end. The extension will then use a language processing algorithm to identify the commands in the recorded text, and by integrating these commands with the Chrome developer tools, it executes them in order to control the browser.
Dive into Details
Successfully using the Deepgram API in order to incorporate speech recognition in a software project for the first time.
Making our project compatible with Chrome (gaining access to all the useful permissions and having control over the browser)
Working efficiently as a team to create a project that requires skills from different domains of Computer Science
Conclusion
Seeing the documentation of deepgram, we can definitely say that it's easier to integrate with it compared to other platform.
This content originally appeared on DEV Community and was authored by Raman
Raman | Sciencx (2022-04-11T12:00:27+00:00) Chrome extension. Retrieved from https://www.scien.cx/2022/04/11/chrome-extension/
Please log in to upload a file.
There are no updates yet.
Click the Upload button above to add an update.