GSoC 2022 :: Final Report
This post will briefly cover:
- Tasks done and blogs / weekly updates
Offline Voice Agent for AGL
This report is a requirement of the
Final Evaluations phase of Google Summer of Code program. I worked with the Linux Foundation, under Automotive Grade Linux on the project to
An Offline voice-agent for Automotive Grade Linux.
The proposal for the project can be found here.
The Linux Foundation is the nonprofit consortium dedicated to fostering the growth of Linux. Automotive Grade Linux is an open-source project hosted by The Linux Foundation that is building an open operating system and framework for automotive applications.
- Create and integrate a new Layer named
meta-offline-voice-agentwith Automotive Grade Linux (AGL) layers for the offline voice agent, using Vosk-API as the offline speech recognition engine.
- Create custom recipes for all unmet dependencies and libraries required by the Vosk API and Server to work.
- Create a PyQt based basic application to test offline voice recognition using vosk-server.
- Create a gRPC-based Python server and Flutter Client Application for integration with AGL.
- [WIP] Improvements and Integration of the created basic demo application for the voice agent. Possibility of testing on real hardware.
- Ultimately documenting the whole implementation methodology for the developer community to develop similar applications for AGL.
Owing to timezone differences, my mentor and I scheduled the right time and medium for contact: weekly Zoom Calls with other GSoC participants for all updates and roadblocks, and constant communication over Telegram. Further refinement of objectives and goals were set to ensure that the proposed plan of action for the development of
meta-offline-voice-agent was in line with the priorities of the project. I started engaging actively within the AGL community by regularly participating in the Weekly Developer Call. I have also regularly updated personal blog to log progress for future reference and help future Google Summer of Code applicants and aspiring student developers alike. Since I was already acquainted with the AGL community, I could start with setting up the project and
coding-phase a little sooner, during the end of community bonding period itself.
Starting from Week 9 upto Week 15 of GSoC'22, I had to take a break because of my relocation to Italy from India and all related preperations to commence my Master’s of Science studies in Robotics and get used to living in a new country. For the same reason, my project deadline was extended.
- Community Bonding Period ==> AGL mailing list | blog
- Week 1 and 2 ==> AGL mailing list | blog
- Week 3 ==> AGL mailing list | blog
- Week 4 ==> AGL mailing list | blog
- Week 5 ==> AGL mailing list | blog
- Week 6 ==> AGL mailing list | blog
- Week 7 ==> AGL mailing list
- Week 8 ==> AGL mailing list
- Week 9 to 15 ==> Break and deadline extension due to relocation to Italy from India for Master’s studies.
- Week 16 ==> AGL mailing list
- Week 17 ==> AGL mailing list
- Week 18 ==> AGL mailing list
- Week 19, 20 ==> AGL mailing list
- Week 21, 22 ==> AGL mailing list
Commits / Change Requests
AGL Minor Documentation Fixes
- Gerrit: #27370 (and #27339) : Fixed indentation error for a sub-bullet point
- Gerrit: #27537 (and 27520) : Fixed internal link error
- Gerrit: #27575 : Fixed quote errors in echo expressions
- Gerrit: #27794 : meta-offline-voice-agent: Enable layer for offline voice recognition
- This enables offline voice recognition in AGL
- Can successfully run vosk API and vosk websocket server
voice-agent demo app
- Improve and finesse basic demo voice-agent app for AGL, create docs and app-integration with AGL
- Becoming proficient with AGL developer tools, especially
Gerrithas been quite a worthwhile effort.
- Having to relocate to a new country in between GSoC, and then keeping up to the academic expectations of Master’s degree towards the end of GSoC period made it slightly difficult for me to manage all things together, but due to mutual understanding and support from the mentors, I was able to accommodate some new changes in the deliverables and keep the project work going upto the completion of a working demo.
What did I learn?
I was able to learn about the Yocto project, and creating layers and recipes for our specific application
I also learned a lot more about Automotive Grade Linux organization especially their developer tools, workflow, and developmental pipeline.
My technical writing skills have definitely become better and have led me to contribute more effectively and robustly to academic research projects.
I plan to maintain and add to the Automotive Grade Linux’s repositories for the foreseeable future and work on continually improving it.
I will keep contributing and help other new contributors to explore and learn projects in this organization.
It has been one of the best professional experience that I undertook this year. Learning about new tools and technologies, and working on an open source project along with experienced developers from all over the world has been a really great experience for me. I would love to continue working with AGL, improving my project, and whatever I feel that I can contribute to.