User Manual
Objectives: Making this feature of corpus collection offline and more accessible to people using an offline mobile app that can be integrated into the online database easily to increase the reach of the features to a broader audience for a larger corpus. We are designing the Corpus collection mechanisms and modules - Which include the design and development of a common framework by incorporating all the below-discussed sub-frameworks for corpus collection. Regional demographic-based organisational corpus modelling. Age demographic-based sequential corpus modelling. Dialectical-based objective modelling Application-specific based corpus modelling.
Use cases: When users don't have internet connectivity on their mobile device, they can still operate the app fully with all its functionalities, creating a good user experience. Senior citizens, people from rural areas, and people who might not be exposed to or comfortable with English can use this app to record their voices and verify the existing sentences, thus enlarging the corpus collected.
Components of the app: Optional login and signup page, User dashboard, Polls to existing users, Record and listen to screen, Speech to text, OCR to text(works in Telugu),
Polls to existing users: To collect data from the user regarding their demographics, dialects and region they belong in a database. To use the data collected and make improvements. Designed with Flutter and has questions that collect the user's demographics, such as their gender, age and dialect.
Recording Functionality and Speech to Text: To identify the words and phrases the user speaks, convert the audio into text, and display it on the screen. Also, to store that audio in the database for further analysis. Coded the recording functionality and speech-to-text in Python using SpeechRecognition Library(works with English as of now)
Certificate Generation: To Provide a Certificate to the user. Using the data stored through polls, an automated certificate would be generated after the successful usage of features available in the app.
Database of the application: To create a database for the app for storing user data Using PostgreSQL for the database Connecting the Postgres server to the app and the database is updated on the backend.
User dashboard and navigation:
- Displays the streak (number of days logged in by the user)
- Displays the statistics of the number of times the user has provided recordings and how many times they have verified the existing database
- On clicking the listen or record button, it redirects you to the speak and listen menu to record more.
- Has a button to allow you to generate a certificate.
- Navigation is done through a menu on the top right, which has options to let you :
- Login
- Generate your certificate
- Generate OCR
Development status:
- Front-end integration: completed.
- Conversion to Telugu: in progress.
- Retrieving data from the database and displaying it: in progress
- Integration of backend: in progress.
- OCR integration: in progress.
Future features:
- Adding more gamification features like a leaderboard, badges and achievements to be obtained
- Smoother navigation through the app, more attractive front end
- Adding more languages into the app (like Hindi) for more reach.
- Making the app more functional.