corpus
Projects with this topic
Utsav Kathalu is a web application built with Streamlit for collecting and organizing festival stories in multiple Indian languages. Users can submit stories as text, attach images for each section, and view the content as an interactive virtual book. The platform aims to preserve and present cultural narratives in a structured and user-friendly format.
Mana Ruchulu is a Telugu-language recipe sharing and discovery platform built using Streamlit. It allows users to:
Create an account and log in securely
Upload recipes with text, images, videos, or audio instructions
View and interact with other users’ recipes
Participate in weekly cooking challenges
Post short-lived “stories” about their cooking experiences
Compete on a leaderboard based on points earned from contributions
The project supports both SQLite and Supabase backends, making it deployable on platforms like Hugging Face Spaces while being scalable for production use.
Its design emphasizes Telugu cultural heritage by supporting native script and traditional dish categorization.
Telugu Farmer Assistant is a free, AI-powered platform for farmers in Telangana and Andhra Pradesh. It provides crop disease diagnosis, soil-based crop planning, and real-time weather updates, all in Telugu, with an offline-first design for accessibility.
A family recipe sharing app built with Python for managing and sharing recipes across family members.
A Telugu proverbs chatbot built with Streamlit and SQLite to explore and preserve traditional Telugu proverbs.
An AI-powered, open-source Streamlit application for preserving Indian culture and diversity through multimedia corpus collection, with persistent data storage and secure authentication.
Desi Proverbs & Local Lore Collector is a lightweight, mobile-friendly Streamlit app to collect Indian proverbs, folk sayings, and cultural facts across languages. It auto-detects language, stores structured metadata (language, region, tags), and exports a clean corpus for research and cultural preservation. Built for low-bandwidth users, deployable on Hugging Face Spaces, and designed with an offline-first approach.
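As a rough illustration of what "structured metadata" and a "clean corpus export" could look like, here is a minimal sketch using only the Python standard library. The table name, column names, and the JSON Lines export format are assumptions made for this example and are not taken from the project; language auto-detection is left out, so the language code is passed in by the caller.

```python
import json
import sqlite3


def create_store(path=":memory:"):
    """Open a SQLite database and create the (hypothetical) entries table."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS entries ("
        "id INTEGER PRIMARY KEY, text TEXT NOT NULL, "
        "language TEXT, region TEXT, tags TEXT)"
    )
    return conn


def add_entry(conn, text, language, region, tags):
    """Store one proverb with its metadata; tags are kept as a JSON list."""
    cleaned = " ".join(text.split())  # collapse stray whitespace
    conn.execute(
        "INSERT INTO entries (text, language, region, tags) VALUES (?, ?, ?, ?)",
        (cleaned, language, region, json.dumps(sorted(tags), ensure_ascii=False)),
    )
    conn.commit()


def export_jsonl(conn):
    """Return the corpus as JSON Lines, one record per entry, in insert order."""
    rows = conn.execute(
        "SELECT text, language, region, tags FROM entries ORDER BY id"
    )
    lines = []
    for text, language, region, tags in rows:
        record = {
            "text": text,
            "language": language,
            "region": region,
            "tags": json.loads(tags),
        }
        # ensure_ascii=False keeps native scripts readable in the export
        lines.append(json.dumps(record, ensure_ascii=False))
    return "\n".join(lines)
```

Keeping one JSON object per line means the export can be appended to or streamed without loading the whole corpus, which suits low-bandwidth use.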
Gamanam is a Python Streamlit web application for sharing and exploring festivals, art, dance, and music. Users can create posts, view submissions from others, and interact with a cultural community. The app uses a SQLite database to store posts and user data.
Fitzor Bot is an open-source AI-powered fitness assistant that helps users stay healthy and active. It provides personalized workout recommendations, tracks fitness progress, answers health-related queries, and motivates users to achieve their goals. Built using Python, NLP, and chatbot frameworks, Fitzor Bot is designed for gyms, trainers, and individuals seeking smart health guidance.
Resume Analyzer is an open-source tool built with Python that analyzes resumes using NLP and ML. It extracts skills, education, and experience, then matches them with job descriptions to generate similarity scores and recommendations for recruiters and job seekers.
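The description does not say how the similarity scores are computed. One common baseline, shown here purely as an assumption and not as the project's method, is cosine similarity over word-count vectors of the resume and the job description. The function names and the tokenization rule are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9+#]+", text.lower())


def cosine_similarity(resume, job_description):
    """Cosine similarity between the word-count vectors of two texts (0.0 to 1.0)."""
    va = Counter(tokenize(resume))
    vb = Counter(tokenize(job_description))
    # Dot product over the tokens the two texts share
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm = math.sqrt(sum(c * c for c in va.values())) * math.sqrt(
        sum(c * c for c in vb.values())
    )
    return dot / norm if norm else 0.0
```

A score near 1 means the two texts use nearly the same words in similar proportions, and 0 means they share no tokens. A production tool would likely add stop-word removal, TF-IDF weighting, or embeddings on top of this.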
JanaBhasha is an open-source multilingual dictionary and corpus collection platform focused on preserving India's linguistic diversity. It enables users to contribute regional words, folk songs, and stories — both in text and audio. Using AI models like Whisper and Puter, the platform supports real-time transcription and translation.
SahayaSoochi is a Streamlit-based AI assistant that generates all types of letters in Telugu and English from user text or voice input.
Lok Katha is a corpus data collection app for preserving and sharing proverbs, folk tales, and oral traditions in multiple Indian languages. It allows contributors to submit text and audio, helping build an open, community-driven dataset for research, education, and cultural preservation.
This project is for recalling old spiritual things.
AI-powered offline-first Streamlit application for collecting corpus data (audio, video, image+caption, and text) in 11 Indian languages for SOAI 2025.
Festival Log – A Cultural Corpus Collection Engine
Festival Log is an open-source, AI-powered Streamlit application designed to document and preserve the rich festival traditions of India. Users can submit cultural information, stories, and personal experiences related to festivals in their local language. The app automatically translates each entry into Telugu using open-source AI models and displays all entries in a collective log. It is optimized for low-bandwidth areas, supports multilingual input, and contributes to building a parallel corpus for Indian language AI research.