About Me
Hi, I'm K V Jaya Harsha a third-year undergraduate at IIIT Raichur with a strong interest in data engineering, MLOps, and scalable machine learning systems. I work at the intersection of cloud platforms, modern data pipelines, and AI workflows.
Currently, I’m focused on building production-grade ML pipelines using Azure and Databricks, applying best practices in modular design, monitoring, and orchestration. I’m also involved in full-stack web development and contribute to open-source projects at VISWAM AI.
My work spans from ingestion and transformation in cloud-based environments to deployment and observability in ML systems. I value clean design, automation, and reproducibility in every layer of the data and ML lifecycle.
Previously, I served as PR Head for student initiatives, where I led communications and outreach strategies.
| Programming Languages | Data Engineering / Big Data | Databases | Data Science / ML / Analytics |
|---|---|---|---|
|
|
|
|
|
| Web Frameworks | Cloud / Hosting / DevOps | Containers / Orchestration | CI/CD / Version Control |
|---|---|---|---|
|
|
|
|
|
| Other Tools / Libraries |
|---|
|
|
🧠 Machine Learning Projects
|
A simple machine learning model trained on the classic Iris dataset to classify flower species based on sepal and petal measurements. |
EcoCarb is an AI-powered carbon footprint predictor for the transportation sector. It estimates CO₂ emissions from travel data, helping users and policymakers make eco-conscious decisions. |
🤖 RAG / Agent Projects
|
An intelligent flight assistant powered by Groq's ultra-fast LLM acceleration and LLaMA 3 (8B). Built using LangGraph for modular agent flow and Gradio for an interactive UI. Handles itinerary queries, searches flights, and books tickets in natural language. |
A Retrieval-Augmented Generation (RAG) based chatbot built using Streamlit, ChromaDB, and OpenAI API. Designed to ingest documents and answer user queries contextually with accurate, grounded responses. |
🏗 ️ Data Engineering Projects
|
A real-time data engineering pipeline built on Microsoft Azure. It ingests earthquake data via a public API, processes it using Azure Data Factory and Databricks (following the Medallion Architecture), and stores it in Synapse Analytics for dashboarding and alerting. |
💡 Feel free to explore, clone, or contribute to any of these projects!










