👋 Hi, I'm Niteesh (@niteesh_ai)
Summer of AI 2025 – Viswam.ai | Swecha
Building Indic datasets and fine-tuning LLMs.
About Me
- AIML student (3rd year), India
- Interests: NLP, Flutter, Data Engineering, MLOps
- Currently: Corpus collection & LLM fine-tuning for Summer of AI 2025
🧰 Skills
Languages: Python, Java, C/C++
AI/DS: NumPy, Pandas, Scikit-learn, PyTorch/TensorFlow (basic)
Dev: Git, GitLab, Linux CLI, Docker (basic)
Mobile: Flutter/Dart (beginner–intermediate)
🎯 Internship Goals (2025)
- 80+ hours Audio/Video corpus
- 800+ Image/Text records
- Build a Collaborative Corpus Collection Engine (team project)
- Fine-tune an LLM for an Indic use-case
- Weekly updates in this README
🛠️ Quick Links
- Code: code.swecha.org/<your-username>
- Chat (Mattermost): chat.swecha.org
- Courses: courses.viswam.ai (Python Simple | Python Advanced | Learn AI)
- Corpus App/Web: corpus.swecha.org (and Android app)
Learning Path (Live Checklist)
Week 1–2
- Python Simple (basics, functions, files)
- Git/GitLab workflow, Issues/MRs
- Start corpus: 20h A/V + 200 Image/Text records
Week 3–4
- Python Advanced (OOP, generators, venv, packaging)
- Data cleaning + labeling pipeline
- Another 20h A/V + 200 Image/Text records
Week 5–8
- Learn AI (NLP basics, tokenization, embeddings)
- Fine-tune small LLM on collected corpus
- Deployment notes, evaluation metrics
📦 Projects
- Corpus Collection Engine (Team):
  - Role: Data pipeline & validation
  - Stack: Python, FastAPI, Postgres, MinIO, Docker
  - Status: (update weekly)
- LLM Fine-tuning (Indic):
  - Model: e.g., Llama-3-Instruct
  - Task: e.g., Q&A in Telugu
  - Data: My collected corpus (cleaned + split)
  - Metrics: e.g., BLEU/ROUGE/Accuracy
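The "cleaned + split" step for the fine-tuning data could look like the following. This is a minimal sketch, not the project's actual pipeline: the `split_corpus` helper, the 80/10/10 ratios, and the fixed seed are all assumptions chosen for illustration.

```python
import random


def split_corpus(records, val_frac=0.1, test_frac=0.1, seed=42):
    """Shuffle records reproducibly and cut them into train/val/test lists.

    The fractions and seed are illustrative defaults, not project settings.
    """
    items = list(records)
    # A dedicated Random instance keeps the shuffle reproducible
    # without touching the global random state.
    random.Random(seed).shuffle(items)
    n_test = int(len(items) * test_frac)
    n_val = int(len(items) * val_frac)
    test = items[:n_test]
    val = items[n_test:n_test + n_val]
    train = items[n_test + n_val:]
    return train, val, test


if __name__ == "__main__":
    train, val, test = split_corpus(range(100))
    print(len(train), len(val), len(test))
```

Fixing the seed means the same records land in the same split on every run, which keeps evaluation numbers comparable from week to week.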
Corpus Tracker (update daily)
| Date | Modality | Count/Hours | Source/Domain | Notes |
|---|---|---|---|---|
| 2025-08-23 | Audio | 2h | | |
| 2025-08-23 | Images | 40 | | |
Totals
- Audio/Video: 0 / 80h
- Images/Text: 0 / 800
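The totals can be recomputed from the tracker rows instead of being edited by hand. A minimal sketch, assuming each row is reduced to a `(modality, amount)` pair where a trailing `h` marks hours and a bare number is a record count; the `summarize` function and the `BUCKETS` mapping are hypothetical names, not part of any existing tool.

```python
from collections import defaultdict

# Targets from the internship goals: 80 hours of audio/video, 800 image/text records.
TARGETS = {"av_hours": 80.0, "records": 800}

# Hypothetical mapping from a tracker modality to the total it counts toward.
BUCKETS = {
    "audio": "av_hours",
    "video": "av_hours",
    "images": "records",
    "text": "records",
}


def summarize(rows):
    """Sum (modality, amount) rows into the two running totals.

    Amounts like "2h" are read as hours; bare numbers like "40" as record counts.
    """
    totals = defaultdict(float)
    for modality, amount in rows:
        bucket = BUCKETS[modality.strip().lower()]
        totals[bucket] += float(amount.strip().rstrip("h"))
    # Always report both totals, even when one has no rows yet.
    return {key: totals[key] for key in TARGETS}


if __name__ == "__main__":
    rows = [("Audio", "2h"), ("Images", "40")]
    totals = summarize(rows)
    print(f"Audio/Video: {totals['av_hours']:g} / {TARGETS['av_hours']:g}h")
    print(f"Images/Text: {totals['records']:g} / {TARGETS['records']}")
```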
Daily Log
- 2025-08-23: Set up README, enrolled in Python Simple, 2h audio collected.
- 2025-08-24:
🧪 Tech Notes (snippets I reuse)
- Virtual env: `python -m venv .venv && source .venv/bin/activate`
- Lint/format: `ruff check . && black .`
- Train script sample: `python train.py --epochs 3 --lr 3e-5 --batch 8`
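The train script sample implies a small command-line interface. A minimal sketch of the argument parsing, assuming the flags are exactly `--epochs`, `--lr`, and `--batch` as in the sample command. The defaults and the `parse_args` helper are hypothetical, and the training loop itself is left out.

```python
import argparse


def parse_args(argv=None):
    """Parse the flags used in the sample command; defaults are illustrative."""
    parser = argparse.ArgumentParser(description="Fine-tuning entry point (sketch).")
    parser.add_argument("--epochs", type=int, default=3,
                        help="number of passes over the training set")
    parser.add_argument("--lr", type=float, default=3e-5,
                        help="learning rate")
    parser.add_argument("--batch", type=int, default=8,
                        help="batch size")
    # argv=None makes argparse read sys.argv[1:], so the same function
    # serves both the command line and direct calls with an explicit list.
    return parser.parse_args(argv)


if __name__ == "__main__":
    args = parse_args()
    print(f"epochs={args.epochs} lr={args.lr} batch={args.batch}")
```

Taking `argv` as a parameter lets the parser be exercised with an explicit list such as `parse_args(["--epochs", "5"])` without touching the real command line.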
📫 Contact
- Email: [email protected]
- Chat handle (Mattermost): @
- LinkedIn/GitLab: @niteesh_ai
This README is my living portfolio for Summer of AI 2025. I'll update it weekly with progress and artifacts.
Member since August 13, 2025