Improve README with detailed workflow, modes, and feature explanations

Description

The current README.md provides a solid overview of the Telugu Dictation App, including architecture, features, and setup instructions. However, it lacks detailed explanations of how the system works and how users interact with key functionalities.

This makes it difficult for new users and contributors to clearly understand the application flow, especially around recording modes, controls, and feature behavior.

Problems Identified

No clear end-to-end workflow of the system
Missing explanation of Push-to-Talk mode
Missing explanation of Continuous mode (with VAD)
Unclear behavior of Stop recording
Features are listed but not explained in detail
No performance/timing information (model loading, inference time)
Lack of clarity on user interaction flow

Expected Changes

Enhance the README by adding the following sections:

How It Works
- Step-by-step flow: audio capture → processing → transcription → UI update
Recording Modes
- Push-to-Talk (manual recording)
- Continuous Mode (auto with silence detection)
Controls
- Explanation of Start, Stop, Reset, Copy, Download
Performance
- Model loading time
- Inference/response time
- Silence detection delay
Feature Details
- Explanation of key features like VAD, auto-save, undo/redo, etc.
Stopping Behavior
- What happens when recording is stopped

Goal

Make the README:

Clear and easy to understand
Beginner-friendly
Self-explanatory without external guidance
Suitable for GitHub showcase and project evaluation

Type

Documentation Improvement