Skip to content

Added Telugu Food Dataset (Audio, Text, Images + metadata.csv) – Warangal Region

Sai Rishika Dasharadhi requested to merge Rishika11/datasets-handbook:main into main

This commit adds a structured dataset under the Food category as part of the Viswam AI contribution initiative from the Warangal district.

The dataset includes:

  • 7 audio recordings (~1 min each) in spoken Telugu, narrating local recipes like జొన్న రొట్టె, బెల్లం దోసెలు, పల్లీ పొడి, and others.
  • 20+ images of traditional Telangana food items such as గోంగూర పచ్చడి, బొబ్బట్లు, మజ్జిగ పులుసు, etc.
  • 6+ typed text stories in Telugu (.txt files) describing the background and preparation of common dishes.
  • A metadata.csv file that documents every resource with fields: File Name, Category, District, Language, and Description.

All files are organized under the following structure:

  • Food/
  • Audio/
  • Image/
  • Text/
  • metadata.csv

This submission follows the Viswam dataset format and is contributed as part of Swecha's AI internship to support the development of Telugu datasets for LLMs.

Merge request reports

Loading