Add indic-tts skill - Indian TTS for everyone by ankitjh4 · Pull Request #28 · zocomputer/skills

ankitjh4 · 2026-03-04T17:04:29Z

Summary

Adds a comprehensive Indian-language AI toolkit built on Sarvam AI, a Bangalore-based AI company building models specifically for Indian languages.

About Sarvam AI

Sarvam AI is an Indian AI company building foundational models and APIs optimized for Indian languages. They provide state-of-the-art models for speech and text processing in 10+ Indian languages including Hindi, Bengali, Tamil, Telugu, Gujarati, Kannada, Malayalam, Marathi, Punjabi, Odia, and English.

Get API key: https://dashboard.sarvam.ai (free tier available)

Features

1. Text-to-Speech (TTS)

Model: Bulbul v3
Languages: 11 Indian languages
Speakers: 30+ voices (male and female)
Natural prosody and pronunciation

2. Document Intelligence

Extract text from PDFs and images (JPEG/PNG)
23 supported languages including Hindi, Bengali, Tamil, Telugu, Gujarati, Kannada, Malayalam, Marathi, Punjabi, Odia, Urdu, Assamese, Bodo, Dogri, Kashmiri, Konkani, Maithili, Manipuri, Nepali, Sanskrit, Santali, Sindhi, English
Output formats: Markdown, HTML, JSON
Full workflow: create job → upload → process → download results
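The four-step workflow above could be driven by a small polling loop. This is a minimal sketch only: `DocClient`, its method names, the status strings, and the `"hi-IN"` language code are hypothetical placeholders for illustration, not the Sarvam SDK or the actual contents of document_intelligence.py.

```python
import time
from typing import Callable, Protocol


class DocClient(Protocol):
    """Hypothetical interface; the real Sarvam client may look different."""

    def create_job(self, language: str, output_format: str) -> str: ...
    def upload(self, job_id: str, path: str) -> None: ...
    def start(self, job_id: str) -> None: ...
    def status(self, job_id: str) -> str: ...
    def download(self, job_id: str) -> str: ...


def extract_document(
    client: DocClient,
    path: str,
    language: str,
    output_format: str = "markdown",
    poll_seconds: float = 2.0,
    max_polls: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Run create job -> upload -> process -> download and return the result."""
    job_id = client.create_job(language, output_format)
    client.upload(job_id, path)
    client.start(job_id)
    for _ in range(max_polls):
        state = client.status(job_id)
        if state == "completed":  # assumed status string
            return client.download(job_id)
        if state == "failed":  # assumed status string
            raise RuntimeError(f"document job {job_id} failed")
        sleep(poll_seconds)
    raise TimeoutError(f"document job {job_id} did not finish in time")


class FakeClient:
    """In-memory stand-in used only to exercise the flow without a network."""

    def __init__(self, polls_until_done: int = 2) -> None:
        self.calls: list[str] = []
        self._remaining = polls_until_done

    def create_job(self, language: str, output_format: str) -> str:
        self.calls.append("create")
        return "job-1"

    def upload(self, job_id: str, path: str) -> None:
        self.calls.append("upload")

    def start(self, job_id: str) -> None:
        self.calls.append("process")

    def status(self, job_id: str) -> str:
        self._remaining -= 1
        return "completed" if self._remaining <= 0 else "running"

    def download(self, job_id: str) -> str:
        self.calls.append("download")
        return "# extracted text"
```

Passing `sleep` as a parameter keeps the loop testable: `extract_document(FakeClient(), "scan.pdf", "hi-IN", sleep=lambda s: None)` walks through all four steps without waiting.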

3. Text Processing

Chat/Completion (sarvam-m model) - OpenAI-compatible API
Translation (mayura:v1, sarvam-translate:v1) - 23 Indian languages
Transliteration - Convert between scripts (e.g., Devanagari ↔ Roman)
Language Detection - Auto-detect language of input text

4. Speech-to-Text with Translation

Three modes for different use cases:

REST API - Quick transcription for audio < 30 seconds
WebSocket - Real-time streaming transcription with 4 output modes (translated text, original transcript, both, or bilingual)
Batch Processing - Process multiple audio files with speaker diarization for meeting transcription
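One way to read the three modes is as a simple dispatch rule. The sketch below is an illustration under assumptions: `choose_stt_mode` is a hypothetical helper, and routing a single clip of 30 seconds or longer to batch is a guess, since the description only says REST is for audio under 30 seconds.

```python
REST_MAX_SECONDS = 30  # from the description: REST is for audio < 30 seconds


def choose_stt_mode(durations: list[float], realtime: bool = False) -> str:
    """Pick "rest", "websocket", or "batch" for a set of audio clips.

    durations: length of each audio file in seconds.
    realtime:  True when the audio is a live stream rather than files.
    """
    if realtime:
        return "websocket"
    if not durations:
        raise ValueError("at least one audio duration is required")
    if len(durations) == 1 and durations[0] < REST_MAX_SECONDS:
        return "rest"
    # Assumption: multiple files, or a single long file, go to batch.
    return "batch"
```

A 12-second voice note maps to `"rest"`, a live call to `"websocket"`, and a folder of meeting recordings to `"batch"`.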

Scripts

tts.py - Text-to-speech conversion
document_intelligence.py - PDF/image OCR extraction
text_processing.py - Chat, translation, transliteration, language detection
speech_to_text.py - STT via REST, WebSocket, or Batch

Setup

Add SARVAM_API_KEY to Zo secrets at Settings > Advanced

Changes

Added skills/sarvam-ai/ folder with SKILL.md and scripts/
4 comprehensive Python scripts covering all Sarvam APIs
Full documentation with usage examples
Updated manifest.json with skill metadata

- Added sarvam-ai based TTS skill supporting 11 Indian languages
- Includes API key enforcement via SARVAM_API_KEY secret
- Features 30+ voices with Bulbul v3 model
- Closes: adding Indian language TTS support to Zo skills

skeletor-js · 2026-04-04T12:08:36Z

Please clean up the packaging and resubmit.

Put the skill in the correct registry structure. Right now the PR layout is not clean.
Remove the stray SKILL.md.bak file.
Revert the unrelated manifest.json churn and keep the diff scoped to the actual skill.
Keep the setup instructions aligned with Zo Secrets. Do not drift into generic env-export patterns.
Make sure the PR only includes the files required for this skill and nothing else.

Add indic-tts skill - Indian TTS for everyone

a78ec4c


ankitjh4 wants to merge 1 commit into zocomputer:main from ankitjh4:add-indic-tts
