Full Pipeline

Speech-to-Text → Translation → Text-to-Speech

Upload file

Google Drive URL

Accepts a file (audio, video, .srt, .txt, .json) or a folder of audio/video files (STT only). Must be shared publicly.

Pipeline Stages

Speech to Text

Translate

Text to Speech

Include Summary (Gemini)

Gemini API Key

Edits are saved to disk and used on the next pipeline run.

ASR Correction Prompt

Task: Act as an expert Dharma editor. Your goal is to clean up the English in an automated speech recognition transcript.

Source Context: This is an ASR (Automated Speech Recognition) transcript of an English interpreter translating for H.E. Garchen Rinpoche.

Instructions:

- Clean: Fix disfluencies (stutters, "and so", "um") and correct obvious ASR errors into logical English.
- Dharma Technical Terms: Ensure the following Tibetan Buddhist terms are spelled correctly:
"Om Ah Hung" (not Omaha Home/Oma-Hung)
"Vajra Recitation" (not Vatu/Vacheracitation)
"Samsara" and "Nirvana"
"Vajrayana" (not Vaturacitation/Ajayana)
"Samantabhadra Prayer" (not Desamanta Badra Preyam)
- Constraint: Do not include any internal citations, AI-generated tags, or text in square brackets (e.g., ) within the final SRT content.

The following numbered items are consecutive segments from an automatic speech recognition transcript. 
Using the full context of all segments, correct any ASR errors in each segment. Preserve the speaker's words and meaning as closely as possible. 
Output only the corrected segments in the exact same numbered format — one per line, nothing else.

Summary Prompt

Role: Act as an expert in Tibetan Buddhist philosophy and a translator familiar with the colloquial teaching style of Tibetan Rinpoches.

Task: Analyze the attached transcript. I am aware there are significant phonetic transcription errors. Please provide:

Executive Summary: The overarching "Main Theme" of this specific day's teaching.

Dharma Terminology Correction: A list of the key Tibetan terms the speaker likely used, corrected to their standard transliteration (Wylie or phonetic), and their English definitions (e.g., if the text says "Kunsalong," identify it as Kun slong / Motivation).

Chapter Breakdown: A timestamped outline of major topics. Please look specifically for:

The Opening: Preparatory remarks or motivation.

Analogies: Identify any metaphors used (like the "Tea and the Cup" or "Mirror").

Core Instructions: Key sections on Mind Training (Lojong), Ethics (Sila), or Wisdom (Prajna).

The Conclusion: Final advice to students and Dedication of Merit (Bsngo ba).

Constraint: Use the timestamps in the SRT to ensure the chapter breakdown is chronologically accurate.

Use {target_language} as a placeholder for the target language.

Translation Prompt

Status

Segments

Synthesized Audio