Back to Catalog

Automate video voiceover & subtitles with Whisper, OpenAI TTS & FFmpeg

LenouarLenouar
223 views
2/3/2026
Official Page

Automate Video Editing with AI

Who’s it for

This workflow is ideal for content creators, training providers, agencies, and businesses that need to quickly turn raw videos into polished, captioned, or narrated content — without hiring editors or spending hours in manual editing tools.

It’s especially valuable for those who want full freedom of video editing capacity on their own platform instead of relying on costly SaaS tools with heavy limitations, recurring fees, or watermarks.
With this template, you get enterprise-grade AI editing self-hosted in your n8n, under your control.

Perfect for:

  • E-learning & educators producing accessible multilingual lessons
  • Corporate trainers automating internal tutorials and compliance walkthroughs
  • Social media teams creating captioned, high-engagement clips
  • Product teams generating quick demos and explainer videos

How it works / What it does

This template combines AI transcription, TTS voiceover, and video editing with FFmpeg into a single automated pipeline:

  • Voiceover Mode

    • Transcribes video with OpenAI Whisper
    • Cleans text using on-screen frames for accuracy
    • Generates natural AI voiceovers (EN/AR) with OpenAI Speech
    • Re-times the video to match the narration → synced professional dub
  • Subtitle Mode

    • Merges multiple video clips with FFmpeg
    • Transcribes or translates audio to generate SRT subtitles
    • Hardcodes captions directly into the final MP4 (style customizable)

✅ Output: a ready-to-publish MP4 with AI voiceover or burned-in subtitles.


Requirements

  • n8n self-hosted or cloud instance
  • Server with FFmpeg installed
  • OpenAI API key (Whisper + Speech)
  • (Optional) Google Drive credentials for delivery

How to customize the workflow

  • Swap voices in the OpenAI Speech node (change “ash” / “verse” to another).
  • Add languages by extending the dropdown in the Upload Video form.
  • Change subtitle style (font, margins, background) in the Apply Subtitle node.
  • Route final videos to Slack, Notion, or Dropbox instead of Drive.

Why it matters / Benefits

Save hours of manual editing — no need for Premiere/DaVinci for basic voiceovers and captions.
💰 Cut production costs — avoid hiring voice actors and editors for every video.
🌍 Multilingual content instantly — create English ↔ Arabic versions without re-recording.
📈 Boost engagement — subtitles increase video watch time on social media.
🎨 Professional results on autopilot — clean transcripts, natural AI voices, synced visuals.

💡 SaaS vs Self-Hosted

  • Descript, Kapwing, Synthesia → $30–100/month per user, limited exports, watermarks on free tiers.
  • This workflow → one-time template + your own server, only pay OpenAI usage (~$0.006/min for Whisper).
  • You get unlimited exports, no restrictions, and full data ownership.

What you win buying this template

  • A ready-made AI editing studio inside n8n.
  • End-to-end automation: upload → AI → polished video.
  • Works for solo creators and agencies delivering client-ready assets.
  • Scales with your content: handle 1 video or 100, just drop them in.

👉 By purchasing, you get:

  • Full workflow JSON file.
  • Email delivery with setup guidelines.
  • Access to a step-by-step walkthrough video.
  • Contact Us via services@quantumti.ae.

n8n Video Voiceover & Subtitle Automation with OpenAI Whisper, TTS, and FFmpeg

This n8n workflow automates the process of generating voiceovers and subtitles for video files using OpenAI's Whisper for transcription, OpenAI's Text-to-Speech (TTS) for voiceover generation, and FFmpeg for video processing. It provides a flexible way to handle video files from various sources and apply AI-powered audio and text enhancements.

Description

This workflow streamlines the complex task of adding voiceovers and subtitles to videos. It's designed to be highly adaptable, allowing you to trigger the process manually, via a form submission, or by integrating with file storage services like Google Drive or FTP. Once a video file is provided, the workflow leverages advanced AI models to transcribe the audio, generate a new voiceover, and prepare the necessary data for subtitle creation, ultimately enabling automated video localization and accessibility.

What it does

  1. Triggers: The workflow can be initiated in several ways:
    • Manually: By clicking 'Execute workflow' in the n8n editor.
    • On Form Submission: By submitting data through a configured n8n form.
    • From Google Drive: By detecting new or updated files in a specified Google Drive folder.
    • From FTP: By monitoring an FTP server for new or updated files.
    • Via SSH: Potentially by executing commands on a remote server to retrieve file information or trigger processes.
    • HTTP Request: By receiving an HTTP request, likely containing video file details or a URL.
  2. Edit Fields (Set): Prepares and structures the incoming data for subsequent processing, ensuring consistency and extracting relevant information like file paths or URLs.
  3. Merge: Combines data from different branches or previous steps, ensuring all necessary information (e.g., original video details, transcription results, TTS audio) is available for the next stages.
  4. OpenAI Integration:
    • Whisper (Transcription): Transcribes the audio track of the input video file into text.
    • Text-to-Speech (TTS) (Voiceover): Generates a new audio voiceover from the transcribed text, potentially in a different language or with a different voice.
  5. HTTP Request: Likely used to interact with an external FFmpeg service or API to perform video and audio manipulation, such as:
    • Merging the generated voiceover audio with the original video.
    • Embedding subtitles (SRT/VTT) into the video.
    • Converting video formats or adjusting properties.
  6. Code: Executes custom JavaScript code to handle complex data transformations, manipulate file paths, or prepare data for FFmpeg commands based on the AI outputs.
  7. Sticky Note: Provides documentation or notes within the workflow for clarity and understanding of specific steps.

Prerequisites/Requirements

  • n8n Instance: A running n8n instance (cloud or self-hosted).
  • OpenAI API Key: An API key for OpenAI services (Whisper for transcription, TTS for voice generation).
  • FFmpeg Service/API: Access to an FFmpeg installation or a service that can execute FFmpeg commands for video processing. This workflow uses an HTTP Request node, implying an external service.
  • Google Drive Account (Optional): If using the Google Drive trigger, an authenticated Google Drive credential in n8n.
  • FTP Server Access (Optional): If using the FTP trigger, FTP server credentials configured in n8n.
  • SSH Access (Optional): If using the SSH node, SSH credentials configured in n8n.

Setup/Usage

  1. Import the Workflow: Download the JSON provided and import it into your n8n instance.
  2. Configure Credentials:
    • OpenAI: Set up your OpenAI API key as a credential in n8n and link it to the "OpenAI" node.
    • Google Drive (Optional): If you plan to use the Google Drive trigger, configure your Google Drive OAuth2 credentials.
    • FTP (Optional): If you plan to use the FTP trigger, set up your FTP credentials.
    • SSH (Optional): If you plan to use the SSH node, configure your SSH credentials.
  3. Configure Trigger Node:
    • Choose your preferred trigger (Manual, Form Submission, Google Drive, FTP, SSH, HTTP Request).
    • If using the "On form submission" trigger, customize the form fields as needed.
    • If using Google Drive or FTP, specify the folder/path to monitor.
    • If using "HTTP Request", configure the endpoint and expected payload.
  4. Customize "Edit Fields (Set)" and "Code" Nodes: Adjust these nodes to match the exact structure of your input data and the desired output for FFmpeg commands. The "Code" node will likely contain logic for generating FFmpeg commands based on the transcription and TTS results.
  5. Configure HTTP Request (FFmpeg): Update the "HTTP Request" node to point to your FFmpeg service/API endpoint and configure the request body with the necessary FFmpeg commands generated by the "Code" node.
  6. Activate the Workflow: Once configured, activate the workflow to enable it to run automatically based on your chosen trigger.

This workflow provides a robust framework for automating video localization and accessibility, significantly reducing manual effort in creating voiceovers and subtitles.

Related Templates

AI multi-agent executive team for entrepreneurs with Gemini, Perplexity and WhatsApp

This workflow is an AI-powered multi-agent system built for startup founders and small business owners who want to automate decision-making, accountability, research, and communication, all through WhatsApp. The “virtual executive team,” is designed to help small teams to work smarter. This workflow sends you market analysis, market and sales tips, It can also monitor what your competitors are doing using perplexity (Research agent) and help you stay a head, or make better decisions. And when you feeling stuck with your start-up accountability director is creative enough to break the barrier 🎯 Core Features 🧑‍💼 1. President (Super Agent) Acts as the main controller that coordinates all sub-agents. Routes messages, assigns tasks, and ensures workflow synchronization between the AI Directors. 📊 2. Sales & Marketing Director Uses SerpAPI to search for market opportunities, leads, and trends. Suggests marketing campaigns, keywords, or outreach ideas. Can analyze current engagement metrics to adjust content strategy. 🕵️‍♀️ 3. Business Research Director Powered by Perplexity AI for competitive and market analysis. Monitors competitor moves, social media engagement, and product changes. Provides concise insights to help the founder adapt and stay ahead. ⏰ 4. Accountability Director Keeps the founder and executive team on track. Sends motivational nudges, task reminders, and progress reports. Promotes consistency and discipline — key traits for early-stage success. 🗓️ 5. Executive Secretary Handles scheduling, email drafting, and reminders. Connects with Google Calendar, Gmail, and Sheets through OAuth. Automates follow-ups, meeting summaries, and notifications directly via WhatsApp. 💬 WhatsApp as the Main Interface Interact naturally with your AI team through WhatsApp Business API. All responses, updates, and summaries are delivered to your chat. Ideal for founders who want to manage operations on the go. ⚙️ How It Works Trigger: The workflow starts from a WhatsApp Trigger node (via Meta Developer Account). Routing: The President agent analyzes the incoming message and determines which Director should handle it. Processing: Marketing or sales queries go to the Sales & Marketing Director. Research questions are handled by the Business Research Director. Accountability tasks are assigned to the Accountability Director. Scheduling or communication requests are managed by the Secretary. Collaboration: Each sub-agent returns results to the President, who summarizes and sends the reply back via WhatsApp. Memory: Context is maintained between sessions, ensuring personalized and coherent communication. 🧩 Integrations Required Gemini API – for general intelligence and task reasoning Supabase- for RAG and postgres persistent memory Perplexity API – for business and competitor analysis SerpAPI – for market research and opportunity scouting Google OAuth – to connect Sheets, Calendar, and Gmail WhatsApp Business API – for message triggers and responses 🚀 Benefits Acts like a team of tireless employees available 24/7. Saves time by automating research, reminders, and communication. Enhances accountability and strategy consistency for founders. Keeps operations centralized in a simple WhatsApp interface. 🧰 Setup Steps Create API credentials for: WhatsApp (via Meta Developer Account) Gemini, Perplexity, and SerpAPI Google OAuth (Sheets, Calendar, Gmail) Create a supabase account at supabase Add the credentials in the corresponding n8n nodes. Customize the system prompts for each Director based on your startup’s needs. Activate and start interacting with your virtual executive team on WhatsApp. Use Case You are a small organisation or start-up that can not afford hiring; marketing department, research department and secretar office, then this workflow is for you 💡 Need Customization? Want to tailor it for your startup or integrate with CRM tools like Notion or HubSpot? You can easily extend the workflow or contact the creator for personalized support. Consider adjusting the system prompt to suite your business

ShadrackBy Shadrack
331

🎓 How to transform unstructured email data into structured format with AI agent

This workflow automates the process of extracting structured, usable information from unstructured email messages across multiple platforms. It connects directly to Gmail, Outlook, and IMAP accounts, retrieves incoming emails, and sends their content to an AI-powered parsing agent built on OpenAI GPT models. The AI agent analyzes each email, identifies relevant details, and returns a clean JSON structure containing key fields: From – sender’s email address To – recipient’s email address Subject – email subject line Summary – short AI-generated summary of the email body The extracted information is then automatically inserted into an n8n Data Table, creating a structured database of email metadata and summaries ready for indexing, reporting, or integration with other tools. --- Key Benefits ✅ Full Automation: Eliminates manual reading and data entry from incoming emails. ✅ Multi-Source Integration: Handles data from different email providers seamlessly. ✅ AI-Driven Accuracy: Uses advanced language models to interpret complex or unformatted content. ✅ Structured Storage: Creates a standardized, query-ready dataset from previously unstructured text. ✅ Time Efficiency: Processes emails in real time, improving productivity and response speed. *✅ Scalability: Easily extendable to handle additional sources or extract more data fields. --- How it works This workflow automates the transformation of unstructured email data into a structured, queryable format. It operates through a series of connected steps: Email Triggering: The workflow is initiated by one of three different email triggers (Gmail, Microsoft Outlook, or a generic IMAP account), which constantly monitor for new incoming emails. AI-Powered Parsing & Structuring: When a new email is detected, its raw, unstructured content is passed to a central "Parsing Agent." This agent uses a specified OpenAI language model to intelligently analyze the email text. Data Extraction & Standardization: Following a predefined system prompt, the AI agent extracts key information from the email, such as the sender, recipient, subject, and a generated summary. It then forces the output into a strict JSON structure using a "Structured Output Parser" node, ensuring data consistency. Data Storage: Finally, the clean, structured data (the from, to, subject, and summarize fields) is inserted as a new row into a specified n8n Data Table, creating a searchable and reportable database of email information. --- Set up steps To implement this workflow, follow these configuration steps: Prepare the Data Table: Create a new Data Table within n8n. Define the columns with the following names and string type: From, To, Subject, and Summary. Configure Email Credentials: Set up the credential connections for the email services you wish to use (Gmail OAuth2, Microsoft Outlook OAuth2, and/or IMAP). Ensure the accounts have the necessary permissions to read emails. Configure AI Model Credentials: Set up the OpenAI API credential with a valid API key. The workflow is configured to use the model, but this can be changed in the respective nodes if needed. Connect the Nodes: The workflow canvas is already correctly wired. Visually confirm that the email triggers are connected to the "Parsing Agent," which is connected to the "Insert row" (Data Table) node. Also, ensure the "OpenAI Chat Model" and "Structured Output Parser" are connected to the "Parsing Agent" as its AI model and output parser, respectively. Activate the Workflow: Save the workflow and toggle the "Active" switch to ON. The triggers will begin polling for new emails according to their schedule (e.g., every minute), and the automation will start processing incoming messages. --- Need help customizing? Contact me for consulting and support or add me on Linkedin.

DavideBy Davide
1616

Send WooCommerce discount coupons to customers via WhatsApp using Rapiwa API

Who is this for? This workflow is ideal for WooCommerce store owners who want to automatically send promotional WhatsApp messages to their customers when new coupons are created. It’s designed for marketers and eCommerce managers looking to boost engagement, streamline coupon sharing, and track campaign performance effortlessly through Google Sheets. Overview This workflow listens for WooCommerce coupon creation events (coupon.created) and uses customer billing data to send promotional WhatsApp messages via the Rapiwa API. The flow formats the coupon data, cleans phone numbers, verifies WhatsApp registration with Rapiwa, sends the promotional message when verified, and logs each attempt to Google Sheets (separate sheets for verified/sent and unverified/not sent). What this Workflow Does Listens for new coupon creation events in WooCommerce via the WooCommerce Trigger node Retrieves all customer data from the WooCommerce store Processes customers in batches to control throughput Cleans and formats customer phone numbers for WhatsApp Verifies if phone numbers are valid WhatsApp accounts using Rapiwa API Sends personalized WhatsApp messages with coupon details to verified numbers Logs all activities to Google Sheets for tracking and analysis Handles both verified and unverified numbers appropriately Key Features Automated coupon distribution: Triggers when new coupons are created in WooCommerce Customer data retrieval: Fetches all customer information from WooCommerce Phone number validation: Verifies WhatsApp numbers before sending messages Personalized messaging: Includes customer name and coupon details in messages Dual logging system: Tracks both successful and failed message attempts Rate limiting: Uses batching and wait nodes to prevent API overload Data formatting: Structures coupon information for consistent messaging Google Sheet Column Structure A Google Sheet formatted like this ➤ sample The workflow uses a Google Sheet with the following columns to track coupon distribution: | name | number | email | address1 | couponCode | couponTitle | couponType | couponAmount | createDate | expireDate | validity | status | | ----------- | ------------- | --------------------------------------------------- | --------- | ---------- | -------------- | ---------- | ------------ | ------------------- | ------------------- | ---------- | -------- | | Abdul Mannan | 8801322827799 | contact@spagreen.net | mirpur-DOHS | 62dhryst | eid offer 2025 | percent | 20.00 | 2025-09-11 06:08:02 | 2025-09-15 00:00:00 | unverified | not sent | | Abdul Mannan | 8801322827799 | contact@spagreen.net | mirpur-DOHS | 62dhryst | eid offer 2025 | percent | 20.00 | 2025-09-11 06:08:02 | 2025-09-15 00:00:00 | verified | sent | Requirements n8n instance with the following nodes: WooCommerce Trigger, Code, SplitInBatches, HTTP Request, IF, Google Sheets, Wait WooCommerce store with API access Rapiwa account with API access for WhatsApp verification and messaging Google account with Sheets access Customer phone numbers in WooCommerce (stored in billing.phone field) Important Notes Phone Number Format: The workflow cleans phone numbers by removing all non-digit characters. Ensure your WooCommerce phone numbers are in a compatible format. API Rate Limits: Rapiwa and WooCommerce APIs have rate limits. Adjust batch sizes and wait times accordingly. Data Privacy: Ensure compliance with data protection regulations when sending marketing messages. Error Handling: The workflow logs unverified numbers but doesn't have extensive error handling. Consider adding error notifications for failed API calls. Message Content: The current message template references the first coupon only (coupons[0]). Adjust if you need to handle multiple coupons. Useful Links Dashboard: https://app.rapiwa.com Official Website: https://rapiwa.com Documentation: https://docs.rapiwa.com Support & Help WhatsApp: Chat on WhatsApp Discord: SpaGreen Community Facebook Group: SpaGreen Support Website: https://spagreen.net Developer Portfolio: Codecanyon SpaGreen

RapiwaBy Rapiwa
110