Back to Catalog

Ebook to audiobook converter using MiniMax and FFmpeg

Jay Emp0Jay Emp0
634 views
2/3/2026
Official Page

Ebook to Audiobook Converter

Watch Demo

▶️ Watch Full Demo Video


What It Does

Turn any PDF ebook into a professional audiobook automatically. Upload a PDF, get an MP3 audiobook in your Google Drive. Perfect for listening to books, research papers, or documents on the go.

Example: Input PDFOutput Audiobook

Key Features

  • Upload PDF via web form → Get MP3 audiobook in Google Drive
  • Natural-sounding AI voices (MiniMax Speech-02-HD)
  • Automatic text extraction, chunking, and audio merging
  • Customizable voice, speed, and emotion settings
  • Processes long books in batches with smart rate limiting

Perfect For

  • Students: Turn textbooks into study audiobooks
  • Professionals: Listen to reports and documents while commuting
  • Content Creators: Repurpose written content as audio
  • Accessibility: Make content accessible to visually impaired users

Requirements

| Component | Details | |-----------|---------| | n8n | Self-hosted ONLY (cannot run on n8n Cloud) | | FFmpeg | Must be installed in your n8n environment | | Replicate API | For MiniMax TTS (Sign up here) | | Google Drive | OAuth2 credentials + "Audiobook" folder |

⚠️ Important: This workflow does NOT work on n8n Cloud because FFmpeg installation is required.

Quick Setup

1. Install FFmpeg

Docker users:

docker exec -it <n8n-container-name> /bin/bash
apt-get update && apt-get install -y ffmpeg

Native installation:

sudo apt-get install ffmpeg  # Linux
brew install ffmpeg          # macOS

2. Get API Keys

  • Replicate: Sign up at replicate.com and copy your API token
  • Google Drive: Set up OAuth2 in n8n and create an "Audiobook" folder in Drive

3. Import & Configure

  1. Import n8n.json into your n8n instance
  2. Replace the Replicate API token in the "MINIMAX TTS" node
  3. Configure Google Drive credentials and select your "Audiobook" folder
  4. Activate the workflow

Cost Estimate

| Component | Cost | |-----------|------| | MiniMax TTS API | ~$0.15 per 1000 characters (~$3-5 for average book) | | Google Drive Storage | Free (up to 15GB) | | Processing Time | ~1-2 minutes per 10 pages |

How It Works

Workflow Diagram

PDF Upload → Extract Text → Split into Chunks → Convert to Speech (batches of 5)
→ Merge Audio Files (FFmpeg) → Upload to Google Drive

The workflow uses four main modules:

  1. Extraction: PDF text extraction and intelligent chunking
  2. Conversion: MiniMax TTS processes text in batches
  3. Merging: FFmpeg combines all audio files seamlessly
  4. Upload: Final audiobook saved to Google Drive

Voice Settings (Customizable)

{
  "voice_id": "Friendly_Person",
  "emotion": "happy",
  "speed": 1,
  "pitch": 0
}

Available emotions: happy, neutral, sad, angry, excited

Limitations

  • ⚠️ Self-hosted n8n ONLY (not compatible with n8n Cloud)
  • PDF files only (not EPUB, MOBI, or scanned images)
  • Large books (500+ pages) take longer to process
  • Requires FFmpeg installation (see setup above)

Troubleshooting

FFmpeg not found?

  • Docker: Run docker exec -it <container> /bin/bash then apt-get install ffmpeg
  • Native: Run sudo apt-get install ffmpeg (Linux) or brew install ffmpeg (macOS)

Rate limit errors?

  • Increase wait time in the "WAITS FOR 5 SECONDS" node to 10-15 seconds

Google Drive upload fails?

  • Make sure you created the "Audiobook" folder in your Google Drive
  • Reconfigure OAuth2 credentials in n8n

Created by emp0 | More workflows: n8n Gallery

n8n Ebook to Audiobook Converter using Minimax and FFmpeg

This n8n workflow automates the process of converting ebook files into audiobooks using a combination of AI for text-to-speech and FFmpeg for audio processing. It leverages Minimax for generating speech from text and FFmpeg for handling the audio file creation and merging.

What it does

This workflow is designed to simplify the creation of audiobooks from text-based ebook files. Here's a step-by-step breakdown of its functionality:

  1. Trigger on Form Submission: The workflow starts when a form is submitted, likely providing details about the ebook to be converted.
  2. Read Ebook File from Disk: It reads the specified ebook file (e.g., EPUB, TXT) from the local disk where n8n is running.
  3. Extract Text from Ebook: The content of the ebook file is extracted into raw text.
  4. Loop Over Text Chunks: The extracted text is split into smaller chunks (batches) to be processed individually. This is crucial for handling large ebooks and API rate limits.
  5. Generate Audio with Minimax (HTTP Request): For each text chunk, an HTTP request is made to the Minimax API to convert the text into speech. This step generates an audio segment.
  6. Save Audio Segment to Disk: The generated audio segment from Minimax is saved as a temporary file on the local disk.
  7. Wait (Optional Delay): An optional delay is introduced between processing chunks, likely to manage API rate limits or system load.
  8. Execute FFmpeg Command: After all audio segments are generated and saved, an Execute Command node runs FFmpeg to merge all the individual audio segments into a single audiobook file.
  9. Upload to Google Drive: The final merged audiobook file is then uploaded to a specified Google Drive folder for storage and easy access.

Prerequisites/Requirements

To use this workflow, you will need:

  • n8n Instance: A running n8n instance with access to the local file system.
  • Minimax API Key: An API key for the Minimax text-to-speech service. This will be configured in the HTTP Request node.
  • FFmpeg Installed: FFmpeg must be installed and accessible from the command line on the server hosting your n8n instance.
  • Google Drive Account: A Google Drive account and credentials configured in n8n for uploading the final audiobook.
  • Ebook Files: Ebook files (e.g., .txt, .epub) that you wish to convert.

Setup/Usage

  1. Import the Workflow: Import the provided JSON into your n8n instance.
  2. Configure Credentials:
    • Set up your Google Drive credentials in n8n.
    • Configure the HTTP Request node with your Minimax API key and the appropriate endpoint for text-to-speech conversion.
  3. Adjust Node Settings:
    • On form submission: Customize the form fields if needed to accept ebook file paths or other relevant information.
    • Read/Write Files from Disk: Ensure the paths for reading ebook files and saving temporary audio segments are correctly configured for your n8n server's file system.
    • Extract from File: Verify the settings for extracting text from your specific ebook file types.
    • Loop Over Items (Split in Batches): Adjust the batch size if necessary based on your Minimax API limits and desired processing speed.
    • Execute Command: Update the FFmpeg command to correctly merge the audio files and specify the output format (e.g., MP3). Ensure the input file pattern matches how your temporary audio segments are named.
    • Google Drive: Specify the target folder in Google Drive where you want the audiobooks to be uploaded.
  4. Activate the Workflow: Once configured, activate the workflow. You can then trigger it by submitting the n8n form.

This workflow provides a powerful and flexible way to automate audiobook creation, bridging the gap between digital text and spoken word.

Related Templates

AI multi-agent executive team for entrepreneurs with Gemini, Perplexity and WhatsApp

This workflow is an AI-powered multi-agent system built for startup founders and small business owners who want to automate decision-making, accountability, research, and communication, all through WhatsApp. The “virtual executive team,” is designed to help small teams to work smarter. This workflow sends you market analysis, market and sales tips, It can also monitor what your competitors are doing using perplexity (Research agent) and help you stay a head, or make better decisions. And when you feeling stuck with your start-up accountability director is creative enough to break the barrier 🎯 Core Features 🧑‍💼 1. President (Super Agent) Acts as the main controller that coordinates all sub-agents. Routes messages, assigns tasks, and ensures workflow synchronization between the AI Directors. 📊 2. Sales & Marketing Director Uses SerpAPI to search for market opportunities, leads, and trends. Suggests marketing campaigns, keywords, or outreach ideas. Can analyze current engagement metrics to adjust content strategy. 🕵️‍♀️ 3. Business Research Director Powered by Perplexity AI for competitive and market analysis. Monitors competitor moves, social media engagement, and product changes. Provides concise insights to help the founder adapt and stay ahead. ⏰ 4. Accountability Director Keeps the founder and executive team on track. Sends motivational nudges, task reminders, and progress reports. Promotes consistency and discipline — key traits for early-stage success. 🗓️ 5. Executive Secretary Handles scheduling, email drafting, and reminders. Connects with Google Calendar, Gmail, and Sheets through OAuth. Automates follow-ups, meeting summaries, and notifications directly via WhatsApp. 💬 WhatsApp as the Main Interface Interact naturally with your AI team through WhatsApp Business API. All responses, updates, and summaries are delivered to your chat. Ideal for founders who want to manage operations on the go. ⚙️ How It Works Trigger: The workflow starts from a WhatsApp Trigger node (via Meta Developer Account). Routing: The President agent analyzes the incoming message and determines which Director should handle it. Processing: Marketing or sales queries go to the Sales & Marketing Director. Research questions are handled by the Business Research Director. Accountability tasks are assigned to the Accountability Director. Scheduling or communication requests are managed by the Secretary. Collaboration: Each sub-agent returns results to the President, who summarizes and sends the reply back via WhatsApp. Memory: Context is maintained between sessions, ensuring personalized and coherent communication. 🧩 Integrations Required Gemini API – for general intelligence and task reasoning Supabase- for RAG and postgres persistent memory Perplexity API – for business and competitor analysis SerpAPI – for market research and opportunity scouting Google OAuth – to connect Sheets, Calendar, and Gmail WhatsApp Business API – for message triggers and responses 🚀 Benefits Acts like a team of tireless employees available 24/7. Saves time by automating research, reminders, and communication. Enhances accountability and strategy consistency for founders. Keeps operations centralized in a simple WhatsApp interface. 🧰 Setup Steps Create API credentials for: WhatsApp (via Meta Developer Account) Gemini, Perplexity, and SerpAPI Google OAuth (Sheets, Calendar, Gmail) Create a supabase account at supabase Add the credentials in the corresponding n8n nodes. Customize the system prompts for each Director based on your startup’s needs. Activate and start interacting with your virtual executive team on WhatsApp. Use Case You are a small organisation or start-up that can not afford hiring; marketing department, research department and secretar office, then this workflow is for you 💡 Need Customization? Want to tailor it for your startup or integrate with CRM tools like Notion or HubSpot? You can easily extend the workflow or contact the creator for personalized support. Consider adjusting the system prompt to suite your business

ShadrackBy Shadrack
331

🎓 How to transform unstructured email data into structured format with AI agent

This workflow automates the process of extracting structured, usable information from unstructured email messages across multiple platforms. It connects directly to Gmail, Outlook, and IMAP accounts, retrieves incoming emails, and sends their content to an AI-powered parsing agent built on OpenAI GPT models. The AI agent analyzes each email, identifies relevant details, and returns a clean JSON structure containing key fields: From – sender’s email address To – recipient’s email address Subject – email subject line Summary – short AI-generated summary of the email body The extracted information is then automatically inserted into an n8n Data Table, creating a structured database of email metadata and summaries ready for indexing, reporting, or integration with other tools. --- Key Benefits ✅ Full Automation: Eliminates manual reading and data entry from incoming emails. ✅ Multi-Source Integration: Handles data from different email providers seamlessly. ✅ AI-Driven Accuracy: Uses advanced language models to interpret complex or unformatted content. ✅ Structured Storage: Creates a standardized, query-ready dataset from previously unstructured text. ✅ Time Efficiency: Processes emails in real time, improving productivity and response speed. *✅ Scalability: Easily extendable to handle additional sources or extract more data fields. --- How it works This workflow automates the transformation of unstructured email data into a structured, queryable format. It operates through a series of connected steps: Email Triggering: The workflow is initiated by one of three different email triggers (Gmail, Microsoft Outlook, or a generic IMAP account), which constantly monitor for new incoming emails. AI-Powered Parsing & Structuring: When a new email is detected, its raw, unstructured content is passed to a central "Parsing Agent." This agent uses a specified OpenAI language model to intelligently analyze the email text. Data Extraction & Standardization: Following a predefined system prompt, the AI agent extracts key information from the email, such as the sender, recipient, subject, and a generated summary. It then forces the output into a strict JSON structure using a "Structured Output Parser" node, ensuring data consistency. Data Storage: Finally, the clean, structured data (the from, to, subject, and summarize fields) is inserted as a new row into a specified n8n Data Table, creating a searchable and reportable database of email information. --- Set up steps To implement this workflow, follow these configuration steps: Prepare the Data Table: Create a new Data Table within n8n. Define the columns with the following names and string type: From, To, Subject, and Summary. Configure Email Credentials: Set up the credential connections for the email services you wish to use (Gmail OAuth2, Microsoft Outlook OAuth2, and/or IMAP). Ensure the accounts have the necessary permissions to read emails. Configure AI Model Credentials: Set up the OpenAI API credential with a valid API key. The workflow is configured to use the model, but this can be changed in the respective nodes if needed. Connect the Nodes: The workflow canvas is already correctly wired. Visually confirm that the email triggers are connected to the "Parsing Agent," which is connected to the "Insert row" (Data Table) node. Also, ensure the "OpenAI Chat Model" and "Structured Output Parser" are connected to the "Parsing Agent" as its AI model and output parser, respectively. Activate the Workflow: Save the workflow and toggle the "Active" switch to ON. The triggers will begin polling for new emails according to their schedule (e.g., every minute), and the automation will start processing incoming messages. --- Need help customizing? Contact me for consulting and support or add me on Linkedin.

DavideBy Davide
1616

Send WooCommerce discount coupons to customers via WhatsApp using Rapiwa API

Who is this for? This workflow is ideal for WooCommerce store owners who want to automatically send promotional WhatsApp messages to their customers when new coupons are created. It’s designed for marketers and eCommerce managers looking to boost engagement, streamline coupon sharing, and track campaign performance effortlessly through Google Sheets. Overview This workflow listens for WooCommerce coupon creation events (coupon.created) and uses customer billing data to send promotional WhatsApp messages via the Rapiwa API. The flow formats the coupon data, cleans phone numbers, verifies WhatsApp registration with Rapiwa, sends the promotional message when verified, and logs each attempt to Google Sheets (separate sheets for verified/sent and unverified/not sent). What this Workflow Does Listens for new coupon creation events in WooCommerce via the WooCommerce Trigger node Retrieves all customer data from the WooCommerce store Processes customers in batches to control throughput Cleans and formats customer phone numbers for WhatsApp Verifies if phone numbers are valid WhatsApp accounts using Rapiwa API Sends personalized WhatsApp messages with coupon details to verified numbers Logs all activities to Google Sheets for tracking and analysis Handles both verified and unverified numbers appropriately Key Features Automated coupon distribution: Triggers when new coupons are created in WooCommerce Customer data retrieval: Fetches all customer information from WooCommerce Phone number validation: Verifies WhatsApp numbers before sending messages Personalized messaging: Includes customer name and coupon details in messages Dual logging system: Tracks both successful and failed message attempts Rate limiting: Uses batching and wait nodes to prevent API overload Data formatting: Structures coupon information for consistent messaging Google Sheet Column Structure A Google Sheet formatted like this ➤ sample The workflow uses a Google Sheet with the following columns to track coupon distribution: | name | number | email | address1 | couponCode | couponTitle | couponType | couponAmount | createDate | expireDate | validity | status | | ----------- | ------------- | --------------------------------------------------- | --------- | ---------- | -------------- | ---------- | ------------ | ------------------- | ------------------- | ---------- | -------- | | Abdul Mannan | 8801322827799 | contact@spagreen.net | mirpur-DOHS | 62dhryst | eid offer 2025 | percent | 20.00 | 2025-09-11 06:08:02 | 2025-09-15 00:00:00 | unverified | not sent | | Abdul Mannan | 8801322827799 | contact@spagreen.net | mirpur-DOHS | 62dhryst | eid offer 2025 | percent | 20.00 | 2025-09-11 06:08:02 | 2025-09-15 00:00:00 | verified | sent | Requirements n8n instance with the following nodes: WooCommerce Trigger, Code, SplitInBatches, HTTP Request, IF, Google Sheets, Wait WooCommerce store with API access Rapiwa account with API access for WhatsApp verification and messaging Google account with Sheets access Customer phone numbers in WooCommerce (stored in billing.phone field) Important Notes Phone Number Format: The workflow cleans phone numbers by removing all non-digit characters. Ensure your WooCommerce phone numbers are in a compatible format. API Rate Limits: Rapiwa and WooCommerce APIs have rate limits. Adjust batch sizes and wait times accordingly. Data Privacy: Ensure compliance with data protection regulations when sending marketing messages. Error Handling: The workflow logs unverified numbers but doesn't have extensive error handling. Consider adding error notifications for failed API calls. Message Content: The current message template references the first coupon only (coupons[0]). Adjust if you need to handle multiple coupons. Useful Links Dashboard: https://app.rapiwa.com Official Website: https://rapiwa.com Documentation: https://docs.rapiwa.com Support & Help WhatsApp: Chat on WhatsApp Discord: SpaGreen Community Facebook Group: SpaGreen Support Website: https://spagreen.net Developer Portfolio: Codecanyon SpaGreen

RapiwaBy Rapiwa
110