AI chatbot for Max Messenger with voice recognition (GigaChat +SaluteSpeech)
Name: AI Chatbot for Max Messenger with Voice Recognition (GigaChat + Sber)
Description:
How it works
This workflow powers an intelligent, conversational AI bot for Max messenger that can understand and respond to both text and voice messages. The bot uses GigaChat AI with built-in memory, allowing it to remember the conversation history for each unique user and answer follow-up questions. Voice messages are transcribed using Sber SmartSpeech. It's a complete solution for creating an engaging, automated assistant within your Max bot, using Russian AI services.
Step-by-step
- Max Trigger: The workflow starts when the Max Trigger node receives a new message sent to your Max bot.
- Access Control: The Check User node verifies the sender's user ID against an allowed list. This prevents unauthorized users from accessing your bot.
- Access Denied Response: If the user is not authorized, the Access Denied node sends a polite rejection message.
- Message Type Routing: The Text/Attachment (Switch) node checks if the message contains plain text or has attachments (voice, photo, file).
- Attachment Processing: If an attachment is detected, the Download Attachment (HTTP Request) node retrieves it, and the Attachment Router (Switch) node determines its type (voice, photo, or file).
- Voice Transcription: For voice messages, the workflow gets a Sber access token via Get Access Token (HTTP Request), merges it with the audio file, and sends it to Get Response (HTTP Request) which uses Sber SmartSpeech API to transcribe the audio to text.
- Input Unification: The Voice to Prompt node converts transcribed text into a prompt, while Text to Prompt does the same for plain text messages. Both paths merge at the Combine node.
- AI Agent Processing: The unified prompt is passed to the AI Agent, powered by GigaChat Model and using Simple Memory to retain the last 10 messages per user (using Max
user_idas the session key). - Response Delivery: The AI-generated response is sent back to the user via the Send Message node.
Set up steps
Estimated set up time: 15 minutes
- Get Max bot credentials: Visit https://business.max.ru/ to create a bot and obtain API credentials. Add these credentials to Max Trigger, Send Message, and Access Denied nodes.
- Add GigaChat credentials: Register for GigaChat API access and add your credentials to the GigaChat Model node.
- Add Sber credentials: Obtain Sber SmartSpeech API credentials and add them to Get Access Token and Get Response nodes (HTTP Header Auth).
- Configure access control: Open the Check User node and change the
user_idvalue (currently 50488534) to your own Max user ID. This ensures only you can use the bot during testing. - Customize bot personality: Open the AI Agent node and edit the system message to change the bot's name, behavior, and add your own contact information or links.
- Test the bot: Activate the workflow and send a text or voice message to your Max bot to verify it responds correctly.
Notes
This workflow is specifically designed for Russian-speaking users and uses Russian AI services (GigaChat and Sber SmartSpeech) as alternatives to OpenAI. Make sure you have valid API access to both services before setting up this workflow.
n8n AI Chatbot with Voice Recognition and GigaChat
This n8n workflow creates a sophisticated AI chatbot that integrates with a messenger service (likely Max Messenger, given the directory name, though the JSON doesn't explicitly state it) and leverages GigaChat for conversational AI. It includes voice recognition capabilities and a "salute speech" feature, making the interaction more dynamic and user-friendly.
The workflow is designed to process incoming messages, determine if they contain voice data, transcribe the voice, interact with an AI agent for a response, and then potentially convert the AI's text response back into speech.
What it does
- Receives Incoming Messages: The workflow is triggered by incoming messages, likely from a messenger platform.
- Checks for Voice Data: It uses a conditional
Ifnode to determine if the incoming message contains voice data (e.g., an audio file). - Transcribes Voice (if present): If voice data is detected, an
HTTP Requestnode is used to send the audio to a voice recognition service (like GigaChat's speech-to-text API or a similar service) for transcription. - Prepares Text for AI: The transcribed text (or the original text message if no voice was present) is prepared for the AI agent.
- Manages Conversation History: A
Simple Memorynode is used to maintain the conversational context, allowing the AI to remember previous turns in the conversation. - Generates AI Response: An
AI Agent(powered by LangChain) processes the message and the conversation history to generate a relevant response using GigaChat. - Prepares AI Response for Messenger: The AI's text response is formatted for delivery back to the messenger.
- Synthesizes Speech (Optional/Implied): While not explicitly shown with a dedicated text-to-speech node in this JSON snippet, the "salute speech" in the directory name implies that the AI's text response might be converted back into speech before being sent to the user.
- Sends Response: The final response (text or speech) is sent back to the user via the messenger.
Prerequisites/Requirements
- n8n Instance: A running n8n instance.
- Messenger Integration: Access and credentials for the messenger platform (e.g., Max Messenger) to send and receive messages. This typically involves a webhook trigger and an HTTP Request node for sending responses.
- GigaChat API Key: An API key for GigaChat or a similar large language model service to power the
AI Agent. - Voice Recognition Service: Access to a speech-to-text API (e.g., GigaChat's voice recognition, Google Cloud Speech-to-Text, or another compatible service) for transcribing audio messages.
- Text-to-Speech Service (Optional): If "salute speech" implies voice output, a text-to-speech API would be required.
Setup/Usage
- Import the Workflow: Download the provided JSON and import it into your n8n instance.
- Configure Messenger Trigger: Set up the webhook trigger (not shown in this JSON, but implied by the incoming message processing) to receive messages from your chosen messenger platform.
- Configure GigaChat/LLM Credentials: Update the
AI Agentnode with your GigaChat API key or credentials for your chosen Large Language Model. - Configure Voice Recognition HTTP Request:
- Modify the
HTTP Requestnode (ID 19) to point to your preferred speech-to-text API endpoint. - Ensure the request body and headers are correctly configured to send the audio data and receive the transcription.
- Add any necessary authentication for the speech-to-text service.
- Modify the
- Configure Messenger Response: Set up the final
HTTP Requestnode (or a dedicated messenger node, if available) to send the AI's response back to the user in the messenger. - Activate the Workflow: Once configured, activate the workflow to start processing messages.
Related Templates
Track competitor SEO keywords with Decodo + GPT-4.1-mini + Google Sheets
This workflow automates competitor keyword research using OpenAI LLM and Decodo for intelligent web scraping. Who this is for SEO specialists, content strategists, and growth marketers who want to automate keyword research and competitive intelligence. Marketing analysts managing multiple clients or websites who need consistent SEO tracking without manual data pulls. Agencies or automation engineers using Google Sheets as an SEO data dashboard for keyword monitoring and reporting. What problem this workflow solves Tracking competitor keywords manually is slow and inconsistent. Most SEO tools provide limited API access or lack contextual keyword analysis. This workflow solves that by: Automatically scraping any competitor’s webpage with Decodo. Using OpenAI GPT-4.1-mini to interpret keyword intent, density, and semantic focus. Storing structured keyword insights directly in Google Sheets for ongoing tracking and trend analysis. What this workflow does Trigger — Manually start the workflow or schedule it to run periodically. Input Setup — Define the website URL and target country (e.g., https://dev.to, france). Data Scraping (Decodo) — Fetch competitor web content and metadata. Keyword Analysis (OpenAI GPT-4.1-mini) Extract primary and secondary keywords. Identify focus topics and semantic entities. Generate a keyword density summary and SEO strength score. Recommend optimization and internal linking opportunities. Data Structuring — Clean and convert GPT output into JSON format. Data Storage (Google Sheets) — Append structured keyword data to a Google Sheet for long-term tracking. Setup Prerequisites If you are new to Decode, please signup on this link visit.decodo.com n8n account with workflow editor access Decodo API credentials OpenAI API key Google Sheets account connected via OAuth2 Make sure to install the Decodo Community node. Create a Google Sheet Add columns for: primarykeywords, seostrengthscore, keyworddensity_summary, etc. Share with your n8n Google account. Connect Credentials Add credentials for: Decodo API credentials - You need to register, login and obtain the Basic Authentication Token via Decodo Dashboard OpenAI API (for GPT-4o-mini) Google Sheets OAuth2 Configure Input Fields Edit the “Set Input Fields” node to set your target site and region. Run the Workflow Click Execute Workflow in n8n. View structured results in your connected Google Sheet. How to customize this workflow Track Multiple Competitors → Use a Google Sheet or CSV list of URLs; loop through them using the Split In Batches node. Add Language Detection → Add a Gemini or GPT node before keyword analysis to detect content language and adjust prompts. Enhance the SEO Report → Expand the GPT prompt to include backlink insights, metadata optimization, or readability checks. Integrate Visualization → Connect your Google Sheet to Looker Studio for SEO performance dashboards. Schedule Auto-Runs → Use the Cron Node to run weekly or monthly for competitor keyword refreshes. Summary This workflow automates competitor keyword research using: Decodo for intelligent web scraping OpenAI GPT-4.1-mini for keyword and SEO analysis Google Sheets for live tracking and reporting It’s a complete AI-powered SEO intelligence pipeline ideal for teams that want actionable insights on keyword gaps, optimization opportunities, and content focus trends, without relying on expensive SEO SaaS tools.
AI multi-agent executive team for entrepreneurs with Gemini, Perplexity and WhatsApp
This workflow is an AI-powered multi-agent system built for startup founders and small business owners who want to automate decision-making, accountability, research, and communication, all through WhatsApp. The “virtual executive team,” is designed to help small teams to work smarter. This workflow sends you market analysis, market and sales tips, It can also monitor what your competitors are doing using perplexity (Research agent) and help you stay a head, or make better decisions. And when you feeling stuck with your start-up accountability director is creative enough to break the barrier 🎯 Core Features 🧑💼 1. President (Super Agent) Acts as the main controller that coordinates all sub-agents. Routes messages, assigns tasks, and ensures workflow synchronization between the AI Directors. 📊 2. Sales & Marketing Director Uses SerpAPI to search for market opportunities, leads, and trends. Suggests marketing campaigns, keywords, or outreach ideas. Can analyze current engagement metrics to adjust content strategy. 🕵️♀️ 3. Business Research Director Powered by Perplexity AI for competitive and market analysis. Monitors competitor moves, social media engagement, and product changes. Provides concise insights to help the founder adapt and stay ahead. ⏰ 4. Accountability Director Keeps the founder and executive team on track. Sends motivational nudges, task reminders, and progress reports. Promotes consistency and discipline — key traits for early-stage success. 🗓️ 5. Executive Secretary Handles scheduling, email drafting, and reminders. Connects with Google Calendar, Gmail, and Sheets through OAuth. Automates follow-ups, meeting summaries, and notifications directly via WhatsApp. 💬 WhatsApp as the Main Interface Interact naturally with your AI team through WhatsApp Business API. All responses, updates, and summaries are delivered to your chat. Ideal for founders who want to manage operations on the go. ⚙️ How It Works Trigger: The workflow starts from a WhatsApp Trigger node (via Meta Developer Account). Routing: The President agent analyzes the incoming message and determines which Director should handle it. Processing: Marketing or sales queries go to the Sales & Marketing Director. Research questions are handled by the Business Research Director. Accountability tasks are assigned to the Accountability Director. Scheduling or communication requests are managed by the Secretary. Collaboration: Each sub-agent returns results to the President, who summarizes and sends the reply back via WhatsApp. Memory: Context is maintained between sessions, ensuring personalized and coherent communication. 🧩 Integrations Required Gemini API – for general intelligence and task reasoning Supabase- for RAG and postgres persistent memory Perplexity API – for business and competitor analysis SerpAPI – for market research and opportunity scouting Google OAuth – to connect Sheets, Calendar, and Gmail WhatsApp Business API – for message triggers and responses 🚀 Benefits Acts like a team of tireless employees available 24/7. Saves time by automating research, reminders, and communication. Enhances accountability and strategy consistency for founders. Keeps operations centralized in a simple WhatsApp interface. 🧰 Setup Steps Create API credentials for: WhatsApp (via Meta Developer Account) Gemini, Perplexity, and SerpAPI Google OAuth (Sheets, Calendar, Gmail) Create a supabase account at supabase Add the credentials in the corresponding n8n nodes. Customize the system prompts for each Director based on your startup’s needs. Activate and start interacting with your virtual executive team on WhatsApp. Use Case You are a small organisation or start-up that can not afford hiring; marketing department, research department and secretar office, then this workflow is for you 💡 Need Customization? Want to tailor it for your startup or integrate with CRM tools like Notion or HubSpot? You can easily extend the workflow or contact the creator for personalized support. Consider adjusting the system prompt to suite your business
🎓 How to transform unstructured email data into structured format with AI agent
This workflow automates the process of extracting structured, usable information from unstructured email messages across multiple platforms. It connects directly to Gmail, Outlook, and IMAP accounts, retrieves incoming emails, and sends their content to an AI-powered parsing agent built on OpenAI GPT models. The AI agent analyzes each email, identifies relevant details, and returns a clean JSON structure containing key fields: From – sender’s email address To – recipient’s email address Subject – email subject line Summary – short AI-generated summary of the email body The extracted information is then automatically inserted into an n8n Data Table, creating a structured database of email metadata and summaries ready for indexing, reporting, or integration with other tools. --- Key Benefits ✅ Full Automation: Eliminates manual reading and data entry from incoming emails. ✅ Multi-Source Integration: Handles data from different email providers seamlessly. ✅ AI-Driven Accuracy: Uses advanced language models to interpret complex or unformatted content. ✅ Structured Storage: Creates a standardized, query-ready dataset from previously unstructured text. ✅ Time Efficiency: Processes emails in real time, improving productivity and response speed. *✅ Scalability: Easily extendable to handle additional sources or extract more data fields. --- How it works This workflow automates the transformation of unstructured email data into a structured, queryable format. It operates through a series of connected steps: Email Triggering: The workflow is initiated by one of three different email triggers (Gmail, Microsoft Outlook, or a generic IMAP account), which constantly monitor for new incoming emails. AI-Powered Parsing & Structuring: When a new email is detected, its raw, unstructured content is passed to a central "Parsing Agent." This agent uses a specified OpenAI language model to intelligently analyze the email text. Data Extraction & Standardization: Following a predefined system prompt, the AI agent extracts key information from the email, such as the sender, recipient, subject, and a generated summary. It then forces the output into a strict JSON structure using a "Structured Output Parser" node, ensuring data consistency. Data Storage: Finally, the clean, structured data (the from, to, subject, and summarize fields) is inserted as a new row into a specified n8n Data Table, creating a searchable and reportable database of email information. --- Set up steps To implement this workflow, follow these configuration steps: Prepare the Data Table: Create a new Data Table within n8n. Define the columns with the following names and string type: From, To, Subject, and Summary. Configure Email Credentials: Set up the credential connections for the email services you wish to use (Gmail OAuth2, Microsoft Outlook OAuth2, and/or IMAP). Ensure the accounts have the necessary permissions to read emails. Configure AI Model Credentials: Set up the OpenAI API credential with a valid API key. The workflow is configured to use the model, but this can be changed in the respective nodes if needed. Connect the Nodes: The workflow canvas is already correctly wired. Visually confirm that the email triggers are connected to the "Parsing Agent," which is connected to the "Insert row" (Data Table) node. Also, ensure the "OpenAI Chat Model" and "Structured Output Parser" are connected to the "Parsing Agent" as its AI model and output parser, respectively. Activate the Workflow: Save the workflow and toggle the "Active" switch to ON. The triggers will begin polling for new emails according to their schedule (e.g., every minute), and the automation will start processing incoming messages. --- Need help customizing? Contact me for consulting and support or add me on Linkedin.