Build a WhatsApp assistant with memory, Google Suite & multi-AI research and imaging
The "WhatsApp Productivity Assistant with Memory and AI Imaging" is a comprehensive n8n workflow that transforms your WhatsApp into a powerful, multi-talented AI assistant. It's designed to handle a wide range of tasks by understanding user messages, analyzing images, and connecting to various external tools and services. The assistant can hold natural conversations, remember past interactions using a MongoDB vector store (RAG), and decide which tool is best suited for a user's request. Whether you need to check your schedule, research a topic, get the latest news, create an image, or even analyze a picture you send, this workflow orchestrates it all seamlessly through a single WhatsApp chat interface.
The workflow is structured into several interconnected components:
- WhatsApp Trigger & Incoming Message Processing: This is the entry point, starting when a message (text or image) is received via WhatsApp. A
Route Message by Type (Image/Text)node then intelligently routes the message based on its content type. ATyping....node sends a typing indicator to the user for a better experience. If an image is received, it's downloaded, processed via an HTTP Request, and analyzed by theAnalyze imagenode. TheCode1node then standardizes both text and image analysis output into a single, unified input for the main AI agent. - Core AI Agent: This is the brain of the operation. The
AI Agent1node receives the user's input, maintains short-term conversational memory usingSimple Memory, and uses a powerful language model (gpt-oss-120b2orgpt-oss-120b1) to decide which tool or sub-agent to use. It orchestrates all the other agents and tools. - Productivity Tools Agent: This group of nodes connects the assistant to your personal productivity suite. It includes sub-agents and tools for managing
Google Calendar,Google Tasks, andGmail, allowing you to schedule events, manage to-dos, and read emails. It leverages a language model (gpt-4.1-miniorgemini-2.5-flash) for understanding and executing commands within these tools. - Research Tool Agent: This agent handles all research-related queries. It has access to multiple search tools (
Brave Web Search,Brave News Search,Wikipedia,Tavily, and a customperprlexciasearch) to find the most accurate and up-to-date information from the web. It uses a language model (gpt-oss-120borgpt-4.1-nanoChat Model1) for reasoning. - Long-Term Memory Webhook:
A dedicated sub-workflow (
Webhook2) that processes conversation history, extracts key information usingExtract Memory Info, and stores it in aMongoDB Atlas Vector Storefor long-term memory. This allows the AI agent to remember past preferences and facts. - Image Generation Webhook:
A specialized sub-workflow (
Webhook3) triggered when a user asks to create an image. It uses a dedicatedAI AgentwithMongoDB Atlas Vector Store1for contextual image prompt generation,Clean Prompt Text1to refine the prompt, anHTTP Requestto an external image generation API (e.g., Together.xyz), and then converts and sends the generated image back to the user via WhatsApp.
Use Cases
- Personal Assistant: Schedule appointments, create tasks, read recent emails, and manage your daily agenda directly from WhatsApp.
- Information Retrieval: Ask any factual, news, or research-based question and get real-time answers from various web sources.
- Creative Content Generation: Request the AI to generate images based on your descriptions for logos, artwork, or social media content.
- Smart Communication: Engage in natural, contextual conversations with an AI that remembers past interactions.
- Image Analysis: Send an image and ask the AI to describe its contents or answer questions about it.
Pre-conditions
Before importing and running this template, you will need:
- Self-hosted n8n Instance: This template requires a self-hosted n8n instance as it uses webhooks that need public accessibility.
- WhatsApp Business Account: A Meta Developer Account configured for WhatsApp Business Platform API access.
- MongoDB Atlas Account: A MongoDB Atlas cluster with a database and collection set up for the vector store.
- Google Cloud Project: Configured with API access for Google Calendar, Google Tasks, and Gmail.
- API Keys/Accounts for:
- OpenWeatherMap: For weather forecasts.
- Groq, OpenRouter, or Vercel AI Gateway: For various Language Models (e.g.,
gpt-oss-120b,gpt-5-nano,gpt-4o-mini). - Mistral Cloud: For embedding models (e.g.,
codestral-embed-2505). - Brave Search: For web and news searches.
- Tavily API: For structured search results.
- Together.xyz or similar Image Generation API: For creating images.
- Perplexity API (or self-hosted instance): For the
perprlexciatool (the current URLhttp://self hoseted perplexcia/api/searchimplies a self-hosted or custom endpoint).
- Publicly Accessible URLs: Your n8n instance and any custom webhook endpoints (like
perprlexcia) must be publicly accessible.
Requirements (n8n Credentials)
You will need to set up the following credentials within your n8n instance:
- WhatsApp OAuth account: For the
WhatsApp Triggernode. - WhatsApp account: For
Send message2,Send message3,Download media, andTyping....nodes. - Google Palm Api account: For
Analyze image,Google Gemini Chat Model,gemini-2.5-flash, andGoogle Gemini Chat Model5nodes. - OpenWeatherMap account: For the
Get Weather Forecastnode. - Groq account: For
gpt-oss-120bnode. - Google Calendar OAuth2Api account: For the Google Calendar tools.
- MongoDB account: For
MongoDB Atlas Vector Storenodes. - OpenRouter account: For
gpt-5-nanoandgpt-4.1-nanoChat Model1nodes. - Gmail account : For
Get many messagesandGet a messagenodes (ensure correct Gmail OAuth2 setup for each). - Google Tasks account: For the Google Tasks tools.
- Bearer Auth account: For
HTTP Request5(used in media download). - Brave Search account: For
Brave Web SearchandBrave News Searchnodes. - Vercel Ai Gateway Api account: For
gpt-4.1-mini,gpt-oss-120b,gpt-oss-120b2, andgpt-4.1-nanonodes. - HTTP Header Auth account: For
Tavily web search(create a new one named "Tavily API Key" withAuthorization: Bearer YOUR_TAVILY_API_KEY) andHTTP Request(for Together.xyz, e.g., "Together.xyz API Key"). - Mistral Cloud account: For
codestral-embed-2505,codestral-embed-, andcodestral-embed-2506nodes.
n8n WhatsApp Assistant with Multi-AI Research and Imaging
This n8n workflow creates a versatile WhatsApp assistant capable of engaging in conversational AI, performing multi-AI research, and generating images. It leverages various AI models and tools to provide a rich and interactive experience directly through WhatsApp.
What it does
This workflow orchestrates a sophisticated AI assistant accessible via WhatsApp:
- Receives WhatsApp Messages: It listens for incoming messages on a configured WhatsApp Business Cloud account.
- Initial Message Processing: The incoming message is processed to extract relevant information.
- Conditional AI Routing: It intelligently routes the message to different AI models or tools based on the message content.
- AI Agent: If the message requires complex reasoning or tool usage, it's directed to an AI Agent.
- Basic LLM Chain: For simpler conversational responses, it uses a Basic LLM Chain.
- AI Agent Capabilities:
- Memory: The AI Agent utilizes a Simple Memory to maintain conversation context.
- Research (Wikipedia): It can perform research using the Wikipedia tool to answer factual questions.
- Thinking Tool: A "Think" tool is available for the agent to process complex requests internally before responding.
- AI Agent Tool: The agent can potentially call other AI agents as tools for specialized tasks.
- Image Generation: Although not explicitly named, the presence of
Convert to FileandDefault Data Loadersuggests potential for image processing or generation, likely through an external API called by theHTTP Requestnode.
- Language Model Selection: The workflow supports multiple Large Language Models (LLMs) for flexibility:
- Google Gemini Chat Model
- Groq Chat Model
- OpenRouter Chat Model
- Vercel AI Gateway Chat Model
- Google Gemini (as a general AI node)
- Embeddings for Context: It uses Mistral Cloud Embeddings, indicating the ability to convert text into numerical representations for similarity search or context retrieval, likely with a vector store.
- Vector Store Integration: A MongoDB Atlas Vector Store is integrated, suggesting the workflow can store and retrieve conversational history or knowledge base documents for enhanced memory and context.
- Sends WhatsApp Replies: After processing, the workflow sends the AI-generated response back to the user via WhatsApp.
- HTTP Requests: Generic HTTP requests can be made, likely for integrating with other external APIs for specialized tasks (e.g., image generation APIs, custom data sources).
- Data Transformation: The
Edit Fields (Set)andConvert to Filenodes are used for data manipulation and file handling within the workflow.
Prerequisites/Requirements
To set up and run this workflow, you will need:
- n8n Instance: A running n8n instance.
- WhatsApp Business Cloud Account: Configured with appropriate credentials for the WhatsApp Trigger and WhatsApp Business Cloud nodes.
- AI API Keys/Credentials:
- Google Gemini: API key for Google Gemini Chat Model and Google Gemini node.
- Groq: API key for Groq Chat Model.
- OpenRouter: API key for OpenRouter Chat Model.
- Vercel AI Gateway: API key for Vercel AI Gateway Chat Model.
- Mistral Cloud Embeddings: API key for Mistral Cloud Embeddings.
- MongoDB Atlas Account: Credentials for the MongoDB Atlas Vector Store.
- OpenAI API Key (Implicit, often used with LangChain agents if not explicitly configured with other LLMs).
- Wikipedia Access: The Wikipedia tool typically doesn't require separate credentials but relies on external access.
Setup/Usage
- Import the Workflow: Download the provided JSON and import it into your n8n instance.
- Configure WhatsApp Credentials:
- Set up your WhatsApp Business Cloud credentials in the
WhatsApp TriggerandWhatsApp Business Cloudnodes.
- Set up your WhatsApp Business Cloud credentials in the
- Configure AI Credentials:
- Update the credentials for the
Google Gemini Chat Model,Groq Chat Model,OpenRouter Chat Model,Vercel AI Gateway Chat Model,Google Gemini, andEmbeddings Mistral Cloudnodes with your respective API keys.
- Update the credentials for the
- Configure MongoDB Atlas Vector Store:
- Set up the credentials and connection details for your
MongoDB Atlas Vector Storeto enable memory and document retrieval.
- Set up the credentials and connection details for your
- Review AI Agent Configuration:
- Examine the
AI Agentnode to understand its prompt, tools, and memory configuration. Adjust as needed for your specific use case. - Ensure the
Simple Memorynode is correctly linked and configured.
- Examine the
- Customize Logic (Optional):
- Modify the
IfandSwitchnodes to adjust the routing logic based on your desired assistant behavior. - The
HTTP Requestnode can be configured to integrate with other services (e.g., image generation APIs) as part of the AI agent's tools.
- Modify the
- Activate the Workflow: Once all credentials and configurations are set, activate the workflow.
Your WhatsApp assistant will now be ready to receive messages and respond using its AI capabilities!
Related Templates
Automate Dutch Public Procurement Data Collection with TenderNed
TenderNed Public Procurement What This Workflow Does This workflow automates the collection of public procurement data from TenderNed (the official Dutch tender platform). It: Fetches the latest tender publications from the TenderNed API Retrieves detailed information in both XML and JSON formats for each tender Parses and extracts key information like organization names, titles, descriptions, and reference numbers Filters results based on your custom criteria Stores the data in a database for easy querying and analysis Setup Instructions This template comes with sticky notes providing step-by-step instructions in Dutch and various query options you can customize. Prerequisites TenderNed API Access - Register at TenderNed for API credentials Configuration Steps Set up TenderNed credentials: Add HTTP Basic Auth credentials with your TenderNed API username and password Apply these credentials to the three HTTP Request nodes: "Tenderned Publicaties" "Haal XML Details" "Haal JSON Details" Customize filters: Modify the "Filter op ..." node to match your specific requirements Examples: specific organizations, contract values, regions, etc. How It Works Step 1: Trigger The workflow can be triggered either manually for testing or automatically on a daily schedule. Step 2: Fetch Publications Makes an API call to TenderNed to retrieve a list of recent publications (up to 100 per request). Step 3: Process & Split Extracts the tender array from the response and splits it into individual items for processing. Step 4: Fetch Details For each tender, the workflow makes two parallel API calls: XML endpoint - Retrieves the complete tender documentation in XML format JSON endpoint - Fetches metadata including reference numbers and keywords Step 5: Parse & Merge Parses the XML data and merges it with the JSON metadata and batch information into a single data structure. Step 6: Extract Fields Maps the raw API data to clean, structured fields including: Publication ID and date Organization name Tender title and description Reference numbers (kenmerk, TED number) Step 7: Filter Applies your custom filter criteria to focus on relevant tenders only. Step 8: Store Inserts the processed data into your database for storage and future analysis. Customization Tips Modify API Parameters In the "Tenderned Publicaties" node, you can adjust: offset: Starting position for pagination size: Number of results per request (max 100) Add query parameters for date ranges, status filters, etc. Add More Fields Extend the "Splits Alle Velden" node to extract additional fields from the XML/JSON data, such as: Contract value estimates Deadline dates CPV codes (procurement classification) Contact information Integrate Notifications Add a Slack, Email, or Discord node after the filter to get notified about new matching tenders. Incremental Updates Modify the workflow to only fetch new tenders by: Storing the last execution timestamp Adding date filters to the API query Only processing publications newer than the last run Troubleshooting No data returned? Verify your TenderNed API credentials are correct Check that you have setup youre filter proper Need help setting this up or interested in a complete tender analysis solution? Get in touch π LinkedIn β Wessel Bulte
π How to transform unstructured email data into structured format with AI agent
This workflow automates the process of extracting structured, usable information from unstructured email messages across multiple platforms. It connects directly to Gmail, Outlook, and IMAP accounts, retrieves incoming emails, and sends their content to an AI-powered parsing agent built on OpenAI GPT models. The AI agent analyzes each email, identifies relevant details, and returns a clean JSON structure containing key fields: From β senderβs email address To β recipientβs email address Subject β email subject line Summary β short AI-generated summary of the email body The extracted information is then automatically inserted into an n8n Data Table, creating a structured database of email metadata and summaries ready for indexing, reporting, or integration with other tools. --- Key Benefits β Full Automation: Eliminates manual reading and data entry from incoming emails. β Multi-Source Integration: Handles data from different email providers seamlessly. β AI-Driven Accuracy: Uses advanced language models to interpret complex or unformatted content. β Structured Storage: Creates a standardized, query-ready dataset from previously unstructured text. β Time Efficiency: Processes emails in real time, improving productivity and response speed. *β Scalability: Easily extendable to handle additional sources or extract more data fields. --- How it works This workflow automates the transformation of unstructured email data into a structured, queryable format. It operates through a series of connected steps: Email Triggering: The workflow is initiated by one of three different email triggers (Gmail, Microsoft Outlook, or a generic IMAP account), which constantly monitor for new incoming emails. AI-Powered Parsing & Structuring: When a new email is detected, its raw, unstructured content is passed to a central "Parsing Agent." This agent uses a specified OpenAI language model to intelligently analyze the email text. Data Extraction & Standardization: Following a predefined system prompt, the AI agent extracts key information from the email, such as the sender, recipient, subject, and a generated summary. It then forces the output into a strict JSON structure using a "Structured Output Parser" node, ensuring data consistency. Data Storage: Finally, the clean, structured data (the from, to, subject, and summarize fields) is inserted as a new row into a specified n8n Data Table, creating a searchable and reportable database of email information. --- Set up steps To implement this workflow, follow these configuration steps: Prepare the Data Table: Create a new Data Table within n8n. Define the columns with the following names and string type: From, To, Subject, and Summary. Configure Email Credentials: Set up the credential connections for the email services you wish to use (Gmail OAuth2, Microsoft Outlook OAuth2, and/or IMAP). Ensure the accounts have the necessary permissions to read emails. Configure AI Model Credentials: Set up the OpenAI API credential with a valid API key. The workflow is configured to use the model, but this can be changed in the respective nodes if needed. Connect the Nodes: The workflow canvas is already correctly wired. Visually confirm that the email triggers are connected to the "Parsing Agent," which is connected to the "Insert row" (Data Table) node. Also, ensure the "OpenAI Chat Model" and "Structured Output Parser" are connected to the "Parsing Agent" as its AI model and output parser, respectively. Activate the Workflow: Save the workflow and toggle the "Active" switch to ON. The triggers will begin polling for new emails according to their schedule (e.g., every minute), and the automation will start processing incoming messages. --- Need help customizing? Contact me for consulting and support or add me on Linkedin.
Tax deadline management & compliance alerts with GPT-4, Google Sheets & Slack
AI-Driven Tax Compliance & Deadline Management System Description Automate tax deadline monitoring with AI-powered insights. This workflow checks your tax calendar daily at 8 AM, uses GPT-4 to analyze upcoming deadlines across multiple jurisdictions, detects overdue and critical items, and sends intelligent alerts via email and Slack only when immediate action is required. Perfect for finance teams and accounting firms who need proactive compliance management without manual tracking. ποΈπ€π Good to Know AI-Powered: GPT-4 provides risk assessment and strategic recommendations Multi-Jurisdiction: Handles Federal, State, and Local tax requirements automatically Smart Alerts: Only notifies executives when deadlines are overdue or critical (β€3 days) Priority Classification: Categorizes deadlines as Overdue, Critical, High, or Medium priority Dual Notifications: Critical alerts to leadership + daily summaries to team channel Complete Audit Trail: Logs all checks and deadlines to Google Sheets for compliance records How It Works Daily Trigger - Runs at 8:00 AM every morning Fetch Data - Pulls tax calendar and company configuration from Google Sheets Analyze Deadlines - Calculates days remaining, filters by jurisdiction/entity type, categorizes by priority AI Analysis - GPT-4 provides strategic insights and risk assessment on upcoming deadlines Smart Routing - Only sends alerts if overdue or critical deadlines exist Critical Alerts - HTML email to executives + Slack alert for urgent items Team Updates - Slack summary to finance channel with all upcoming deadlines Logging - Records compliance check results to Google Sheets for audit trail Requirements Google Sheets Structure Sheet 1: TaxCalendar DeadlineID | DeadlineName | DeadlineDate | Jurisdiction | Category | AssignedTo | IsActive FED-Q1 | Form 1120 Q1 | 2025-04-15 | Federal | Income | John Doe | TRUE Sheet 2: CompanyConfig (single row) Jurisdictions | EntityType | FiscalYearEnd Federal, California | Corporation | 12-31 Sheet 3: ComplianceLog (auto-populated) Date | AlertLevel | TotalUpcoming | CriticalCount | OverdueCount 2025-01-15 | HIGH | 12 | 3 | 1 Credentials Needed Google Sheets - Service Account OAuth2 OpenAI - API Key (GPT-4 access required) SMTP - Email account for sending alerts Slack - Bot Token with chat:write permission Setup Steps Import workflow JSON into n8n Add all 4 credentials Replace these placeholders: YOURTAXCALENDAR_ID - Tax calendar sheet ID YOURCONFIGID - Company config sheet ID YOURLOGID - Compliance log sheet ID C12345678 - Slack channel ID tax@company.com - Sender email cfo@company.com - Recipient email Share all sheets with Google service account email Invite Slack bot to channels Test workflow manually Activate the trigger Customizing This Workflow Change Alert Thresholds: Edit "Analyze Deadlines" node: Critical: Change <= 3 to <= 5 for 5-day warning High: Change <= 7 to <= 14 for 2-week notice Medium: Change <= 30 to <= 60 for 2-month lookout Adjust Schedule: Edit "Daily Tax Check" trigger: Change hour/minute for different run time Add multiple trigger times for tax season (8 AM, 2 PM, 6 PM) Add More Recipients: Edit "Send Email" node: To: cfo@company.com, director@company.com CC: accounting@company.com BCC: archive@company.com Customize Email Design: Edit "Format Email" node to change colors, add logo, or modify layout Add SMS Alerts: Insert Twilio node after "Is Critical" for emergency notifications Integrate Task Management: Add HTTP Request node to create tasks in Asana/Jira for critical deadlines Troubleshooting | Issue | Solution | |-------|----------| | No deadlines found | Check date format (YYYY-MM-DD) and IsActive = TRUE | | AI analysis failed | Verify OpenAI API key and account credits | | Email not sending | Test SMTP credentials and check if critical condition met | | Slack not posting | Invite bot to channel and verify channel ID format | | Permission denied | Share Google Sheets with service account email | π Professional Services Need help with implementation or customization? Our team offers: π― Custom workflow development π’ Enterprise deployment support π Team training sessions π§ Ongoing maintenance π Custom reporting & dashboards π Additional API integrations Discover more workflows β Get in touch with us