Extract & organize email invoices with Gmail, Drive & OpenAI GPT
Who’s it for
This template is for founders, finance teams, and solo operators who receive lots of invoices by email and want them captured automatically in a single, searchable source of truth. If you’re tired of hunting through your inbox for invoice PDFs or “that one receipt from three months ago,” this is for you.
What it does / How it works
The workflow polls your Gmail inbox on a schedule and fetches new messages including their attachments. A JavaScript Code node restructures all attachments, and a PDF extraction node reads any attached PDFs.
An AI “Invoice Recognition Agent” then analyzes the email body and attachments to decide whether the email actually contains an invoice. If not, the workflow stops.
If it is an invoice, a second AI “Invoice Data Extractor” pulls structured fields such as date_email, date_invoice, invoice_nr, description, provider, net_amount, vat, gross_amount, label (saas/hardware/other), and currency. Depending on whether the invoice is in an attachment or directly in the email text, the workflow either:
- uploads the invoice file to Google Drive, or
- document a direct link to the mail,
then appends/updates a row in Google Sheets with all invoice parameters plus a Drive link, and finally marks the Gmail message as read.
How to set up
- Add and authenticate:
- Gmail credentials
- Google Sheets credentials
- Google Drive credentials
- OpenAI (or compatible) credentials for the AI nodes
- Create or select a Google Sheet with the expected columns (date_email, date_invoice, invoice_nr, description, provider, net_amount, vat, gross_amount, label, currency, link).
- Create or select a Google Drive folder where invoices/docs should be stored.
- Adjust the Gmail Trigger filters (labels, search query, polling interval) to match the mailbox you want to process.
- Update node credentials and resource IDs (Sheet, Drive folder) via the node UIs, not hardcoded in HTTP nodes.
Requirements
- n8n instance (cloud or self-hosted)
- Gmail account with OAuth2 setup
- Google Drive and Google Sheets access
- OpenAI (or compatible) API key configured in n8n
- Sufficient permissions to read emails, read/write Drive files, and edit the target Sheet
How to customize the workflow
- Change invoice categories: Extend the
labelenum (e.g., add “services”, “subscriptions”) in the extraction schema and adjust any downstream logic. - Refine invoice detection: Tweak the AI prompts to be more or less strict about what counts as an invoice or receipt.
- Add notifications: After updating the Sheet, send a Slack/Teams message or email summary for high-value invoices.
- Filter by sender or subject: Narrow the Gmail Trigger to specific vendors, labels, or keywords.
- Extend the data model: Add fields (e.g., cost center, project code) to the extractor prompt and Sheet mapping to fit your bookkeeping setup.
Extract and Organize Email Invoices with Gmail, Google Drive, and OpenAI GPT
This n8n workflow automates the process of identifying invoice emails, extracting key information using AI, saving the invoice files to Google Drive, and logging the details into a Google Sheet. It streamlines invoice management, reducing manual effort and ensuring accurate record-keeping.
What it does
- Triggers on new Gmail emails: The workflow starts whenever a new email is received in your specified Gmail account.
- Filters for invoice emails: It uses an AI Agent (powered by OpenAI GPT) to determine if the email content or attachments indicate an invoice.
- Extracts invoice data: If an email is identified as an invoice, the AI Agent extracts crucial information like vendor, amount, date, and invoice number from the email body or any attached files (e.g., PDFs).
- Saves attachments to Google Drive: Any relevant invoice attachments are uploaded to a designated folder in Google Drive.
- Logs data to Google Sheets: The extracted invoice details (vendor, amount, date, invoice number, Drive link) are appended as a new row in a specified Google Sheet for easy tracking and accounting.
- Handles non-invoice emails: Emails not identified as invoices are passed through a "No Operation" node, effectively ending their processing in this workflow.
Prerequisites/Requirements
- n8n Instance: A running n8n instance (cloud or self-hosted).
- Gmail Account: Connected to n8n with appropriate permissions to read emails.
- Google Drive Account: Connected to n8n with permissions to create and upload files.
- Google Sheets Account: Connected to n8n with permissions to append data to a spreadsheet.
- OpenAI API Key: For the AI Agent and OpenAI Chat Model nodes to process and extract information.
- Langchain Nodes: Ensure the
@n8n/n8n-nodes-langchainpackage is installed in your n8n instance.
Setup/Usage
- Import the workflow: Download the provided JSON and import it into your n8n instance.
- Configure Credentials:
- Gmail Trigger: Set up your Gmail OAuth2 credential to listen for new emails.
- Google Drive: Configure your Google Drive OAuth2 credential for file uploads.
- Google Sheets: Set up your Google Sheets OAuth2 credential for data logging.
- OpenAI Chat Model: Configure your OpenAI API Key credential.
- Customize Nodes:
- Gmail Trigger: Adjust the "Label or Folder" and "Search Query" (e.g.,
has:attachment "invoice") if you want to narrow down the emails it processes. - AI Agent: Review and adjust the prompt for the AI Agent to accurately identify invoices and extract specific fields relevant to your needs.
- Google Drive: Specify the "Parent Folder ID" where you want the invoice attachments to be saved.
- Google Sheets: Configure the "Spreadsheet ID" and "Sheet Name" where invoice data will be logged. Ensure the sheet has appropriate headers (e.g., Vendor, Amount, Date, Invoice Number, Drive Link).
- Code Node: The "Code" node likely contains logic to structure the data extracted by the AI Agent before it's written to Google Sheets. Review and adjust this JavaScript code if your AI extraction output format changes or if you need different data mapping.
- Extract from File: This node is likely used to process attachments (e.g., PDFs) to extract text for the AI Agent. Ensure it's configured to handle the file types you expect.
- Gmail Trigger: Adjust the "Label or Folder" and "Search Query" (e.g.,
- Activate the workflow: Once all credentials and configurations are set, activate the workflow. It will now run automatically, processing new incoming emails.
Related Templates
Automate Dutch Public Procurement Data Collection with TenderNed
TenderNed Public Procurement What This Workflow Does This workflow automates the collection of public procurement data from TenderNed (the official Dutch tender platform). It: Fetches the latest tender publications from the TenderNed API Retrieves detailed information in both XML and JSON formats for each tender Parses and extracts key information like organization names, titles, descriptions, and reference numbers Filters results based on your custom criteria Stores the data in a database for easy querying and analysis Setup Instructions This template comes with sticky notes providing step-by-step instructions in Dutch and various query options you can customize. Prerequisites TenderNed API Access - Register at TenderNed for API credentials Configuration Steps Set up TenderNed credentials: Add HTTP Basic Auth credentials with your TenderNed API username and password Apply these credentials to the three HTTP Request nodes: "Tenderned Publicaties" "Haal XML Details" "Haal JSON Details" Customize filters: Modify the "Filter op ..." node to match your specific requirements Examples: specific organizations, contract values, regions, etc. How It Works Step 1: Trigger The workflow can be triggered either manually for testing or automatically on a daily schedule. Step 2: Fetch Publications Makes an API call to TenderNed to retrieve a list of recent publications (up to 100 per request). Step 3: Process & Split Extracts the tender array from the response and splits it into individual items for processing. Step 4: Fetch Details For each tender, the workflow makes two parallel API calls: XML endpoint - Retrieves the complete tender documentation in XML format JSON endpoint - Fetches metadata including reference numbers and keywords Step 5: Parse & Merge Parses the XML data and merges it with the JSON metadata and batch information into a single data structure. Step 6: Extract Fields Maps the raw API data to clean, structured fields including: Publication ID and date Organization name Tender title and description Reference numbers (kenmerk, TED number) Step 7: Filter Applies your custom filter criteria to focus on relevant tenders only. Step 8: Store Inserts the processed data into your database for storage and future analysis. Customization Tips Modify API Parameters In the "Tenderned Publicaties" node, you can adjust: offset: Starting position for pagination size: Number of results per request (max 100) Add query parameters for date ranges, status filters, etc. Add More Fields Extend the "Splits Alle Velden" node to extract additional fields from the XML/JSON data, such as: Contract value estimates Deadline dates CPV codes (procurement classification) Contact information Integrate Notifications Add a Slack, Email, or Discord node after the filter to get notified about new matching tenders. Incremental Updates Modify the workflow to only fetch new tenders by: Storing the last execution timestamp Adding date filters to the API query Only processing publications newer than the last run Troubleshooting No data returned? Verify your TenderNed API credentials are correct Check that you have setup youre filter proper Need help setting this up or interested in a complete tender analysis solution? Get in touch 🔗 LinkedIn – Wessel Bulte
AI-powered code review with linting, red-marked corrections in Google Sheets & Slack
Advanced Code Review Automation (AI + Lint + Slack) Who’s it for For software engineers, QA teams, and tech leads who want to automate intelligent code reviews with both AI-driven suggestions and rule-based linting — all managed in Google Sheets with instant Slack summaries. How it works This workflow performs a two-layer review system: Lint Check: Runs a lightweight static analysis to find common issues (e.g., use of var, console.log, unbalanced braces). AI Review: Sends valid code to Gemini AI, which provides human-like review feedback with severity classification (Critical, Major, Minor) and visual highlights (red/orange tags). Formatter: Combines lint and AI results, calculating an overall score (0–10). Aggregator: Summarizes results for quick comparison. Google Sheets Writer: Appends results to your review log. Slack Notification: Posts a concise summary (e.g., number of issues and average score) to your team’s channel. How to set up Connect Google Sheets and Slack credentials in n8n. Replace placeholders (<YOURSPREADSHEETID>, <YOURSHEETGIDORNAME>, <YOURSLACKCHANNEL_ID>). Adjust the AI review prompt or lint rules as needed. Activate the workflow — reviews will start automatically whenever new code is added to the sheet. Requirements Google Sheets and Slack integrations enabled A configured AI node (Gemini, OpenAI, or compatible) Proper permissions to write to your target Google Sheet How to customize Add more linting rules (naming conventions, spacing, forbidden APIs) Extend the AI prompt for project-specific guidelines Customize the Slack message formatting Export analytics to a dashboard (e.g., Notion or Data Studio) Why it’s valuable This workflow brings realistic, team-oriented AI-assisted code review to n8n — combining the speed of automated linting with the nuance of human-style feedback. It saves time, improves code quality, and keeps your team’s review history transparent and centralized.
🎓 How to transform unstructured email data into structured format with AI agent
This workflow automates the process of extracting structured, usable information from unstructured email messages across multiple platforms. It connects directly to Gmail, Outlook, and IMAP accounts, retrieves incoming emails, and sends their content to an AI-powered parsing agent built on OpenAI GPT models. The AI agent analyzes each email, identifies relevant details, and returns a clean JSON structure containing key fields: From – sender’s email address To – recipient’s email address Subject – email subject line Summary – short AI-generated summary of the email body The extracted information is then automatically inserted into an n8n Data Table, creating a structured database of email metadata and summaries ready for indexing, reporting, or integration with other tools. --- Key Benefits ✅ Full Automation: Eliminates manual reading and data entry from incoming emails. ✅ Multi-Source Integration: Handles data from different email providers seamlessly. ✅ AI-Driven Accuracy: Uses advanced language models to interpret complex or unformatted content. ✅ Structured Storage: Creates a standardized, query-ready dataset from previously unstructured text. ✅ Time Efficiency: Processes emails in real time, improving productivity and response speed. *✅ Scalability: Easily extendable to handle additional sources or extract more data fields. --- How it works This workflow automates the transformation of unstructured email data into a structured, queryable format. It operates through a series of connected steps: Email Triggering: The workflow is initiated by one of three different email triggers (Gmail, Microsoft Outlook, or a generic IMAP account), which constantly monitor for new incoming emails. AI-Powered Parsing & Structuring: When a new email is detected, its raw, unstructured content is passed to a central "Parsing Agent." This agent uses a specified OpenAI language model to intelligently analyze the email text. Data Extraction & Standardization: Following a predefined system prompt, the AI agent extracts key information from the email, such as the sender, recipient, subject, and a generated summary. It then forces the output into a strict JSON structure using a "Structured Output Parser" node, ensuring data consistency. Data Storage: Finally, the clean, structured data (the from, to, subject, and summarize fields) is inserted as a new row into a specified n8n Data Table, creating a searchable and reportable database of email information. --- Set up steps To implement this workflow, follow these configuration steps: Prepare the Data Table: Create a new Data Table within n8n. Define the columns with the following names and string type: From, To, Subject, and Summary. Configure Email Credentials: Set up the credential connections for the email services you wish to use (Gmail OAuth2, Microsoft Outlook OAuth2, and/or IMAP). Ensure the accounts have the necessary permissions to read emails. Configure AI Model Credentials: Set up the OpenAI API credential with a valid API key. The workflow is configured to use the model, but this can be changed in the respective nodes if needed. Connect the Nodes: The workflow canvas is already correctly wired. Visually confirm that the email triggers are connected to the "Parsing Agent," which is connected to the "Insert row" (Data Table) node. Also, ensure the "OpenAI Chat Model" and "Structured Output Parser" are connected to the "Parsing Agent" as its AI model and output parser, respectively. Activate the Workflow: Save the workflow and toggle the "Active" switch to ON. The triggers will begin polling for new emails according to their schedule (e.g., every minute), and the automation will start processing incoming messages. --- Need help customizing? Contact me for consulting and support or add me on Linkedin.