Scrape product info from website URLs in Google Sheets using Dumpling AI
π What this workflow does
This workflow automatically scrapes product information from any website URL entered into a Google Sheet and stores the extracted product details into another sheet. It uses Dumpling AI to extract product data such as name, price, description, and reviews.
π€ Who is this for
This is ideal for:
- Lead generation specialists capturing product info from prospect websites
- eCommerce researchers collecting data on competitor product listings
- Sales teams building enriched product databases from lead URLs
- Anyone who needs to automate product scraping from multiple websites
β Requirements
- A Google Sheet with a column labeled
Websitewhere URLs will be added - A second sheet (e.g.,
product details) where extracted data will be saved - Dumpling AI API access to perform the extraction
- Connected Google Sheets credentials in n8n
βοΈ How to set up
- Replace the Google Sheet and tab IDs in the workflow with your own.
- Make sure your source sheet includes a
Websitecolumn. - Connect your Dumpling AI and Google Sheets credentials.
- Make sure the output sheet has the following headers:
productNamepriceproductDescription
(The workflow supportsreview, but itβs optional.)
- Activate the workflow to start processing new rows.
π How it works (Workflow Steps)
- Watch New Website URL in Google Sheets: Triggers when a new row is added with a website URL.
- Extract Product Info with Dumpling AI: Sends the URL to Dumpling AIβs extract endpoint using a defined schema for product details.
- Split Extracted Products: Separates multiple products into individual items if the page contains more than one.
- Append Product Info to Google Sheets: Adds the structured results to the specified product details sheet.
π οΈ Customization Ideas
- Add a column to store the original source URL alongside each product
- Use OpenAI to generate short SEO summaries for each product
- Add filters to ignore pages without valid product details
- Send Slack or email notifications when new products are added to the sheet
Scrape Product Information from Website URLs in Google Sheets using Dumpling AI
This n8n workflow automates the process of extracting product information from a list of website URLs stored in a Google Sheet. It leverages the Dumpling AI API to intelligently scrape data from each URL and then writes the extracted information back to the same Google Sheet.
What it does
- Triggers on new Google Sheet rows: The workflow is activated whenever new rows are added to a specified Google Sheet.
- Reads URLs from Google Sheet: It reads the URLs from the newly added rows in the Google Sheet.
- Splits items for individual processing: Each URL is processed as a separate item to ensure independent scraping.
- Scrapes product info using Dumpling AI: For each URL, it makes an HTTP request to the Dumpling AI API to scrape product details.
- Updates Google Sheet with scraped data: The extracted product information (e.g., product name, price, description) is then written back to the Google Sheet, enriching the original data.
Prerequisites/Requirements
- n8n Instance: A running n8n instance (cloud or self-hosted).
- Google Account: A Google account with access to Google Sheets.
- Dumpling AI API Key: An API key for Dumpling AI. You will need to sign up for their service to obtain one.
- Google Sheets Credential: An n8n credential configured for Google Sheets (OAuth2 recommended).
- HTTP Request Credential (for Dumpling AI): An n8n credential for HTTP Request, likely an API Key or Bearer Token type, to authenticate with the Dumpling AI API.
Setup/Usage
- Import the workflow: Download the provided JSON and import it into your n8n instance.
- Configure Google Sheets Trigger:
- Select your Google Sheets credential.
- Specify the Spreadsheet ID and Sheet Name where your URLs are located.
- Ensure the trigger is set to listen for new rows.
- Configure HTTP Request (Dumpling AI):
- Select or create an HTTP Request credential for Dumpling AI. This will typically be an API Key or Bearer Token.
- Update the URL to the Dumpling AI API endpoint for scraping (e.g.,
https://api.dumpling.ai/scrape). - Modify the Body of the request to send the URL from the Google Sheet (e.g.,
{"url": "{{$json.url_column_name}}"whereurl_column_nameis the header of your URL column in Google Sheets). - Adjust any other parameters required by the Dumpling AI API.
- Configure Google Sheets (Write Back):
- Select your Google Sheets credential.
- Specify the same Spreadsheet ID and Sheet Name.
- Set the Operation to
Update. - Map the output from the Dumpling AI HTTP Request node to the appropriate columns in your Google Sheet (e.g.,
Product Name,Price,Description). You will need to define how to match the scraped data back to the original rows (e.g., byRow Indexor a unique identifier).
- Activate the workflow: Once configured, activate the workflow. It will now automatically process new URLs added to your Google Sheet.
Related Templates
Generate song lyrics and music from text prompts using OpenAI and Fal.ai Minimax
Spark your creativity instantly in any chatβturn a simple prompt like "heartbreak ballad" into original, full-length lyrics and a professional AI-generated music track, all without leaving your conversation. π What This Template Does This chat-triggered workflow harnesses AI to generate detailed, genre-matched song lyrics (at least 600 characters) from user messages, then queues them for music synthesis via Fal.ai's minimax-music model. It polls asynchronously until the track is ready, delivering lyrics and audio URL back in chat. Crafts original, structured lyrics with verses, choruses, and bridges using OpenAI Submits to Fal.ai for melody, instrumentation, and vocals aligned to the style Handles long-running generations with smart looping and status checks Returns complete song package (lyrics + audio link) for seamless sharing π§ Prerequisites n8n account (self-hosted or cloud with chat integration enabled) OpenAI account with API access for GPT models Fal.ai account for AI music generation π Required Credentials OpenAI API Setup Go to platform.openai.com β API keys (sidebar) Click "Create new secret key" β Name it (e.g., "n8n Songwriter") Copy the key and add to n8n as "OpenAI API" credential type Test by sending a simple chat completion request Fal.ai HTTP Header Auth Setup Sign up at fal.ai β Dashboard β API Keys Generate a new API key β Copy it In n8n, create "HTTP Header Auth" credential: Name="Fal.ai", Header Name="Authorization", Header Value="Key [Your API Key]" Test with a simple GET to their queue endpoint (e.g., /status) βοΈ Configuration Steps Import the workflow JSON into your n8n instance Assign OpenAI API credentials to the "OpenAI Chat Model" node Assign Fal.ai HTTP Header Auth to the "Generate Music Track", "Check Generation Status", and "Fetch Final Result" nodes Activate the workflowβchat trigger will appear in your n8n chat interface Test by messaging: "Create an upbeat pop song about road trips" π― Use Cases Content Creators: YouTubers generating custom jingles for videos on the fly, streamlining production from idea to audio export Educators: Music teachers using chat prompts to create era-specific folk tunes for classroom discussions, fostering interactive learning Gift Personalization: Friends crafting anniversary R&B tracks from shared memories via quick chats, delivering emotional audio surprises Artist Brainstorming: Songwriters prototyping hip-hop beats in real-time during sessions, accelerating collaboration and iteration β οΈ Troubleshooting Invalid JSON from AI Agent: Ensure the system prompt stresses valid JSON; test the agent standalone with a sample query Music Generation Fails (401/403): Verify Fal.ai API key has minimax-music access; check usage quotas in dashboard Status Polling Loops Indefinitely: Bump wait time to 45-60s for complex tracks; inspect fal.ai queue logs for bottlenecks Lyrics Under 600 Characters: Tweak agent prompt to enforce fuller structures like [V1][C][V2][B][C]; verify output length in executions
Auto-reply & create Linear tickets from Gmail with GPT-5, gotoHuman & human review
This workflow automatically classifies every new email from your linked mailbox, drafts a personalized reply, and creates Linear tickets for bugs or feature requests. It uses a human-in-the-loop with gotoHuman and continuously improves itself by learning from approved examples. How it works The workflow triggers on every new email from your linked mailbox. Self-learning Email Classifier: an AI model categorizes the email into defined categories (e.g., Bug Report, Feature Request, Sales Opportunity, etc.). It fetches previously approved classification examples from gotoHuman to refine decisions. Self-learning Email Writer: the AI drafts a reply to the email. It learns over time by using previously approved replies from gotoHuman, with per-classification context to tailor tone and style (e.g., different style for sales vs. bug reports). Human Review in gotoHuman: review the classification and the drafted reply. Drafts can be edited or retried. Approved values are used to train the self-learning agents. Send approved Reply: the approved response is sent as a reply to the email thread. Create ticket: if the classification is Bug or Feature Request, a ticket is created by another AI agent in Linear. Human Review in gotoHuman: How to set up Most importantly, install the gotoHuman node before importing this template! (Just add the node to a blank canvas before importing) Set up credentials for gotoHuman, OpenAI, your email provider (e.g. Gmail), and Linear. In gotoHuman, select and create the pre-built review template "Support email agent" or import the ID: 6fzuCJlFYJtlu9mGYcVT. Select this template in the gotoHuman node. In the "gotoHuman: Fetch approved examples" http nodes you need to add your formId. It is the ID of the review template that you just created/imported in gotoHuman. Requirements gotoHuman (human supervision, memory for self-learning) OpenAI (classification, drafting) Gmail or your preferred email provider (for email trigger+replies) Linear (ticketing) How to customize Expand or refine the categories used by the classifier. Update the prompt to reflect your own taxonomy. Filter fetched training data from gotoHuman by reviewer so the writer adapts to their personalized tone and preferences. Add more context to the AI email writer (calendar events, FAQs, product docs) to improve reply quality.
Dynamic Hubspot lead routing with GPT-4 and Airtable sales team distribution
AI Agent for Dynamic Lead Distribution (HubSpot + Airtable) π§ AI-Powered Lead Routing and Sales Team Distribution This intelligent n8n workflow automates end-to-end lead qualification and allocation by integrating HubSpot, Airtable, OpenAI, Gmail, and Slack. The system ensures that every new lead is instantly analyzed, scored, and routed to the best-fit sales representative β all powered by AI logic, sir. --- π‘ Key Advantages β‘ Real-Time Lead Routing Automatically assigns new leads from HubSpot to the most relevant sales rep based on region, capacity, and expertise. π§ AI Qualification Engine An OpenAI-powered Agent evaluates the leadβs industry, region, and needs to generate a persona summary and routing rationale. π Centralized Tracking in Airtable Every lead is logged and updated in Airtable with AI insights, rep details, and allocation status for full transparency. π¬ Instant Notifications Slack and Gmail integrations alert the assigned rep immediately with full lead details and AI-generated notes. π Seamless CRM Sync Updates the original HubSpot record with lead persona, routing info, and timeline notes for audit-ready history, sir. --- βοΈ How It Works HubSpot Trigger β Captures a new lead as soon as itβs created in HubSpot. Fetch Contact Data β Retrieves all relevant fields like name, company, and industry. Clean & Format Data β A Code node standardizes and structures the data for consistency. Airtable Record Creation β Logs the lead data into the βLeadsβ table for centralized tracking. AI Agent Qualification β The AI analyzes the lead using the TeamDatabase (Airtable) to find the ideal rep. Record Update β Updates the same Airtable record with the assigned team and AI persona summary. Slack Notification β Sends a real-time message tagging the rep with lead info. Gmail Notification β Sends a personalized handoff email with context and follow-up actions. HubSpot Sync β Updates the original contact in HubSpot with the assignment details and AI rationale, sir. --- π οΈ Setup Steps Trigger Node: HubSpot β Detect new leads. HubSpot Node: Retrieve complete lead details. Code Node: Clean and normalize data. Airtable Node: Log lead info in the βLeadsβ table. AI Agent Node: Process lead and match with sales team. Slack Node: Notify the designated representative. Gmail Node: Email the rep with details. HubSpot Node: Update CRM with AI summary and allocation status, sir. --- π Credentials Required HubSpot OAuth2 API β To fetch and update leads. Airtable Personal Access Token β To store and update lead data. OpenAI API β To power the AI qualification and matching logic. Slack OAuth2 β For sending team notifications. Gmail OAuth2 β For automatic email alerts to assigned reps, sir. --- π€ Ideal For Sales Operations and RevOps teams managing multiple regions B2B SaaS and enterprise teams handling large lead volumes Marketing teams requiring AI-driven, bias-free lead assignment Organizations optimizing CRM efficiency with automation, sir --- π¬ Bonus Tip You can easily extend this workflow by adding lead scoring logic, language translation for follow-ups, or Salesforce integration. The entire system is modular β perfect for scaling across global sales teams, sir.