Back to Catalog

Find Instagram collaboration leads using Apify scraping and GPT-4o evaluation

Robert BreenRobert Breen
1022 views
2/3/2026
Official Page

This no-code n8n workflow finds recent Instagram posts by hashtag, scrapes profile data, and uses an AI agent to evaluate whether each account is a good collaboration lead. The workflow filters based on the number of followers and the content of their bio, and outputs structured reasoning for outreach decisions.

Perfect for creators, marketers, or business developers looking to automate influencer or community partnership prospecting—especially in niche ecosystems like n8n.


✅ Key Features

  • 🔍 Hashtag Discovery: Finds recent Instagram posts from a specified hashtag (e.g., #n8n)
  • 👤 Account Scraping: Retrieves profile details such as follower count and biography
  • 🧠 AI Evaluation: Uses OpenAI and LangChain to determine if the profile is a good fit for outreach
  • 📦 Structured Output: Returns a JSON object with "Yes/No" lead status and reasoning
  • 🛠️ Manual Execution: Run on demand using the manual trigger

🧰 What You'll Need

| Tool / API | Purpose | Setup Steps | |-------------------------|------------------------------------------|-------------| | Apify Account | To access Instagram scraping actors | Create account → Generate API Token → Use in httpQueryAuth credential in n8n | | OpenAI API Key | To power the AI decision-making agent | Sign up at OpenAI → Create API key → Paste into OpenAI credential in n8n | | LangChain Plugin for n8n | AI Orchestration with System Message | Install LangChain nodes from Community Nodes (already installed in this workflow) |


🔧 Step-by-Step Setup

1️⃣ Manual Trigger

  • Node: When clicking ‘Execute workflow’
  • Use: Allows you to run the workflow manually while testing.

2️⃣ Define Hashtag

  • Node: Create Search Term
  • Value: Sets "n8n" as the default Instagram hashtag to scan.
  • You can edit this to any other hashtag you'd like.

3️⃣ Find Recent Posts

  • Node: Find Recent Posts
  • API: Apify Instagram Hashtag Scraper
  • Auth Setup:
    • Go to your Apify Console
    • Click “Create new token”
    • In n8n, create a new HTTP Query Auth credential
      • Set token in the token query param (e.g., ?token=yourTokenHere)
    • Choose the credential in this node

4️⃣ Scrape Each Profile

  • Node: Scrape Accounts
  • API: Apify Instagram Profile Scraper
  • Body: JSON with usernames from the hashtag search
  • Note: Uses the same httpQueryAuth credential as the previous node.

5️⃣ Extract Fields

  • Node: Set bio and follower count
  • What it does: Extracts biography and followersCount from the profile JSON and stores them in clean variables for AI input.

6️⃣ AI Lead Scoring

  • Node: AI Agent
  • Purpose: Uses GPT-4o-mini to analyze the bio and follower count
  • Prompt Details:

7️⃣ AI Model

  • Node: OpenAI Chat Model
  • Model: gpt-4o-mini
  • Credential: Connect your OpenAI account via API Key.
  • Go to OpenAI API Keys
  • Copy your key and create a new OpenAI API credential in n8n.

8️⃣ Output Parser

  • Node: Structured Output Parser
  • What it does: Parses the response from the AI into structured JSON for further use (e.g., storing leads, sending to Airtable, etc.)

🧪 Sample Output

{
"lead status": "Yes",
"Reasoning": "The user has 3.5k followers and their bio shows they build automations with n8n."
}

---

## 📬 Need More Help?

If you'd like assistance setting this up, customizing it to your niche, or expanding it to score and store leads automatically — I can help!

**👤 Robert Breen**  
Automation Consultant | AI Workflow Designer | n8n Expert  
📧 [robert@ynteractive.com](mailto:robert@ynteractive.com)  
🌐 [ynteractive.com](https://ynteractive.com)  
🔗 [LinkedIn](https://www.linkedin.com/in/robert-breen-29429625/)

---

n8n Workflow: AI Agent for Instagram Collaboration Lead Evaluation

This n8n workflow leverages an AI Agent to process and evaluate potential Instagram collaboration leads. It's designed to take raw data, likely from a scraping process, and apply intelligent analysis to identify valuable collaboration opportunities.

What it does

This workflow automates the following steps:

  1. Manual Trigger: The workflow is initiated manually, allowing for on-demand execution.
  2. HTTP Request (Placeholder): A placeholder HTTP Request node is present. In a full implementation, this would likely be used to fetch data from an external source, such as an API that provides scraped Instagram profiles or a database of potential leads.
  3. Edit Fields (Set): A Set node is included for data transformation. This would be used to prepare the data for the AI Agent, potentially renaming fields, extracting specific information, or structuring the input JSON.
  4. AI Agent (Langchain): The core of the workflow, this AI Agent node (powered by Langchain) will process the input data. It's configured to use an OpenAI Chat Model and a Structured Output Parser, indicating it's designed to understand natural language instructions and return data in a predefined, structured format. The agent's role is to evaluate the provided Instagram lead data against specific criteria to determine collaboration potential.
  5. OpenAI Chat Model (Langchain): This sub-node within the AI Agent is responsible for the actual AI processing, using OpenAI's chat capabilities to perform the evaluation.
  6. Structured Output Parser (Langchain): This sub-node ensures that the AI Agent's output is formatted consistently, likely into a JSON object, making it easy for subsequent nodes to consume and act upon the evaluation results.
  7. Sticky Note: A sticky note is present, likely for documentation or temporary notes within the workflow design.

Prerequisites/Requirements

  • n8n Instance: A running instance of n8n.
  • OpenAI API Key: Configured as a credential within n8n for the OpenAI Chat Model.
  • Apify Account (Implied): While not directly in the JSON, the directory name "7019-find-instagram-collaboration-leads-using-apify-scraping-and-gpt-4o-evaluation" strongly suggests that Apify is used upstream to scrape Instagram data, which this workflow would then process. You would need an Apify account and an Apify Actor to perform the scraping.
  • Understanding of AI Agent configuration: Familiarity with configuring Langchain AI Agents in n8n, including defining prompts, tools, and output structures.

Setup/Usage

  1. Import the Workflow: Import the provided JSON into your n8n instance.
  2. Configure Credentials:
    • Set up your OpenAI API Key as an n8n credential and link it to the "OpenAI Chat Model" node within the "AI Agent".
  3. Customize HTTP Request (if applicable): If you intend to fetch data from an external source, configure the "HTTP Request" node with the appropriate URL, headers, and body to retrieve your Instagram lead data.
  4. Configure Edit Fields (Set): Adjust the "Edit Fields" node to transform your raw input data into a format suitable for the AI Agent. This might involve mapping fields, extracting specific values, or creating new properties.
  5. Configure AI Agent:
    • Prompt Engineering: The most crucial step is to define the prompt for the "AI Agent" node. This prompt should clearly instruct the AI on how to evaluate an Instagram profile for collaboration potential. For example, it could ask the AI to consider follower count, engagement rate, niche relevance, content quality, and brand alignment.
    • Structured Output: Define the expected output structure in the "Structured Output Parser" node. This will guide the AI to return its evaluation in a consistent JSON format (e.g., {"is_good_lead": true, "reason": "...", "score": 8, "suggested_action": "..."}).
  6. Execute the Workflow: Click "Execute Workflow" on the "Manual Trigger" node to run the workflow and process your leads.
  7. Review Results: Inspect the output of the "AI Agent" node to see the AI's evaluation of your Instagram collaboration leads. You can then add subsequent nodes to, for example, save the evaluated leads to a database, send notifications, or further filter them.

Related Templates

AI-powered code review with linting, red-marked corrections in Google Sheets & Slack

Advanced Code Review Automation (AI + Lint + Slack) Who’s it for For software engineers, QA teams, and tech leads who want to automate intelligent code reviews with both AI-driven suggestions and rule-based linting — all managed in Google Sheets with instant Slack summaries. How it works This workflow performs a two-layer review system: Lint Check: Runs a lightweight static analysis to find common issues (e.g., use of var, console.log, unbalanced braces). AI Review: Sends valid code to Gemini AI, which provides human-like review feedback with severity classification (Critical, Major, Minor) and visual highlights (red/orange tags). Formatter: Combines lint and AI results, calculating an overall score (0–10). Aggregator: Summarizes results for quick comparison. Google Sheets Writer: Appends results to your review log. Slack Notification: Posts a concise summary (e.g., number of issues and average score) to your team’s channel. How to set up Connect Google Sheets and Slack credentials in n8n. Replace placeholders (<YOURSPREADSHEETID>, <YOURSHEETGIDORNAME>, <YOURSLACKCHANNEL_ID>). Adjust the AI review prompt or lint rules as needed. Activate the workflow — reviews will start automatically whenever new code is added to the sheet. Requirements Google Sheets and Slack integrations enabled A configured AI node (Gemini, OpenAI, or compatible) Proper permissions to write to your target Google Sheet How to customize Add more linting rules (naming conventions, spacing, forbidden APIs) Extend the AI prompt for project-specific guidelines Customize the Slack message formatting Export analytics to a dashboard (e.g., Notion or Data Studio) Why it’s valuable This workflow brings realistic, team-oriented AI-assisted code review to n8n — combining the speed of automated linting with the nuance of human-style feedback. It saves time, improves code quality, and keeps your team’s review history transparent and centralized.

higashiyama By higashiyama
90

Ai website scraper & company intelligence

AI Website Scraper & Company Intelligence Description This workflow automates the process of transforming any website URL into a structured, intelligent company profile. It's triggered by a form, allowing a user to submit a website and choose between a "basic" or "deep" scrape. The workflow extracts key information (mission, services, contacts, SEO keywords), stores it in a structured Supabase database, and archives a full JSON backup to Google Drive. It also features a secondary AI agent that automatically finds and saves competitors for each company, building a rich, interconnected database of company intelligence. --- Quick Implementation Steps Import the Workflow: Import the provided JSON file into your n8n instance. Install Custom Community Node: You must install the community node from: https://www.npmjs.com/package/n8n-nodes-crawl-and-scrape FIRECRAWL N8N Documentation https://docs.firecrawl.dev/developer-guides/workflow-automation/n8n Install Additional Nodes: n8n-nodes-crawl-and-scrape and n8n-nodes-mcp fire crawl mcp . Set up Credentials: Create credentials in n8n for FIRE CRAWL API,Supabase, Mistral AI, and Google Drive. Configure API Key (CRITICAL): Open the Web Search tool node. Go to Parameters → Headers and replace the hardcoded Tavily AI API key with your own. Configure Supabase Nodes: Assign your Supabase credential to all Supabase nodes. Ensure table names (e.g., companies, competitors) match your schema. Configure Google Drive Nodes: Assign your Google Drive credential to the Google Drive2 and save to Google Drive1 nodes. Select the correct Folder ID. Activate Workflow: Turn on the workflow and open the Webhook URL in the “On form submission” node to access the form. --- What It Does Form Trigger Captures user input: “Website URL” and “Scraping Type” (basic or deep). Scraping Router A Switch node routes the flow: Deep Scraping → AI-based MCP Firecrawler agent. Basic Scraping → Crawlee node. Deep Scraping (Firecrawl AI Agent) Uses Firecrawl and Tavily Web Search. Extracts a detailed JSON profile: mission, services, contacts, SEO keywords, etc. Basic Scraping (Crawlee) Uses Crawl and Scrape node to collect raw text. A Mistral-based AI extractor structures the data into JSON. Data Storage Stores structured data in Supabase tables (companies, company_basicprofiles). Archives a full JSON backup to Google Drive. Automated Competitor Analysis Runs after a deep scrape. Uses Tavily web search to find competitors (e.g., from Crunchbase). Saves competitor data to Supabase, linked by company_id. --- Who's It For Sales & Marketing Teams: Enrich leads with deep company info. Market Researchers: Build structured, searchable company databases. B2B Data Providers: Automate company intelligence collection. Developers: Use as a base for RAG or enrichment pipelines. --- Requirements n8n instance (self-hosted or cloud) Supabase Account: With tables like companies, competitors, social_links, etc. Mistral AI API Key Google Drive Credentials Tavily AI API Key (Optional) Custom Nodes: n8n-nodes-crawl-and-scrape --- How It Works Flow Summary Form Trigger: Captures “Website URL” and “Scraping Type”. Switch Node: deep → MCP Firecrawler (AI Agent). basic → Crawl and Scrape node. Scraping & Extraction: Deep path: Firecrawler → JSON structure. Basic path: Crawlee → Mistral extractor → JSON. Storage: Save JSON to Supabase. Archive in Google Drive. Competitor Analysis (Deep Only): Finds competitors via Tavily. Saves to Supabase competitors table. End: Finishes with a No Operation node. --- How To Set Up Import workflow JSON. Install community nodes (especially n8n-nodes-crawl-and-scrape from npm). Configure credentials (Supabase, Mistral AI, Google Drive). Add your Tavily API key. Connect Supabase and Drive nodes properly. Fix disconnected “basic” path if needed. Activate workflow. Test via the webhook form URL. --- How To Customize Change LLMs: Swap Mistral for OpenAI or Claude. Edit Scraper Prompts: Modify system prompts in AI agent nodes. Change Extraction Schema: Update JSON Schema in extractor nodes. Fix Relational Tables: Add Items node before Supabase inserts for arrays (social links, keywords). Enhance Automation: Add email/slack notifications, or replace form trigger with a Google Sheets trigger. --- Add-ons Automated Trigger: Run on new sheet rows. Notifications: Email or Slack alerts after completion. RAG Integration: Use the Supabase database as a chatbot knowledge source. --- Use Case Examples Sales Lead Enrichment: Instantly get company + competitor data from a URL. Market Research: Collect and compare companies in a niche. B2B Database Creation: Build a proprietary company dataset. --- WORKFLOW IMAGE --- Troubleshooting Guide | Issue | Possible Cause | Solution | |-------|----------------|-----------| | Form Trigger 404 | Workflow not active | Activate the workflow | | Web Search Tool fails | Missing Tavily API key | Replace the placeholder key | | FIRECRAWLER / find competitor fails | Missing MCP node | Install n8n-nodes-mcp | | Basic scrape does nothing | Switch node path disconnected | Reconnect “basic” output | | Supabase node error | Wrong table/column names | Match schema exactly | --- Need Help or More Workflows? Want to customize this workflow for your business or integrate it with your existing tools? Our team at Digital Biz Tech can tailor it precisely to your use case from automation logic to AI-powered enhancements. Contact: shilpa.raju@digitalbiz.tech For more such offerings, visit us: https://www.digitalbiz.tech ---

DIGITAL BIZ TECHBy DIGITAL BIZ TECH
923

Daily cash flow reports with Google Sheets, Slack & Email for finance teams

Simplify financial oversight with this automated n8n workflow. Triggered daily, it fetches cash flow and expense data from a Google Sheet, analyzes inflows and outflows, validates records, and generates a comprehensive daily report. The workflow sends multi-channel notifications via email and Slack, ensuring finance professionals stay updated with real-time financial insights. 💸📧 Key Features Daily automation keeps cash flow tracking current. Analyzes inflows and outflows for actionable insights. Multi-channel alerts enhance team visibility. Logs maintain a detailed record in Google Sheets. Workflow Process The Every Day node triggers a daily check at a set time. Get Cash Flow Data retrieves financial data from a Google Sheet. Analyze Inflows & Outflows processes the data to identify trends and totals. Validate Records ensures all entries are complete and accurate. If records are valid, it branches to: Sends Email Daily Report to finance team members. Send Slack Alert to notify the team instantly. Logs to Sheet appends the summary data to a Google Sheet for tracking. Setup Instructions Import the workflow into n8n and configure Google Sheets OAuth2 for data access. Set the daily trigger time (e.g., 9:00 AM IST) in the "Every Day" node. Test the workflow by adding sample cash flow data and verifying reports. Adjust analysis parameters as needed for specific financial metrics. Prerequisites Google Sheets OAuth2 credentials Gmail API Key for email reports Slack Bot Token (with chat:write permissions) Structured financial data in a Google Sheet Google Sheet Structure: Create a sheet with columns: Date Cash Inflow Cash Outflow Category Notes Updated At Modification Options Customize the "Analyze Inflows & Outflows" node to include custom financial ratios. Adjust the "Validate Records" filter to flag anomalies or missing data. Modify email and Slack templates with branded formatting. Integrate with accounting tools (e.g., Xero) for live data feeds. Set different trigger times to align with your financial review schedule. Discover more workflows – Get in touch with us

Oneclick AI SquadBy Oneclick AI Squad
619