Extract seed-funded startup data with RSS, GPT-4.1-MINI & BrightData to Excel
This workflow contains community nodes that are only compatible with the self-hosted version of n8n.
What It Does
This workflow automatically discovers recently seed-funded startups by monitoring RSS feeds for funding announcements. It uses Bright Data to scrape full article content, then extracts structured company information using OpenAI (GPT). The data is exported to an Excel sheet on OneDrive, providing sales teams with a real-time list of qualified leads without any manual effort.
How It Works
- Trigger & Article Discovery: Monitors curated RSS feeds for articles mentioning seed funding and triggers the workflow on new article detection.
- Content Scraping & Preparation: Scrapes full article content and converts it into clean markdown format for AI processing.
- Data Extraction with AI: Uses OpenAI to extract structured details like company name, website, LinkedIn profile, founders, and funding amount.
- Structured Data Output & Storage: Appends extracted data to an Excel sheet on OneDrive via Microsoft Graph API.
Prerequisites
- RSS Feed URL: A valid RSS feed source that provides seed funding articles for startups.
- Bright Data Credentials: Active Bright Data account with access credentials (API token ) to enable article scraping.
- OpenAI API Key: An OpenAI account with an API key and access to GPT-4.1-MINI models for data extraction.
- Microsoft OAuth2 API Credentials: OAuth2 credentials (Client ID, Secret, Tenant ID) with access scopes to use Microsoft Graph API for Excel integration.
- Excel Sheet in SharePoint: A pre-created Excel file hosted on OneDrive or SharePoint with the following column headers:
createdAt,companyName,companyWebsite,companyLinkedIn,fundingAmount,founderName,founderLinkedIn,articleLink - Excel File & Sheet Identifiers: The Drive ID, File ID and Sheet ID of your Excel sheet stored on OneDrive or SharePoint, required by the Microsoft Graph API for appending rows using the HTTP node in n8n.
Need help with the setup? Feel free to contact us
How to Set It Up
Follow these steps to configure and run the workflow:
-
Import the Workflow
- Copy the provided n8n workflow template.
- In your n8n instance, go to Editor UI > paste this workflow.
-
Configure the RSS Feed Node
- Open the RSS trigger node.
- Replace the default URL with your RSS feed URL.
- Ensure the polling interval matches your desired frequency (e.g., every 15 minutes or 1 hour).
-
Set Up Bright Data Node
- Add your Bright Data credentials.
- Follow the documentation to complete the setup.
-
Configure OpenAI Integration
- Add your OpenAI API key as a credential in n8n.
- Ensure the model is set to gpt-4.1-MINI.
- Follow the documentation to complete the setup.
-
Configure Excel File Integration
-
Open the HTTP node responsible for sending data to the Excel sheet via Microsoft Graph API.
-
Replace the placeholder values in the API endpoint URL with your actual File ID and Sheet ID from the Excel file stored on OneDrive or SharePoint.
https://graph.microsoft.com/v1.0/drives/{{drive-id}}/items/{{file-id}}/workbook/tables/{ {{ sheet-id }} }/rowsThis URL is used to append data to the specified Excel sheet range. -
Next, set up Microsoft OAuth2 credentials in n8n:
- Go to n8n > Credentials > Microsoft OAuth2 API.
- Provide the required values: - Client ID - Client Secret - Tenant ID - Scope
- Follow the documentation to complete the setup.
-
Once the credential is saved, connect it to the HTTP node making the Graph API call.
-
-
Activate the Workflow
- Set the workflow status to Active in n8n so it runs automatically when a new article appears in the RSS feed.
Need Help? Contact us for support and custom workflow development.
Extract Seed-Funded Startup Data with RSS, GPT-4 (Mini), and BrightData to Excel
This n8n workflow automates the process of identifying newly seed-funded startups from RSS feeds, enriching their data using an AI model, and preparing it for export or further analysis. It's designed to help you quickly discover and gather information on promising new companies.
What it does
This workflow performs the following key steps:
- Monitors RSS Feeds: It acts as a trigger, listening for new items in a configured RSS feed.
- Enriches Data with AI: For each new RSS item, it uses an OpenAI (GPT-4 mini) model to process the content. This step likely extracts specific details about the startup, such as its name, funding amount, investors, and a brief description, based on the prompt provided to the AI.
- Formats Output: The extracted data is then formatted into a structured output, potentially cleaning up fields or combining information.
- Generates Markdown Summary: A Markdown summary of the extracted startup information is generated, which could be used for reporting or display purposes.
- Prepares for Further Action: The final output is ready to be used by subsequent nodes, such as sending to a spreadsheet (e.g., Excel), a database, or a notification service.
Prerequisites/Requirements
To use this workflow, you will need:
- n8n Instance: A running n8n instance.
- RSS Feed URL: The URL of an RSS feed that publishes news about seed-funded startups (e.g., tech news sites, venture capital blogs).
- OpenAI API Key: An API key for OpenAI to utilize its language model (GPT-4 mini in this case) for data extraction and enrichment.
- (Potentially) BrightData Account: While not explicitly present in the provided JSON, the directory name suggests an integration with BrightData might be intended for web scraping or proxy services. If you plan to extend this workflow to scrape data from startup websites, a BrightData account might be necessary.
- (Potentially) Excel/Google Sheets/Database: A destination for the extracted data, such as an Excel file (manual export or via another n8n node), Google Sheet, or a database.
Setup/Usage
- Import the Workflow: Import the provided JSON into your n8n instance.
- Configure RSS Feed Trigger:
- Open the "RSS Feed Trigger" node.
- Enter the URL of the RSS feed you wish to monitor for startup funding news.
- Set the desired interval for checking the feed (e.g., every hour, daily).
- Configure OpenAI Credentials:
- Locate the "OpenAI" node.
- Add your OpenAI API Key as a credential.
- Review the prompt within the OpenAI node to ensure it aligns with the type of information you want to extract from the RSS feed content. Adjust as needed.
- Review and Customize "Edit Fields" and "Code" Nodes:
- The "Edit Fields (Set)" node likely renames or structures the data extracted by OpenAI. Review its configuration to ensure the fields are named as desired.
- The "Code" node may contain custom JavaScript logic for further data manipulation. Examine and modify it if specific transformations are required.
- Review "Markdown" Node: This node generates a Markdown summary. You can customize its content to include specific details from the extracted startup data.
- Add Output Node (Optional): To send the extracted data to Excel or another service, you will need to add an additional n8n node. For example:
- Google Sheets Node: To append data to a Google Sheet.
- Write Binary File Node: To create a CSV or JSON file that can be opened in Excel.
- HTTP Request Node: To send data to an external API (e.g., a custom endpoint for Excel integration).
- Activate the Workflow: Once configured, activate the workflow to start monitoring the RSS feed and processing new startup funding announcements.
Related Templates
Generate song lyrics and music from text prompts using OpenAI and Fal.ai Minimax
Spark your creativity instantly in any chatβturn a simple prompt like "heartbreak ballad" into original, full-length lyrics and a professional AI-generated music track, all without leaving your conversation. π What This Template Does This chat-triggered workflow harnesses AI to generate detailed, genre-matched song lyrics (at least 600 characters) from user messages, then queues them for music synthesis via Fal.ai's minimax-music model. It polls asynchronously until the track is ready, delivering lyrics and audio URL back in chat. Crafts original, structured lyrics with verses, choruses, and bridges using OpenAI Submits to Fal.ai for melody, instrumentation, and vocals aligned to the style Handles long-running generations with smart looping and status checks Returns complete song package (lyrics + audio link) for seamless sharing π§ Prerequisites n8n account (self-hosted or cloud with chat integration enabled) OpenAI account with API access for GPT models Fal.ai account for AI music generation π Required Credentials OpenAI API Setup Go to platform.openai.com β API keys (sidebar) Click "Create new secret key" β Name it (e.g., "n8n Songwriter") Copy the key and add to n8n as "OpenAI API" credential type Test by sending a simple chat completion request Fal.ai HTTP Header Auth Setup Sign up at fal.ai β Dashboard β API Keys Generate a new API key β Copy it In n8n, create "HTTP Header Auth" credential: Name="Fal.ai", Header Name="Authorization", Header Value="Key [Your API Key]" Test with a simple GET to their queue endpoint (e.g., /status) βοΈ Configuration Steps Import the workflow JSON into your n8n instance Assign OpenAI API credentials to the "OpenAI Chat Model" node Assign Fal.ai HTTP Header Auth to the "Generate Music Track", "Check Generation Status", and "Fetch Final Result" nodes Activate the workflowβchat trigger will appear in your n8n chat interface Test by messaging: "Create an upbeat pop song about road trips" π― Use Cases Content Creators: YouTubers generating custom jingles for videos on the fly, streamlining production from idea to audio export Educators: Music teachers using chat prompts to create era-specific folk tunes for classroom discussions, fostering interactive learning Gift Personalization: Friends crafting anniversary R&B tracks from shared memories via quick chats, delivering emotional audio surprises Artist Brainstorming: Songwriters prototyping hip-hop beats in real-time during sessions, accelerating collaboration and iteration β οΈ Troubleshooting Invalid JSON from AI Agent: Ensure the system prompt stresses valid JSON; test the agent standalone with a sample query Music Generation Fails (401/403): Verify Fal.ai API key has minimax-music access; check usage quotas in dashboard Status Polling Loops Indefinitely: Bump wait time to 45-60s for complex tracks; inspect fal.ai queue logs for bottlenecks Lyrics Under 600 Characters: Tweak agent prompt to enforce fuller structures like [V1][C][V2][B][C]; verify output length in executions
Auto-reply & create Linear tickets from Gmail with GPT-5, gotoHuman & human review
This workflow automatically classifies every new email from your linked mailbox, drafts a personalized reply, and creates Linear tickets for bugs or feature requests. It uses a human-in-the-loop with gotoHuman and continuously improves itself by learning from approved examples. How it works The workflow triggers on every new email from your linked mailbox. Self-learning Email Classifier: an AI model categorizes the email into defined categories (e.g., Bug Report, Feature Request, Sales Opportunity, etc.). It fetches previously approved classification examples from gotoHuman to refine decisions. Self-learning Email Writer: the AI drafts a reply to the email. It learns over time by using previously approved replies from gotoHuman, with per-classification context to tailor tone and style (e.g., different style for sales vs. bug reports). Human Review in gotoHuman: review the classification and the drafted reply. Drafts can be edited or retried. Approved values are used to train the self-learning agents. Send approved Reply: the approved response is sent as a reply to the email thread. Create ticket: if the classification is Bug or Feature Request, a ticket is created by another AI agent in Linear. Human Review in gotoHuman: How to set up Most importantly, install the gotoHuman node before importing this template! (Just add the node to a blank canvas before importing) Set up credentials for gotoHuman, OpenAI, your email provider (e.g. Gmail), and Linear. In gotoHuman, select and create the pre-built review template "Support email agent" or import the ID: 6fzuCJlFYJtlu9mGYcVT. Select this template in the gotoHuman node. In the "gotoHuman: Fetch approved examples" http nodes you need to add your formId. It is the ID of the review template that you just created/imported in gotoHuman. Requirements gotoHuman (human supervision, memory for self-learning) OpenAI (classification, drafting) Gmail or your preferred email provider (for email trigger+replies) Linear (ticketing) How to customize Expand or refine the categories used by the classifier. Update the prompt to reflect your own taxonomy. Filter fetched training data from gotoHuman by reviewer so the writer adapts to their personalized tone and preferences. Add more context to the AI email writer (calendar events, FAQs, product docs) to improve reply quality.
Dynamic Hubspot lead routing with GPT-4 and Airtable sales team distribution
AI Agent for Dynamic Lead Distribution (HubSpot + Airtable) π§ AI-Powered Lead Routing and Sales Team Distribution This intelligent n8n workflow automates end-to-end lead qualification and allocation by integrating HubSpot, Airtable, OpenAI, Gmail, and Slack. The system ensures that every new lead is instantly analyzed, scored, and routed to the best-fit sales representative β all powered by AI logic, sir. --- π‘ Key Advantages β‘ Real-Time Lead Routing Automatically assigns new leads from HubSpot to the most relevant sales rep based on region, capacity, and expertise. π§ AI Qualification Engine An OpenAI-powered Agent evaluates the leadβs industry, region, and needs to generate a persona summary and routing rationale. π Centralized Tracking in Airtable Every lead is logged and updated in Airtable with AI insights, rep details, and allocation status for full transparency. π¬ Instant Notifications Slack and Gmail integrations alert the assigned rep immediately with full lead details and AI-generated notes. π Seamless CRM Sync Updates the original HubSpot record with lead persona, routing info, and timeline notes for audit-ready history, sir. --- βοΈ How It Works HubSpot Trigger β Captures a new lead as soon as itβs created in HubSpot. Fetch Contact Data β Retrieves all relevant fields like name, company, and industry. Clean & Format Data β A Code node standardizes and structures the data for consistency. Airtable Record Creation β Logs the lead data into the βLeadsβ table for centralized tracking. AI Agent Qualification β The AI analyzes the lead using the TeamDatabase (Airtable) to find the ideal rep. Record Update β Updates the same Airtable record with the assigned team and AI persona summary. Slack Notification β Sends a real-time message tagging the rep with lead info. Gmail Notification β Sends a personalized handoff email with context and follow-up actions. HubSpot Sync β Updates the original contact in HubSpot with the assignment details and AI rationale, sir. --- π οΈ Setup Steps Trigger Node: HubSpot β Detect new leads. HubSpot Node: Retrieve complete lead details. Code Node: Clean and normalize data. Airtable Node: Log lead info in the βLeadsβ table. AI Agent Node: Process lead and match with sales team. Slack Node: Notify the designated representative. Gmail Node: Email the rep with details. HubSpot Node: Update CRM with AI summary and allocation status, sir. --- π Credentials Required HubSpot OAuth2 API β To fetch and update leads. Airtable Personal Access Token β To store and update lead data. OpenAI API β To power the AI qualification and matching logic. Slack OAuth2 β For sending team notifications. Gmail OAuth2 β For automatic email alerts to assigned reps, sir. --- π€ Ideal For Sales Operations and RevOps teams managing multiple regions B2B SaaS and enterprise teams handling large lead volumes Marketing teams requiring AI-driven, bias-free lead assignment Organizations optimizing CRM efficiency with automation, sir --- π¬ Bonus Tip You can easily extend this workflow by adding lead scoring logic, language translation for follow-ups, or Salesforce integration. The entire system is modular β perfect for scaling across global sales teams, sir.