Automate web research with GPT-4, Claude & Apify for content analysis and insights

2/3/2026

This n8n template demonstrates how to automate comprehensive web research using multiple AI models to find, analyze, and extract insights from authoritative sources.

Typical use cases include competitive analysis, finding the latest regulatory guidance from official sources, gathering authoritative content for reports, and market research on industry developments.

Good to know

Each research query typically costs $0.08-$0.34 depending on the number of sources found and processed. The workflow includes smart filtering to minimize unnecessary API calls.

The workflow needs credentials for several AI services, so expect more setup time than with simpler templates.

Qdrant storage is optional. Removing it does not affect the research pipeline itself, but you lose duplicate detection and the stored history of processed content.

How it works

Your research question gets transformed into optimized Google search queries that target authoritative sources while filtering out low-quality sites.
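As a rough illustration of the query optimization step, the sketch below composes a Google query with `site:` and `-site:` operators. In the template this is done by a language model; the function name, its parameters, and the example domains are assumptions made for illustration, not part of the workflow.

```python
def build_search_query(question, include_sites=(), exclude_sites=()):
    """Compose a Google query string with site: include and exclude operators."""
    parts = [question.strip()]
    if include_sites:
        # An OR group restricts results to any of the preferred domains.
        group = " OR ".join(f"site:{site}" for site in include_sites)
        parts.append(f"({group})")
    # A leading minus removes a domain from the results.
    parts.extend(f"-site:{site}" for site in exclude_sites)
    return " ".join(parts)


query = build_search_query(
    "latest guidance on AI model risk management",
    include_sites=[".gov", ".edu"],            # hypothetical authoritative sources
    exclude_sites=["pinterest.com", "quora.com"],  # hypothetical low-quality sites
)
print(query)
```

Keeping the include and exclude lists as parameters makes it easy to tune which sources count as authoritative for a given research topic.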

Apify's RAG Web Browser scrapes the content and converts pages to clean markdown format.
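Outside n8n, the scraping step could be approximated with a direct call to the Apify API. The sketch below only prepares the request and does not send it. The endpoint path, the actor ID `apify~rag-web-browser`, and the input fields `query`, `maxResults`, and `outputFormats` reflect my understanding of Apify's public API and should be checked against the actor's documentation; the token value is a placeholder.

```python
import json
import urllib.request

# Assumed endpoint: runs the actor synchronously and returns its dataset items.
APIFY_ENDPOINT = (
    "https://api.apify.com/v2/acts/apify~rag-web-browser/"
    "run-sync-get-dataset-items"
)


def build_scrape_request(query, token, max_results=3):
    """Prepare (but do not send) a POST request asking for markdown output."""
    payload = {
        "query": query,                  # search phrase or a single URL
        "maxResults": max_results,       # number of pages to scrape
        "outputFormats": ["markdown"],   # request clean markdown
    }
    return urllib.request.Request(
        APIFY_ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


request = build_scrape_request("climate disclosure rule site:sec.gov", token="YOUR_APIFY_TOKEN")
print(request.full_url)
print(request.data.decode("utf-8"))
```

Passing the prepared request to `urllib.request.urlopen` would execute the run; the response body is expected to be a JSON array of scraped pages.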

Claude Sonnet 4 evaluates each article for relevance and quality before full processing.

Articles that pass the filter get analyzed in parallel - one pipeline creates focused summaries while another extracts specific claims and evidence.
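The two parallel pipelines can be pictured as two independent tasks run on the same article. The sketch below uses a thread pool; `summarize` and `extract_claims` are trivial placeholders standing in for the LLM calls, and the article structure is an assumption.

```python
from concurrent.futures import ThreadPoolExecutor


def summarize(article):
    # Placeholder for the summarization model call: keeps the first 80 characters.
    return article["markdown"][:80]


def extract_claims(article):
    # Placeholder for the claim-extraction model call:
    # keeps sentences that contain at least one digit.
    sentences = [s.strip() for s in article["markdown"].split(".") if s.strip()]
    return [s for s in sentences if any(ch.isdigit() for ch in s)]


def analyze(article):
    """Run both pipelines concurrently and combine their outputs."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        summary_future = pool.submit(summarize, article)
        claims_future = pool.submit(extract_claims, article)
        return {
            "url": article["url"],
            "summary": summary_future.result(),
            "claims": claims_future.result(),
        }


article = {
    "url": "https://example.com/report",
    "markdown": "Revenue grew 12% in 2024. Management expects further growth.",
}
result = analyze(article)
print(result)
```

Because the two tasks do not depend on each other, the total latency per article is roughly that of the slower call rather than the sum of both.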

GPT-4.1 Mini ranks all findings and presents the top 3 most valuable insights and summaries.
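In the template the ranking is produced by the model itself. If each finding carried a numeric score, selecting the top 3 would reduce to the following sketch; the `score` field and the sample findings are hypothetical.

```python
import heapq


def top_insights(findings, k=3):
    """Return the k findings with the highest score, best first."""
    return heapq.nlargest(k, findings, key=lambda finding: finding["score"])


findings = [
    {"title": "A", "score": 0.41},
    {"title": "B", "score": 0.93},
    {"title": "C", "score": 0.67},
    {"title": "D", "score": 0.88},
]
best = top_insights(findings)
print([finding["title"] for finding in best])
```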

All processed content gets stored in your Qdrant vector database to prevent duplicate processing and enable future reference.
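One way to implement duplicate detection is to derive a stable ID from each source URL and check whether that ID already exists before processing. The sketch below is an assumption about how this could work, not the template's actual logic: it generates a UUID (Qdrant accepts UUIDs as point IDs) and uses an in-memory set as a stand-in for the lookup against the Qdrant collection.

```python
import uuid
from urllib.parse import urlsplit, urlunsplit


def normalize_url(url):
    """Lowercase scheme and host, drop the fragment and any trailing slash."""
    parts = urlsplit(url.strip())
    path = parts.path.rstrip("/") or "/"
    return urlunsplit((parts.scheme.lower(), parts.netloc.lower(), path, parts.query, ""))


def point_id_for(url):
    # uuid5 is deterministic: the same normalized URL always yields the same ID.
    return str(uuid.uuid5(uuid.NAMESPACE_URL, normalize_url(url)))


seen_ids = set()  # stand-in for querying the Qdrant collection by point ID


def should_process(url):
    """Return True the first time a URL is seen, False for repeats."""
    point_id = point_id_for(url)
    if point_id in seen_ids:
        return False
    seen_ids.add(point_id)
    return True


print(should_process("https://Example.com/Report/#top"))
print(should_process("https://example.com/Report"))
```

Deterministic IDs also mean that re-storing the same article overwrites the earlier point instead of creating a second copy.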

How to use

The manual trigger node is only an example. You can replace it with other triggers such as a webhook, a form submission, or a schedule.

You can modify the configuration variables in the Set Node to customize Qdrant URLs, collection names, and quality thresholds for your specific needs.
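To make the role of these variables concrete, here is a minimal sketch of a configuration object and a threshold check applied to an article's evaluation. All field names and default values are assumptions for illustration (the URL uses Qdrant's default local REST port); the actual names in the Set node may differ.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ResearchConfig:
    qdrant_url: str = "http://localhost:6333"   # assumed local Qdrant instance
    collection_name: str = "web_research"       # hypothetical collection name
    quality_threshold: float = 0.7              # hypothetical cut-off on a 0-1 scale


def passes_filter(evaluation, config):
    """Keep an article only if both scores reach the configured threshold."""
    return (
        evaluation["relevance"] >= config.quality_threshold
        and evaluation["quality"] >= config.quality_threshold
    )


config = ResearchConfig(quality_threshold=0.6)
print(passes_filter({"relevance": 0.8, "quality": 0.65}, config))
print(passes_filter({"relevance": 0.9, "quality": 0.40}, config))
```

Raising the threshold reduces the number of articles that reach full processing, which lowers cost per query at the risk of discarding borderline sources.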

Requirements

OpenAI API account for GPT-4.1 Mini (query optimization, summarization, ranking)

Anthropic API account for Claude Sonnet 4 (content filtering)

Apify account for web scraping capabilities

Qdrant vector database instance (local or cloud)

Ollama with nomic-embed-text model for embeddings

Customizing this workflow

Web research automation can be adapted for many specialized use cases. Try focusing on specific domains like legal research (targeting .gov and .edu sites), medical research (PubMed and health authorities), or financial analysis (SEC filings and analyst reports).
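A domain specialization could be expressed as a small table of presets that map a research area to the sites it should target. The preset names follow the examples above; the specific domains chosen for each preset are illustrative assumptions.

```python
# Hypothetical presets: which sites to target for each research area.
DOMAIN_PRESETS = {
    "legal": [".gov", ".edu"],
    "medical": ["pubmed.ncbi.nlm.nih.gov", "who.int"],
    "financial": ["sec.gov"],
}


def site_clause(preset):
    """Build the site restriction clause for a named preset."""
    sites = DOMAIN_PRESETS[preset]
    return " OR ".join(f"site:{site}" for site in sites)


print(site_clause("legal"))
print(site_clause("financial"))
```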

Automate Web Research with GPT-4/Claude & Apify for Content Analysis and Insights

This n8n workflow automates the process of extracting content from web pages, analyzing it with advanced AI models (GPT-4 or Claude), and generating structured insights. It's designed to streamline web research, content summarization, and data extraction for various applications.

What it does

This workflow performs the following key steps:

Triggers Manually or via External Workflow: The workflow can be executed manually or by another n8n workflow.
Initial Data Setup: Sets up an initial item with a placeholder text field, which is intended to be replaced by actual web content.
Fetches Web Content (Placeholder): An HTTP Request node is present, likely intended to fetch content from a URL. In its current state, it's a placeholder.
Extracts & Processes Content:
- Code Node: A Code node is used to define a structured output schema (using Zod) for the AI to follow. This schema expects a title, summary, keywords, and main_points from the analyzed content.
- Character Text Splitter: Splits the input text into manageable chunks for processing by the AI.
- Default Data Loader: Loads the split text chunks as documents.
- Qdrant Vector Store: Stores the processed documents as vectors in a Qdrant vector database, enabling efficient semantic search and retrieval.
- Embeddings Ollama: Generates embeddings for the text chunks using the Ollama embedding model, which are then used by the Qdrant Vector Store.
Analyzes Content with AI Agent:
- AI Agent: Utilizes an AI Agent (LangChain) to perform complex reasoning and content analysis based on the provided text and the defined output schema.
- OpenAI Chat Model / Anthropic Chat Model: The AI Agent can leverage either an OpenAI (GPT-4) or Anthropic (Claude) chat model for its analytical tasks. The workflow includes both, allowing for flexibility.
- Structured Output Parser: Parses the AI agent's response to ensure it adheres to the predefined JSON schema, extracting the title, summary, keywords, and main_points.
Conditional AI Model Selection: An If node is included, which suggests conditional logic for choosing between AI models or processing paths. The specific condition is not defined in the provided JSON.
Merges Results: A Merge node is present, likely to combine results from different branches or steps of the workflow.
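The text splitting step in the list above can be illustrated with a simple fixed-size splitter. This is a simplified stand-in, not the Character Text Splitter node's actual algorithm: it cuts the text into overlapping character windows, and the parameter names and defaults are assumptions.

```python
def split_text(text, chunk_size=200, chunk_overlap=20):
    """Split text into character windows that overlap by chunk_overlap."""
    if chunk_overlap >= chunk_size:
        raise ValueError("chunk_overlap must be smaller than chunk_size")
    step = chunk_size - chunk_overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current window reaches the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks


chunks = split_text("abcdefghij", chunk_size=4, chunk_overlap=1)
print(chunks)
```

The overlap keeps a little shared context between neighbouring chunks, so a sentence cut at a boundary still appears whole in at least part of one chunk before embedding.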

Prerequisites/Requirements

To use this workflow, you will need:

n8n Instance: A running instance of n8n.
OpenAI API Key: For using the OpenAI Chat Model (e.g., GPT-4).
Anthropic API Key: For using the Anthropic Chat Model (e.g., Claude).
Qdrant Vector Database: Access to a Qdrant instance for vector storage.
Ollama Embeddings: An Ollama instance running with the desired embedding model for text embeddings.
Apify (Implied): While not explicitly present in the provided JSON, the directory name suggests an integration with Apify for web scraping. If used, an Apify account and API key would be required.

Setup/Usage

Import the Workflow: Download the workflow JSON and import it into your n8n instance.
Configure Credentials:
- Set up credentials for your OpenAI and/or Anthropic API keys.
- Configure the Qdrant Vector Store node with your Qdrant instance details (host, API key).
- Configure the Embeddings Ollama node with your Ollama instance details.
Customize Web Content Source:
- The HTTP Request node (ID 19) is currently a placeholder. Replace it with a node that fetches the actual web content you wish to analyze. This could be an Apify node, another HTTP Request configured to scrape a specific URL, or a Webhook to receive content.
Adjust AI Agent Prompt and Schema:
- Review the Code node (ID 834) to understand the structured output schema. Modify it if you need different fields from the AI analysis.
- Adjust the AI Agent node (ID 1119) to refine the prompt and instructions given to the AI for content analysis.
Define Conditional Logic (Optional):
- If you intend to use the If node (ID 20) for conditional routing (e.g., choosing between OpenAI and Anthropic based on input), configure its conditions accordingly.
Activate and Execute: Once configured, activate the workflow. You can execute it manually or trigger it via an external source as per your setup.
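When adjusting the output schema, it can help to see what the structured output check amounts to. The workflow defines its schema with Zod inside the Code node; the sketch below is a plain Python equivalent that checks for the four fields named earlier. The field types (strings for `title` and `summary`, lists for `keywords` and `main_points`) are assumptions.

```python
import json

# Expected fields and their assumed types.
REQUIRED_FIELDS = {
    "title": str,
    "summary": str,
    "keywords": list,
    "main_points": list,
}


def parse_analysis(raw):
    """Parse the model's JSON reply and verify it matches the expected shape."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    for name, expected_type in REQUIRED_FIELDS.items():
        if name not in data:
            raise ValueError(f"missing field: {name}")
        if not isinstance(data[name], expected_type):
            raise ValueError(f"wrong type for field: {name}")
    # Drop any extra keys the model may have added.
    return {name: data[name] for name in REQUIRED_FIELDS}


reply = json.dumps({
    "title": "Example article",
    "summary": "A short summary.",
    "keywords": ["research", "automation"],
    "main_points": ["First point", "Second point"],
    "extra": "ignored",
})
print(parse_analysis(reply))
```

Adding a field to the analysis then means adding one entry to the schema and updating the agent prompt to ask for it.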

This workflow provides a powerful foundation for automated web research and content analysis, leveraging the strengths of large language models and vector databases.
