Healthcare policy monitoring with ScrapeGraphAI, Pipedrive and Matrix alerts

2/3/2026

Medical Research Tracker with Matrix and Pipedrive

⚠️ COMMUNITY TEMPLATE DISCLAIMER: This is a community-contributed template that uses ScrapeGraphAI (a community node). Please ensure you have the ScrapeGraphAI community node installed in your n8n instance before using this template.

This workflow automatically monitors selected government and healthcare-policy websites, extracts newly published or updated policy documents, logs them as deals in a Pipedrive pipeline, and announces critical changes in a Matrix room. It gives healthcare administrators and policy analysts a near real-time view of policy developments without manual web checks.

Pre-conditions/Requirements

Prerequisites

n8n instance (self-hosted or n8n cloud)
ScrapeGraphAI community node installed
Active Pipedrive account with at least one pipeline
Matrix account & accessible room for notifications
Basic knowledge of n8n credential setup

Required Credentials

ScrapeGraphAI API Key – Enables the scraping engine
Pipedrive OAuth2 / API Token – Creates & updates deals
Matrix Credentials – Homeserver URL, user, access token (or password)

Specific Setup Requirements

| Variable | Description | Example |
|----------|-------------|---------|
| POLICY_SITES | Comma-separated list of URLs to scrape | https://health.gov/policies,https://who.int/proposals |
| PD_PIPELINE_ID | Pipedrive pipeline where deals are created | 5 |
| PD_STAGE_ID_ALERT | Stage ID for “Review Needed” | 17 |
| MATRIX_ROOM_ID | Room to send alerts (incl. leading !) | !policy:matrix.org |

Edit the initial Set node to provide these values before running.
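As a rough illustration of how the comma-separated POLICY_SITES value could be turned into one item per URL, here is a minimal sketch. The function name buildUrlItems and the { json: { url, source } } item shape are assumptions made for this example, not taken from the template's actual Code node.

```javascript
// Hypothetical helper: split POLICY_SITES into one n8n-style item per valid URL.
function buildUrlItems(policySites) {
  const seen = new Set();
  const items = [];
  for (const raw of String(policySites ?? "").split(",")) {
    const candidate = raw.trim();
    if (candidate === "") continue; // ignore empty entries such as "a,,b"
    let parsed;
    try {
      parsed = new URL(candidate);
    } catch {
      continue; // skip entries that are not absolute URLs
    }
    if (parsed.protocol !== "http:" && parsed.protocol !== "https:") continue;
    if (seen.has(parsed.href)) continue; // drop exact duplicates
    seen.add(parsed.href);
    items.push({ json: { url: parsed.href, source: parsed.hostname } });
  }
  return items;
}

// Example call using the sample value from the table above.
const exampleItems = buildUrlItems(
  "https://health.gov/policies,https://who.int/proposals"
);
console.log(exampleItems.map((item) => item.json.url));
```

Skipping malformed entries silently keeps one bad URL from stopping the whole run. A stricter variant could throw instead, so that configuration mistakes surface early.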

How it works

Key Steps:

Scheduled Trigger: Runs every 6 hours (configurable) to start the monitoring cycle.
Code (URL List Builder): Generates an array from POLICY_SITES for downstream batching.
SplitInBatches: Iterates through each policy URL individually.
ScrapeGraphAI: Scrapes page titles, publication dates, and summary paragraphs.
If (New vs Existing): Compares the hash of the scraped content with the hash stored from the previous run and continues only when they differ.
Merge (Aggregate Results): Collects all “new” policies into a single payload.
Set (Deal Formatter): Maps scraped data to Pipedrive deal fields.
Pipedrive Node: Creates or updates a deal per policy item.
Matrix Node: Posts a formatted alert message in the specified Matrix room.
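The "New vs Existing" check in the steps above can be sketched as follows. This assumes the hash is a SHA-256 digest over the title, date and summary, and that the last-seen hashes live in a plain object keyed by URL. Both are illustrative choices, and contentHash and filterNew are hypothetical names. In n8n, such an object might be kept in workflow static data, but the template does not say where it stores its state.

```javascript
const { createHash } = require("crypto");

// Hypothetical: hash the fields that define "the content" of a policy page.
function contentHash(policy) {
  const canonical = [policy.title, policy.date, policy.summary]
    .map((part) => String(part ?? "").trim())
    .join("\n");
  return createHash("sha256").update(canonical, "utf8").digest("hex");
}

// Return only policies whose hash differs from the one stored for their URL,
// and record the new hash in lastSeen (mutated in place).
function filterNew(policies, lastSeen) {
  const fresh = [];
  for (const policy of policies) {
    const hash = contentHash(policy);
    if (lastSeen[policy.url] === hash) continue; // unchanged since last run
    lastSeen[policy.url] = hash;
    fresh.push({ ...policy, hash });
  }
  return fresh;
}
```

Hashing only the extracted fields, rather than the raw HTML, avoids false positives when a page changes its layout or injects tracking markup without changing the policy text.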

Set up steps

Setup Time: 15-20 minutes

Install Community Node
– In n8n, go to Settings → Community Nodes → Install and search for ScrapeGraphAI.
Add Credentials
– Create new credentials for ScrapeGraphAI, Pipedrive, and Matrix under Credentials.
Configure Workflow Variables
– Open the Set (Initial Config) node and replace placeholders (POLICY_SITES, PD_PIPELINE_ID, etc.) with your values.
Review Schedule
– Double-click the Schedule Trigger node to adjust the interval if needed.
Activate Workflow
– Click Activate. The workflow will run at the next scheduled interval.
Verify Outputs
– Check Pipedrive for new deals and the Matrix room for alert messages after the first run.

Node Descriptions

Core Workflow Nodes:

stickyNote – Provides an at-a-glance description of the workflow logic directly on the canvas.
scheduleTrigger – Fires the workflow periodically (default 6 hours).
code (URL List Builder) – Splits the POLICY_SITES variable into an array.
splitInBatches – Ensures each URL is processed individually to avoid timeouts.
scrapegraphAi – Parses HTML and extracts policy metadata using XPath/CSS selectors.
if (New vs Existing) – Uses hashing to ignore unchanged pages.
merge – Combines all new items so they can be processed in bulk.
set (Deal Formatter) – Maps scraped fields to Pipedrive deal properties.
pipedrive – Creates or updates deals representing each policy update.
matrix – Sends formatted messages to a Matrix room for team visibility.

Data Flow:

scheduleTrigger → code → splitInBatches → scrapegraphAi → if → merge → set → pipedrive → matrix
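For the set → pipedrive hand-off, the mapping might look like the sketch below. The keys title, pipeline_id and stage_id follow the Pipedrive deal field names. The function name formatDeal, the title layout and the separate note text are assumptions for illustration only.

```javascript
// Hypothetical deal formatter: maps one scraped policy to Pipedrive deal fields.
function formatDeal(policy, config) {
  const pipelineId = Number.parseInt(config.PD_PIPELINE_ID, 10);
  const stageId = Number.parseInt(config.PD_STAGE_ID_ALERT, 10);
  if (Number.isNaN(pipelineId) || Number.isNaN(stageId)) {
    throw new Error("PD_PIPELINE_ID and PD_STAGE_ID_ALERT must be numeric");
  }
  return {
    deal: {
      title: `${policy.title} (${policy.source})`,
      pipeline_id: pipelineId,
      stage_id: stageId,
    },
    // Assumption: summary, link and hash travel as a note, not as deal fields.
    note: [policy.summary, `Source: ${policy.url}`, `Hash: ${policy.hash}`].join("\n"),
  };
}
```

Parsing the IDs up front means a mistyped configuration value fails in the formatter with a clear message, not later in the API call.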

Customization Examples

1. Add another data field (e.g., policy author)

// Inside ScrapeGraphAI node → Selectors
{
  "title": "//h1/text()",
  "date": "//time/@datetime",
  "summary": "//p[1]/text()",
  "author": "//span[@class='author']/text()"   // new line
}

2. Switch notifications from Matrix to Email

// Replace Matrix node with “Send Email”
{
  "to": "policy-team@example.com",
  "subject": "New Healthcare Policy Detected: {{$json.title}}",
  "text": "Summary:\n{{$json.summary}}\n\nRead more at {{$json.url}}"
}

Data Output Format

The workflow outputs structured JSON data for each new policy article:

{
  "title": "Affordable Care Expansion Act – 2024",
  "url": "https://health.gov/policies/acea-2024",
  "date": "2024-06-14T09:00:00Z",
  "summary": "Proposes expansion of coverage to rural areas...",
  "source": "health.gov",
  "hash": "2d6f1c8e3b..."
}
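A downstream step may want to confirm that each record really has this shape before creating deals. The following is a minimal sketch of such a check. The function name validatePolicyRecord and the strict UTC timestamp pattern are assumptions based on the example record above, not rules stated by the template.

```javascript
const REQUIRED_FIELDS = ["title", "url", "date", "summary", "source", "hash"];
// Assumption: dates look like the example above, e.g. 2024-06-14T09:00:00Z.
const ISO_UTC = /^\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}(\.\d+)?Z$/;

// Returns a list of human-readable problems; an empty list means the record passed.
function validatePolicyRecord(record) {
  const problems = [];
  for (const field of REQUIRED_FIELDS) {
    if (typeof record[field] !== "string" || record[field].trim() === "") {
      problems.push(`missing or empty field: ${field}`);
    }
  }
  if (typeof record.date === "string" && record.date.trim() !== "") {
    if (!ISO_UTC.test(record.date) || Number.isNaN(Date.parse(record.date))) {
      problems.push("date is not an ISO 8601 UTC timestamp");
    }
  }
  if (typeof record.url === "string" && record.url.trim() !== "") {
    try {
      new URL(record.url);
    } catch {
      problems.push("url is not an absolute URL");
    }
  }
  return problems;
}
```

Returning a list instead of throwing lets the workflow report every problem with a record in a single alert.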

Troubleshooting

Common Issues

ScrapeGraphAI returns empty objects
– Verify selectors match the current HTML structure; inspect the site with developer tools and update the node configuration.
Duplicate deals appear in Pipedrive
– Ensure the “Find or Create” option is enabled in the Pipedrive node, using the page hash or url as a unique key.
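If duplicates persist, one option is to decide explicitly between create and update before calling Pipedrive. The sketch below assumes the list of existing deals has already been fetched and that each entry carries the policy URL it was created from. The policyUrl property and the planDealActions name are hypothetical.

```javascript
// Hypothetical planner: choose create / update / skip per policy, keyed by URL.
function planDealActions(policies, existingDeals) {
  const dealIdByUrl = new Map(
    existingDeals.map((deal) => [deal.policyUrl, deal.id])
  );
  const plannedCreates = new Set();
  return policies.map((policy) => {
    if (dealIdByUrl.has(policy.url)) {
      return { action: "update", dealId: dealIdByUrl.get(policy.url), policy };
    }
    if (plannedCreates.has(policy.url)) {
      return { action: "skip", policy }; // same URL appeared twice in this run
    }
    plannedCreates.add(policy.url);
    return { action: "create", policy };
  });
}
```

Keying on the URL means a changed page updates its existing deal. Keying on the hash would instead create a new deal for every revision, which may or may not be what the team wants.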

Performance Tips

Limit POLICY_SITES to under 50 URLs per run to avoid hitting rate limits.
Increase Schedule Trigger interval if you notice ScrapeGraphAI rate-limiting.
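When the URL list grows past the suggested limit, one way to stay under it is to process a rotating window of URLs on each run. This is only a sketch. The function name selectBatch, the runIndex counter (a non-negative integer that would have to be stored between runs) and the default of 49 are assumptions chosen to match "under 50 URLs per run".

```javascript
// Hypothetical helper: pick the slice of URLs to process on a given run.
function selectBatch(urls, runIndex, maxPerRun = 49) {
  if (!Number.isInteger(maxPerRun) || maxPerRun < 1) {
    throw new RangeError("maxPerRun must be a positive integer");
  }
  if (urls.length === 0) return [];
  const windowCount = Math.ceil(urls.length / maxPerRun);
  const start = (runIndex % windowCount) * maxPerRun; // wrap around after the last window
  return urls.slice(start, start + maxPerRun);
}
```

With 120 URLs and the default limit, three consecutive runs cover the whole list, so each site is checked every third run rather than every run.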

Pro Tips:

Store historical scraped data in a database node for long-term audit trails.
Use the n8n Workflow Executions page to replay failed runs without waiting for the next schedule.
Add an Error Trigger node to emit alerts if scraping or API calls fail.

Healthcare Policy Monitoring with ScrapeGraphAI, Pipedrive, and Matrix Alerts

This n8n workflow monitors healthcare policies, extracts key information, and takes conditional actions based on the extracted data. It can update Pipedrive with new policy details and send alerts to a Matrix chat room for immediate notification.

What it does

This workflow streamlines the process of tracking healthcare policy changes by:

Triggering on a schedule: The workflow runs periodically to check for updates.
Executing a custom Python script: It calls an external Python script (presumably built on ScrapeGraphAI) to scrape and analyze healthcare policy data.
Processing scraped data: The output from the Python script is then processed within n8n.
Filtering based on policy status: It checks if a policy is marked as "active".
Updating Pipedrive: If a policy is active, it creates or updates a deal in Pipedrive with relevant policy details.
Sending Matrix alerts: Regardless of the "active" status, it sends a notification to a Matrix chat room, providing a summary of the scraped policy information.
Looping through multiple items: If the scraping returns multiple policy items, the workflow processes each one individually.
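The branching described in this list (Pipedrive only for active policies, Matrix for everything) can be sketched as below. The status field name and the case-insensitive comparison are assumptions. Adjust them to whatever the scraping step actually emits.

```javascript
// Assumption: each scraped policy carries a free-text "status" field.
function isActive(policy) {
  return String(policy.status ?? "").trim().toLowerCase() === "active";
}

// Split one list of policies into the two downstream branches.
function routePolicies(policies) {
  return {
    toPipedrive: policies.filter(isActive), // only active policies become deals
    toMatrix: [...policies], // every policy is announced
  };
}
```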

Prerequisites/Requirements

To use this workflow, you will need:

n8n Instance: A running n8n instance.
Python Environment: A Python environment with ScrapeGraphAI (or a similar web-scraping library) installed and reachable from n8n. The workflow JSON does not contain an Execute Command node; this requirement is inferred from the likely purpose of the Code node.
Pipedrive Account: An active Pipedrive account with appropriate API access.
Matrix Account: A Matrix account and a room ID to send notifications to.
n8n Credentials: Configured credentials in n8n for Pipedrive and Matrix.

Setup/Usage

Import the workflow: Import the provided JSON into your n8n instance.
Configure Credentials:
- Set up your Pipedrive API key credential in n8n.
- Set up your Matrix credential (access token and homeserver URL) in n8n.
Configure the Code node:
- Update the Code node to call your Python script (e.g., python /path/to/your/scrapegraphai_script.py). Ensure the script outputs JSON data that the subsequent n8n nodes can process.
Configure the Schedule Trigger node: Adjust the schedule to your desired frequency for monitoring (e.g., daily, hourly).
Configure the If node: Review and adjust the condition for "active" policies if your data structure or definition of "active" differs.
Configure the Pipedrive node: Map the fields from your scraped data to the appropriate fields in your Pipedrive deals (e.g., policy name to deal title, status to deal stage).
Configure the Matrix node: Customize the message content to include relevant policy details for your alerts.
Activate the workflow: Once configured, activate the workflow to start monitoring.
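For the Matrix message content, a formatter along these lines could produce both a plain-text and an HTML version of the alert. The msgtype, body, format and formatted_body keys follow the Matrix m.room.message event content. The function names and the message wording are illustrative, and how these fields map onto the n8n Matrix node's parameters is left open here.

```javascript
// Escape text before embedding it in HTML so scraped content cannot inject markup.
function escapeHtml(text) {
  return String(text)
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;");
}

// Hypothetical formatter: build Matrix message content for one policy.
function buildMatrixMessage(policy) {
  const body = `New policy update: ${policy.title}\n${policy.summary}\n${policy.url}`;
  const formattedBody =
    `<strong>New policy update:</strong> ${escapeHtml(policy.title)}<br>` +
    `${escapeHtml(policy.summary)}<br>` +
    `<a href="${escapeHtml(policy.url)}">${escapeHtml(policy.url)}</a>`;
  return {
    msgtype: "m.text",
    body, // plain-text fallback for clients that do not render HTML
    format: "org.matrix.custom.html",
    formatted_body: formattedBody,
  };
}
```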
