Extract Instagram profile data with Apify and store in Google Sheets
Workflow Overview
This cutting-edge n8n automation is a powerful social media intelligence gathering tool designed to transform Instagram profile research into a seamless, automated process. By intelligently combining web scraping, data formatting, and cloud storage technologies, this workflow:
-
Discovers Profile Insights:
- Automatically scrapes Instagram profile data
- Captures comprehensive profile metrics
- Extracts critical social media intelligence
-
Intelligent Data Capture:
- Retrieves follower counts
- Collects biographical information
- Captures profile picture and external links
-
Seamless Data Logging:
- Automatically stores data in Google Sheets
- Creates a living, updateable database
- Enables easy analysis and tracking
Key Benefits
- π€ Full Automation: Instant profile intelligence
- π‘ Comprehensive Insights: Detailed social media metrics
- π Effortless Tracking: Automated data collection
- π Multi-Purpose Research: Flexible data gathering
Workflow Architecture
πΉ Stage 1: Trigger & Input
- Form-Based Trigger: Manual username submission
- Webhook Support: Flexible data entry methods
- User-Driven Initiation
πΉ Stage 2: Web Scraping
- Apify Integration: Robust Instagram data extraction
- Comprehensive Profile Scanning:
- Followers count
- Following count
- Profile biography
- Profile picture URL
πΉ Stage 3: Data Formatting
- Intelligent Data Mapping
- Standardized Data Structure
- Preparation for Storage
πΉ Stage 4: Cloud Logging
- Google Sheets Integration
- Persistent Data Storage
- Easy Access and Analysis
Potential Use Cases
- Influencer Marketing: Talent identification
- Competitive Intelligence: Audience research
- Social Media Analysis: Performance tracking
- Recruitment: Talent scouting
- Brand Partnerships: Collaboration opportunities
Setup Requirements
-
Apify Account
- Instagram scraping actor
- API token
- Configured scraping parameters
-
Google Sheets
- Connected Google account
- Prepared tracking spreadsheet
- Appropriate sharing settings
-
n8n Installation
- Cloud or self-hosted instance
- Workflow configuration
- API credential management
Future Enhancement Suggestions
- π€ Advanced profile scoring
- π Engagement rate calculation
- π Real-time change alerts
- π Multi-platform profile tracking
- π§ AI-powered insights generation
Technical Considerations
- Implement robust error handling
- Use exponential backoff for API calls
- Maintain flexible data extraction strategies
- Ensure compliance with platform terms of service
Ethical Guidelines
- Respect user privacy
- Use data for legitimate research
- Maintain transparent data collection practices
- Provide opt-out mechanisms
Connect With Me
Ready to unlock social media insights?
π§ Email: Yaron@nofluff.online
π₯ YouTube: @YaronBeen
πΌ LinkedIn: Yaron Been
Transform your social media research with intelligent, automated workflows!
#InstagramDataScraping #SocialMediaIntelligence #InfluencerMarketing #DataAutomation #AIResearch #MarketingTechnology #SocialMediaAnalytics #ProfileIntelligence #WebScraping #MarketingTech
Extract Instagram Profile Data with Apify and Store in Google Sheets
This n8n workflow automates the process of extracting Instagram profile data using the Apify Instagram Profile Scraper and then storing that data in a Google Sheet. It's designed to be triggered manually via an n8n form, allowing users to input the Instagram profile URL they wish to scrape.
What it does
- Triggers on Form Submission: The workflow starts when a user submits a form, providing the Instagram profile URL they want to scrape.
- Scrapes Instagram Profile Data: It makes an HTTP request to the Apify Instagram Profile Scraper API, passing the provided Instagram URL to initiate the scraping process.
- Transforms Data: It processes the data received from Apify, specifically extracting the
urlandfull_namefields from theitemarray within the Apify response. This ensures only relevant data is prepared for storage. - Stores Data in Google Sheets: Finally, it appends the extracted Instagram profile URL and full name to a specified Google Sheet.
Prerequisites/Requirements
- n8n Instance: A running n8n instance to import and execute the workflow.
- Apify Account & API Key: An Apify account with access to the Instagram Profile Scraper and an associated API token.
- Google Account: A Google account with access to Google Sheets.
- Google Sheets Credential in n8n: A Google Sheets OAuth2 or Service Account credential configured in your n8n instance.
Setup/Usage
- Import the Workflow:
- Download the provided JSON file.
- In your n8n instance, go to "Workflows" and click "New".
- Click the "Import from JSON" button and paste the workflow JSON or upload the file.
- Configure Credentials:
- HTTP Request (Apify):
- Edit the "HTTP Request" node.
- In the URL field, replace
YOUR_APIFY_API_TOKENwith your actual Apify API token. - Ensure the
instagramUrlsparameter in the JSON body is correctly mapped to the input from the "On form submission" node (e.g.,{{ $('On form submission').item.json.instagramUrl }}).
- Google Sheets:
- Edit the "Google Sheets" node.
- Select or create a Google Sheets credential. If creating a new one, follow the n8n documentation for Google Sheets OAuth2 or Service Account setup.
- Specify the "Spreadsheet ID" and "Sheet Name" where you want to store the Instagram profile data.
- Ensure the "Values" are correctly mapped to the output of the "Edit Fields" node (e.g.,
{{ $json.url }},{{ $json.full_name }}).
- HTTP Request (Apify):
- Activate the Workflow:
- Click the "Activate" toggle in the top right corner of the workflow editor to enable the workflow.
- Use the Form Trigger:
- The "On form submission" node will generate a unique URL. You can access this URL to manually trigger the workflow by submitting an Instagram profile URL.
- Share this form URL with anyone who needs to trigger the scraping process.
Once configured and activated, submitting an Instagram profile URL via the n8n form will automatically scrape its data using Apify and add the extracted URL and full name to your designated Google Sheet.
Related Templates
Automate invoice processing with OCR, GPT-4 & Salesforce opportunity creation
PDF Invoice Extractor (AI) End-to-end pipeline: Watch Drive β Download PDF β OCR text β AI normalize to JSON β Upsert Buyer (Account) β Create Opportunity β Map Products β Create OLI via Composite API β Archive to OneDrive. --- Node by node (what it does & key setup) 1) Google Drive Trigger Purpose: Fire when a new file appears in a specific Google Drive folder. Key settings: Event: fileCreated Folder ID: google drive folder id Polling: everyMinute Creds: googleDriveOAuth2Api Output: Metadata { id, name, ... } for the new file. --- 2) Download File From Google Purpose: Get the file binary for processing and archiving. Key settings: Operation: download File ID: ={{ $json.id }} Creds: googleDriveOAuth2Api Output: Binary (default key: data) and original metadata. --- 3) Extract from File Purpose: Extract text from PDF (OCR as needed) for AI parsing. Key settings: Operation: pdf OCR: enable for scanned PDFs (in options) Output: JSON with OCR text at {{ $json.text }}. --- 4) Message a model (AI JSON Extractor) Purpose: Convert OCR text into strict normalized JSON array (invoice schema). Key settings: Node: @n8n/n8n-nodes-langchain.openAi Model: gpt-4.1 (or gpt-4.1-mini) Message role: system (the strict prompt; references {{ $json.text }}) jsonOutput: true Creds: openAiApi Output (per item): $.message.content β the parsed JSON (ensure itβs an array). --- 5) Create or update an account (Salesforce) Purpose: Upsert Buyer as Account using an external ID. Key settings: Resource: account Operation: upsert External Id Field: taxid_c External Id Value: ={{ $json.message.content.buyer.tax_id }} Name: ={{ $json.message.content.buyer.name }} Creds: salesforceOAuth2Api Output: Account record (captures Id) for downstream Opportunity. --- 6) Create an opportunity (Salesforce) Purpose: Create Opportunity linked to the Buyer (Account). Key settings: Resource: opportunity Name: ={{ $('Message a model').item.json.message.content.invoice.code }} Close Date: ={{ $('Message a model').item.json.message.content.invoice.issue_date }} Stage: Closed Won Amount: ={{ $('Message a model').item.json.message.content.summary.grand_total }} AccountId: ={{ $json.id }} (from Upsert Account output) Creds: salesforceOAuth2Api Output: Opportunity Id for OLI creation. --- 7) Build SOQL (Code / JS) Purpose: Collect unique product codes from AI JSON and build a SOQL query for PricebookEntry by Pricebook2Id. Key settings: pricebook2Id (hardcoded in script): e.g., 01sxxxxxxxxxxxxxxx Source lines: $('Message a model').first().json.message.content.products Output: { soql, codes } --- 8) Query PricebookEntries (Salesforce) Purpose: Fetch PricebookEntry.Id for each Product2.ProductCode. Key settings: Resource: search Query: ={{ $json.soql }} Creds: salesforceOAuth2Api Output: Items with Id, Product2.ProductCode (used for mapping). --- 9) Code in JavaScript (Build OLI payloads) Purpose: Join lines with PBE results and Opportunity Id β build OpportunityLineItem payloads. Inputs: OpportunityId: ={{ $('Create an opportunity').first().json.id }} Lines: ={{ $('Message a model').first().json.message.content.products }} PBE rows: from previous node items Output: { body: { allOrNone:false, records:[{ OpportunityLineItem... }] } } Notes: Converts discount_total β per-unit if needed (currently commented for standard pricing). Throws on missing PBE mapping or empty lines. --- 10) Create Opportunity Line Items (HTTP Request) Purpose: Bulk create OLIs via Salesforce Composite API. Key settings: Method: POST URL: https://<your-instance>.my.salesforce.com/services/data/v65.0/composite/sobjects Auth: salesforceOAuth2Api (predefined credential) Body (JSON): ={{ $json.body }} Output: Composite API results (per-record statuses). --- 11) Update File to One Drive Purpose: Archive the original PDF in OneDrive. Key settings: Operation: upload File Name: ={{ $json.name }} Parent Folder ID: onedrive folder id Binary Data: true (from the Download node) Creds: microsoftOneDriveOAuth2Api Output: Uploaded file metadata. --- Data flow (wiring) Google Drive Trigger β Download File From Google Download File From Google β Extract from File β Update File to One Drive Extract from File β Message a model Message a model β Create or update an account Create or update an account β Create an opportunity Create an opportunity β Build SOQL Build SOQL β Query PricebookEntries Query PricebookEntries β Code in JavaScript Code in JavaScript β Create Opportunity Line Items --- Quick setup checklist π Credentials: Connect Google Drive, OneDrive, Salesforce, OpenAI. π IDs: Drive Folder ID (watch) OneDrive Parent Folder ID (archive) Salesforce Pricebook2Id (in the JS SOQL builder) π§ AI Prompt: Use the strict system prompt; jsonOutput = true. π§Ύ Field mappings: Buyer tax id/name β Account upsert fields Invoice code/date/amount β Opportunity fields Product name must equal your Product2.ProductCode in SF. β Test: Drop a sample PDF β verify: AI returns array JSON only Account/Opportunity created OLI records created PDF archived to OneDrive --- Notes & best practices If PDFs are scans, enable OCR in Extract from File. If AI returns non-JSON, keep βReturn only a JSON arrayβ as the last line of the prompt and keep jsonOutput enabled. Consider adding validation on parsing.warnings to gate Salesforce writes. For discounts/taxes in OLI: Standard OLI fields donβt support per-line discount amounts directly; model them in UnitPrice or custom fields. Replace the Composite API URL with your orgβs domain or use the Salesforce nodeβs Bulk Upsert for simplicity.
Dynamic Hubspot lead routing with GPT-4 and Airtable sales team distribution
AI Agent for Dynamic Lead Distribution (HubSpot + Airtable) π§ AI-Powered Lead Routing and Sales Team Distribution This intelligent n8n workflow automates end-to-end lead qualification and allocation by integrating HubSpot, Airtable, OpenAI, Gmail, and Slack. The system ensures that every new lead is instantly analyzed, scored, and routed to the best-fit sales representative β all powered by AI logic, sir. --- π‘ Key Advantages β‘ Real-Time Lead Routing Automatically assigns new leads from HubSpot to the most relevant sales rep based on region, capacity, and expertise. π§ AI Qualification Engine An OpenAI-powered Agent evaluates the leadβs industry, region, and needs to generate a persona summary and routing rationale. π Centralized Tracking in Airtable Every lead is logged and updated in Airtable with AI insights, rep details, and allocation status for full transparency. π¬ Instant Notifications Slack and Gmail integrations alert the assigned rep immediately with full lead details and AI-generated notes. π Seamless CRM Sync Updates the original HubSpot record with lead persona, routing info, and timeline notes for audit-ready history, sir. --- βοΈ How It Works HubSpot Trigger β Captures a new lead as soon as itβs created in HubSpot. Fetch Contact Data β Retrieves all relevant fields like name, company, and industry. Clean & Format Data β A Code node standardizes and structures the data for consistency. Airtable Record Creation β Logs the lead data into the βLeadsβ table for centralized tracking. AI Agent Qualification β The AI analyzes the lead using the TeamDatabase (Airtable) to find the ideal rep. Record Update β Updates the same Airtable record with the assigned team and AI persona summary. Slack Notification β Sends a real-time message tagging the rep with lead info. Gmail Notification β Sends a personalized handoff email with context and follow-up actions. HubSpot Sync β Updates the original contact in HubSpot with the assignment details and AI rationale, sir. --- π οΈ Setup Steps Trigger Node: HubSpot β Detect new leads. HubSpot Node: Retrieve complete lead details. Code Node: Clean and normalize data. Airtable Node: Log lead info in the βLeadsβ table. AI Agent Node: Process lead and match with sales team. Slack Node: Notify the designated representative. Gmail Node: Email the rep with details. HubSpot Node: Update CRM with AI summary and allocation status, sir. --- π Credentials Required HubSpot OAuth2 API β To fetch and update leads. Airtable Personal Access Token β To store and update lead data. OpenAI API β To power the AI qualification and matching logic. Slack OAuth2 β For sending team notifications. Gmail OAuth2 β For automatic email alerts to assigned reps, sir. --- π€ Ideal For Sales Operations and RevOps teams managing multiple regions B2B SaaS and enterprise teams handling large lead volumes Marketing teams requiring AI-driven, bias-free lead assignment Organizations optimizing CRM efficiency with automation, sir --- π¬ Bonus Tip You can easily extend this workflow by adding lead scoring logic, language translation for follow-ups, or Salesforce integration. The entire system is modular β perfect for scaling across global sales teams, sir.
Automated weekly security audit reports with Gmail delivery
π N8N Security Audit Report - Automated Weekly Email π― What does this workflow do? This workflow automatically generates and emails a comprehensive security audit report for your N8N instance every week. It identifies potential security risks related to: Credentials π : Exposed or insecure credentials Nodes π§© : Sensitive nodes (Code, HTTP Request, SSH, FTP, etc.) Instance settings π’ : Global security configuration Community nodes π¦ : Third-party nodes that may pose risks The report includes direct links to affected workflows, execution statuses, and actionable recommendations. --- β¨ Key Features π Smart Risk Assessment Calculates overall risk level: π© Low / π§ Moderate / π₯ High Tracks unique credentials (not just total occurrences) Provides detailed breakdown by node type π Direct Workflow Links Clickable links to each workflow mentioned Shows last execution status (π’ success / π΄ failed) Displays execution timestamps π Bilingual Support Full support for French and English Switch language with a single variable π§ Beautiful HTML Email Clean, professional formatting Color-coded risk levels Emoji icons for easy scanning --- π Quick Setup (5 minutes) 1οΈβ£ Configure Credentials N8N API: Generate an API key in your N8N settings Gmail OAuth2: Set up OAuth2 for Gmail sending 2οΈβ£ Set Your Variables Edit the "Set Config Variables" node: javascript { "email_to": "your.email@domain.com", "project_name": "My-N8N-Project", "server_url": "https://n8n.yourdomain.com", // NO trailing slash! "Language": "EN" // or "FR" } 3οΈβ£ Test & Activate Click "Execute Workflow" to test Check your email inbox Activate for weekly automation --- π§ Example Report Output Subject: π Audit Report My-Project β Risk π§ Moderate Content: π Summary β’ Credentials involved: 8 (5 unique) β’ Nodes involved: 12 π» code: 4 π httpRequest: 3 π ssh: 2 β’ Community nodes: 1 β’ Overall risk level: π§ Moderate π Credentials Risk Report πΉ Credentials with full access π My AWS Credentials π Database Admin π Workflow: Data Processing Pipeline π’ (25-10-2024 06:15 β 06:16) π» Process Data π API Call π§© Nodes Risk Report [...detailed node analysis...] --- π¨ Customization Options Change Schedule Modify the "Schedule Trigger" node to run: Daily at 8 AM Monthly on the 1st Custom cron expression Add Recipients Add multiple emails in the Gmail node's toList parameter Adjust Risk Thresholds Edit the JavaScript in "Format Audit Report" nodes to customize when risk levels change Use Different Email Service Replace Gmail node with: SMTP Microsoft Outlook SendGrid Any email service N8N supports --- π‘ Use Cases β Compliance Monitoring: Track security posture for audits β Team Awareness: Keep your team informed of security status β Change Detection: Notice when new risky nodes are added β Best Practices: Get recommendations to improve security β Multi-Environment: Run separate instances for dev/staging/prod --- π§ Technical Details Nodes Used: 8 Credentials Required: 2 (N8N API + Gmail OAuth2) External Dependencies: None N8N Version: Compatible with latest N8N versions Execution Time: ~10-20 seconds --- π Requirements N8N instance with API access Gmail account (or other email service) N8N API key with audit permissions Valid SSL certificate for workflow links (recommended) --- π Troubleshooting Empty report? β Check your N8N API key has audit permissions Workflow links don't work? β Verify server_url is correct and has no trailing slash No execution status shown? β Workflows must have been executed at least once Wrong language displayed? β Set Language to exactly "FR" or "EN" (uppercase) --- π Why This Template? Unlike basic monitoring tools, this workflow: β Provides context-aware security analysis β Links directly to affected workflows β Shows real execution data (not just theoretical risks) β Calculates unique credential exposure (not just counts) β Supports bilingual reports β Delivers actionable recommendations --- π€ Feedback & Support Found this helpful? Please rate the template! Have suggestions? Drop a comment below. Pro tip: Combine this with N8N's native alerting for real-time incident response! --- Tags: security audit monitoring compliance automation email reporting credentials governance --- π License MIT - Feel free to modify and share!