From db4c603f8805d432918bb4b16e3de0426afed1ae Mon Sep 17 00:00:00 2001
From: nusquama
Date: Thu, 4 Dec 2025 05:01:40 +0100
Subject: [PATCH] creation

---
 .../readme-11371.md | 514 ++++++++++++++++++
 1 file changed, 514 insertions(+)
 create mode 100644 workflows/Automate Lead Enrichment with Web Scraping, GPT-4o, Airtable and Slack Notifications-11371/readme-11371.md

diff --git a/workflows/Automate Lead Enrichment with Web Scraping, GPT-4o, Airtable and Slack Notifications-11371/readme-11371.md b/workflows/Automate Lead Enrichment with Web Scraping, GPT-4o, Airtable and Slack Notifications-11371/readme-11371.md
new file mode 100644
index 000000000..2caa4f406
--- /dev/null
+++ b/workflows/Automate Lead Enrichment with Web Scraping, GPT-4o, Airtable and Slack Notifications-11371/readme-11371.md
@@ -0,0 +1,514 @@
+Automate Lead Enrichment with Web Scraping, GPT-4o, Airtable and Slack Notifications
+
+https://n8nworkflows.xyz/workflows/automate-lead-enrichment-with-web-scraping--gpt-4o--airtable-and-slack-notifications-11371
+
+
+# Automate Lead Enrichment with Web Scraping, GPT-4o, Airtable and Slack Notifications
+
+### 1. Workflow Overview
+
+This workflow automates lead enrichment by processing form-submitted leads, scraping their websites for relevant content, analyzing the extracted data with GPT-4o, storing insights in Airtable, and sending Slack notifications to the sales team. It targets businesses seeking to optimize "speed-to-lead," automate prospect research, and enrich CRM data with AI-generated summaries.
+
+The workflow is logically divided into these blocks:
+
+- **1.1 Input Reception:** Captures lead data from an embedded web form.
+- **1.2 Website Scraping:** Fetches the lead's homepage HTML content.
+- **1.3 AI Content Analysis:** Uses GPT-4o to analyze homepage content; decides if further detail is needed.
+- **1.4 Secondary Scraping (if needed):** Scrapes a fallback URL (e.g., "About Us" page) when homepage text is vague.
+- **1.5 AI Secondary Analysis:** Analyzes the secondary page content to generate a concise summary.
+- **1.6 Data Processing:** Cleans and aggregates AI output for CRM insertion.
+- **1.7 CRM Logging:** Writes enriched lead data to Airtable.
+- **1.8 Notifications:** Sends Slack messages with the new lead and AI insights.
+- **1.9 Error Handling:** Logs scraping or processing failures into Airtable for manual review.
+
+---
+
+### 2. Block-by-Block Analysis
+
+#### 1.1 Input Reception
+
+- **Overview:**
+  Collects lead details (Name, Email, Website) via a form submission trigger node.
+
+- **Nodes Involved:**
+  - On form submission
+
+- **Node Details:**
+  - **On form submission**
+    - **Type:** Form Trigger
+    - **Role:** Entry point capturing user input from a custom form titled "Contact Us" with fields for Name (required), Email (required, email type), and Website (required).
+    - **Configuration:** Button label "Submit"; no attribution appended; includes placeholder hints for fields.
+    - **Input:** User interaction on the form.
+    - **Output:** JSON object containing submitted Name, Email, Website, and timestamp.
+    - **Edge Cases:** Form validation blocks submissions with missing required fields or an invalid email format; there is no error handling beyond that validation.
+
+---
+
+#### 1.2 Website Scraping
+
+- **Overview:**
+  Fetches the homepage HTML of the submitted website URL and extracts key textual elements for the initial AI analysis.
+
+- **Nodes Involved:**
+  - Scrape home page
+  - HTML
+
+- **Node Details:**
+  - **Scrape home page**
+    - **Type:** HTTP Request
+    - **Role:** Fetches the homepage HTML of the submitted website URL, with logic to normalize the URL (adds https:// if missing, prepends www. for root domains).
+    - **Configuration:** Returns raw HTML text; follows redirects; on error, the workflow continues instead of failing.
+    - **Input:** Website URL from form submission node.
+    - **Output:** Raw homepage HTML content as text.
+    - **Edge Cases:** Invalid or unreachable URLs make the request fail, but the workflow continues through the node's error output; malformed URL input is handled via try/catch logic.
+  - **HTML**
+    - **Type:** HTML Extractor
+    - **Role:** Parses homepage HTML content to extract:
+      - `<h1>` text (key header)
+      - All `…` elements (array)
+      - Meta description content
+      - All links in `