Scrape And Summarize Webpages With Ai

1 Tools

Explore Tool Categories

Find Agencies for This Project

Know a Better Recipe?

Help the community by sharing your proven tool combinations.

Submit a Recipe

About this recipe

A recipe for Scrape And Summarize Webpages With Ai

Recipe Template

{
    "meta": {
        "instanceId": "408f9fb9940c3cb18ffdef0e0150fe342d6e655c3a9fac21f0f644e8bedabcd9"
    },
    "nodes": [
        {
            "id": "67850bd7-f9f4-4d5b-8c9e-bd1451247ba6",
            "name": "When clicking \"Execute Workflow\"",
            "type": "n8n-nodes-base.manualTrigger",
            "position": [
                -740,
                1000
            ],
            "parameters": [],
            "typeVersion": 1
        },
        {
            "id": "0d9133f9-b6d3-4101-95c6-3cd24cdb70c3",
            "name": "Fetch essay list",
            "type": "n8n-nodes-base.httpRequest",
            "position": [
                -520,
                1000
            ],
            "parameters": {
                "url": "http:\/\/www.paulgraham.com\/articles.html",
                "options": []
            },
            "typeVersion": 4.1
        },
        {
            "id": "ee634297-a456-4f70-a995-55b02950571e",
            "name": "Extract essay names",
            "type": "n8n-nodes-base.html",
            "position": [
                -300,
                1000
            ],
            "parameters": {
                "options": [],
                "operation": "extractHtmlContent",
                "dataPropertyName": "=data",
                "extractionValues": {
                    "values": [
                        {
                            "key": "essay",
                            "attribute": "href",
                            "cssSelector": "table table a",
                            "returnArray": true,
                            "returnValue": "attribute"
                        }
                    ]
                }
            },
            "typeVersion": 1
        },
        {
            "id": "83d75693-dbb8-44c4-8533-da06f611c59c",
            "name": "Fetch essay texts",
            "type": "n8n-nodes-base.httpRequest",
            "position": [
                360,
                1000
            ],
            "parameters": {
                "url": "=http:\/\/www.paulgraham.com\/{{ $json.essay }}",
                "options": []
            },
            "typeVersion": 4.1
        },
        {
            "id": "151022b5-8570-4176-bf3f-137f27ac7036",
            "name": "Extract title",
            "type": "n8n-nodes-base.html",
            "position": [
                700,
                700
            ],
            "parameters": {
                "options": [],
                "operation": "extractHtmlContent",
                "extractionValues": {
                    "values": [
                        {
                            "key": "title",
                            "cssSelector": "title"
                        }
                    ]
                }
            },
            "typeVersion": 1
        },
        {
            "id": "07bcf095-3c4d-4a72-9bcb-341411750ff5",
            "name": "Clean up",
            "type": "n8n-nodes-base.set",
            "position": [
                1360,
                980
            ],
            "parameters": {
                "fields": {
                    "values": [
                        {
                            "name": "title",
                            "stringValue": "={{ $json.title }}"
                        },
                        {
                            "name": "summary",
                            "stringValue": "={{ $json.response.text }}"
                        },
                        {
                            "name": "url",
                            "stringValue": "=http:\/\/www.paulgraham.com\/{{ $('Limit to first 3').item.json.essay }}"
                        }
                    ]
                },
                "include": "none",
                "options": []
            },
            "typeVersion": 3
        },
        {
            "id": "11285de0-3c5d-4296-a322-9b7585af9acc",
            "name": "Sticky Note",
            "type": "n8n-nodes-base.stickyNote",
            "position": [
                -580,
                920
            ],
            "parameters": {
                "width": 1071.752021563343,
                "height": 285.66037735849045,
                "content": "## Scrape latest Paul Graham essays"
            },
            "typeVersion": 1
        },
        {
            "id": "c32f905d-dd7a-4b68-bbe0-dd8115ee0944",
            "name": "Sticky Note1",
            "type": "n8n-nodes-base.stickyNote",
            "position": [
                620,
                920
            ],
            "parameters": {
                "width": 465.3908355795153,
                "height": 606.7924528301882,
                "content": "## Summarize them with GPT"
            },
            "typeVersion": 1
        },
        {
            "id": "29d264f4-df6d-4a41-ab38-58e1b1becc9a",
            "name": "Split out into items",
            "type": "n8n-nodes-base.splitOut",
            "position": [
                -80,
                1000
            ],
            "parameters": {
                "options": [],
                "fieldToSplitOut": "essay"
            },
            "typeVersion": 1
        },
        {
            "id": "ccfa3a1d-f170-44b4-a1f2-3573c88cae98",
            "name": "Limit to first 3",
            "type": "n8n-nodes-base.limit",
            "position": [
                140,
                1000
            ],
            "parameters": {
                "maxItems": 3
            },
            "typeVersion": 1
        },
        {
            "id": "c3d05068-9d1a-4ef5-8249-e7384dc617ee",
            "name": "Default Data Loader",
            "type": "@n8n\/n8n-nodes-langchain.documentDefaultDataLoader",
            "position": [
                820,
                1200
            ],
            "parameters": {
                "options": []
            },
            "typeVersion": 1
        },
        {
            "id": "db75adad-cb16-4e72-b16e-34684a733b05",
            "name": "Recursive Character Text Splitter",
            "type": "@n8n\/n8n-nodes-langchain.textSplitterRecursiveCharacterTextSplitter",
            "position": [
                820,
                1340
            ],
            "parameters": {
                "options": []
            },
            "typeVersion": 1
        },
        {
            "id": "022cc091-9b4c-45c2-bc8e-4037ec2d0d60",
            "name": "OpenAI Chat Model1",
            "type": "@n8n\/n8n-nodes-langchain.lmChatOpenAi",
            "position": [
                680,
                1200
            ],
            "parameters": {
                "model": "gpt-4o-mini",
                "options": []
            },
            "credentials": {
                "openAiApi": {
                    "id": "8gccIjcuf3gvaoEr",
                    "name": "OpenAi account"
                }
            },
            "typeVersion": 1
        },
        {
            "id": "cda47bb7-36c5-4d15-a1ef-0c66b1194825",
            "name": "Merge",
            "type": "n8n-nodes-base.merge",
            "position": [
                1160,
                980
            ],
            "parameters": {
                "mode": "combine",
                "options": [],
                "combineBy": "combineByPosition"
            },
            "typeVersion": 3
        },
        {
            "id": "28144e4c-e425-428d-b3d1-f563bfd4e5b3",
            "name": "Summarization Chain",
            "type": "@n8n\/n8n-nodes-langchain.chainSummarization",
            "position": [
                720,
                1000
            ],
            "parameters": {
                "options": [],
                "operationMode": "documentLoader"
            },
            "typeVersion": 2
        }
    ],
    "pinData": [],
    "connections": {
        "Merge": {
            "main": [
                [
                    {
                        "node": "Clean up",
                        "type": "main",
                        "index": 0
                    }
                ]
            ]
        },
        "Extract title": {
            "main": [
                [
                    {
                        "node": "Merge",
                        "type": "main",
                        "index": 0
                    }
                ]
            ]
        },
        "Fetch essay list": {
            "main": [
                [
                    {
                        "node": "Extract essay names",
                        "type": "main",
                        "index": 0
                    }
                ]
            ]
        },
        "Limit to first 3": {
            "main": [
                [
                    {
                        "node": "Fetch essay texts",
                        "type": "main",
                        "index": 0
                    }
                ]
            ]
        },
        "Fetch essay texts": {
            "main": [
                [
                    {
                        "node": "Extract title",
                        "type": "main",
                        "index": 0
                    },
                    {
                        "node": "Summarization Chain",
                        "type": "main",
                        "index": 0
                    }
                ]
            ]
        },
        "OpenAI Chat Model1": {
            "ai_languageModel": [
                [
                    {
                        "node": "Summarization Chain",
                        "type": "ai_languageModel",
                        "index": 0
                    }
                ]
            ]
        },
        "Default Data Loader": {
            "ai_document": [
                [
                    {
                        "node": "Summarization Chain",
                        "type": "ai_document",
                        "index": 0
                    }
                ]
            ]
        },
        "Extract essay names": {
            "main": [
                [
                    {
                        "node": "Split out into items",
                        "type": "main",
                        "index": 0
                    }
                ]
            ]
        },
        "Summarization Chain": {
            "main": [
                [
                    {
                        "node": "Merge",
                        "type": "main",
                        "index": 1
                    }
                ]
            ]
        },
        "Split out into items": {
            "main": [
                [
                    {
                        "node": "Limit to first 3",
                        "type": "main",
                        "index": 0
                    }
                ]
            ]
        },
        "When clicking \"Execute Workflow\"": {
            "main": [
                [
                    {
                        "node": "Fetch essay list",
                        "type": "main",
                        "index": 0
                    }
                ]
            ]
        },
        "Recursive Character Text Splitter": {
            "ai_textSplitter": [
                [
                    {
                        "node": "Default Data Loader",
                        "type": "ai_textSplitter",
                        "index": 0
                    }
                ]
            ]
        }
    }
}

How to Use an n8n Template

1

Create a New Workflow

Click "New Workflow" in your n8n dashboard to get started.

2

Copy & Paste Template

First, copy this template: Click here to copy the JSON .
Then, in n8n, click the three dots (···) → "Import from file" and paste the JSON code.

3

Customize the Nodes

Go through each node in the workflow to update inputs like spreadsheet IDs, email addresses, or message content. Adjust field mappings to match your data.

4

Grant Access

For nodes that connect to external apps (like Google Sheets or Slack), you'll need to grant access. Connect your accounts using OAuth or an API key and save the credentials in the node.

5

Test It

Run the workflow by clicking "Execute Node" for each step or "Run Once" for the whole thing. Check the right sidebar to inspect data and debug any errors (they'll show up in red).

6

Activate Workflow

Once everything works as expected, click the "Activate" toggle to turn your workflow on. You're all set!