Drafter AI | Docs
ProductChangelogStatus
  • Getting Started
  • Build
    • Workflow Basics
      • Workflow Builder Overview
      • Builder: Input
      • Builder: Action
      • Builder: Filter
    • Create workflow
      • Launch workflow
      • Design output
      • Export output results
    • Align Pricing Model
  • Launch
    • Apply your brand
    • Connect a custom domain
    • Invite users
  • Scale
    • Process inputs in batches
    • Organize internal knowledge base
    • Template GPT3 prompts
    • Integrate Drafter AI to other apps with Make.com
    • Get started with developer API
  • Maintain
    • Restore output results
    • Export/Import workflows
    • Debug workflows
  • Building blocks
    • Inputs
      • Manual / Form
    • Actions
      • Text Generation (GPT)
      • Web Data Extraction
      • Sentiment Recognition
      • Key Phrases Extraction
      • Classification
      • Entity Extraction
      • Public Search
      • Internal Search
      • Translation
      • PDF Recognition
      • Regular Expression
      • Template
      • Twitter Search Data
      • Twitter Profile Data
      • Youtube Captions Extraction (soon)
      • Speech Recognition (Soon)
      • Speech Generation (Soon)
      • Image Generation (Soon)
      • Video Generation (Soon)
    • Flow Controls
      • Filter
      • Aggregate
      • Split
  • Direct Integrations
    • Make.com *
    • Google Sheets
    • Slack (soon)
    • Zapier (soon)
Powered by GitBook
On this page
  • Overview
  • How to Setup
  • Inputs and Outputs
  1. Building blocks
  2. Actions

Web Data Extraction

Extracts metadata and full page content of the web page.

PreviousText Generation (GPT)NextSentiment Recognition

Last updated 2 years ago

Overview

Web Data Extraction is a powerful block that allows you to extract metadata and full-page content from web pages. It is useful for a variety of applications, including web scraping, data mining, and content analysis.

It is highly customizable, allowing you to specify the types of data you want to extract and the format in which you want to receive it.

Whether you are a researcher, marketer, or data analyst, Web Data Extraction can help you extract valuable insights from the web.

Works best with web search components and knowledge extraction.

How to Setup

  1. Provide an input - URL data point. You can get it from another action block or manual input block.

  2. Select the output data points you want to extract.

Web content that has more than 4,000 characters (around 1000 English words) can't be processed live during the workflow.

To make using longer web page content possible, collect it to the "Documents" section first, and then perform "Internal Search" across it from the workflow.

Inputs and Outputs

Input
Output
Output Description

Target link (URL)

Meta Title (Text)

Title of the web page

Meta Description (Text)

Description of the page

Meta Image (Image)

Social media image of the page

Full-page text (Text)

Texts extracted from the full-page HTML, structured by headlines or paragraphs

Full page HTML (Text)

HTML of the target web page

Links to media on the page (URL) - Soon

Links to all the images or videos from the page

Links to other pages/websites on the page (URL) - Soon

Links to all other web pages from the target page

Get URL Block