# Extract Data Node

> Scrapes structured data from the full page or the current visible area (independent of list loops).

**Overview**

The Extract Data node extracts structured data from the entire page or from the current visible area. It is a general-purpose page data collection tool: unlike Extract Data Item, it does not depend on Loop List and can be used on its own.

<img src="https://mintcdn.com/browseract/kgQ66U5Yt7ZcVFAM/images/Gemini_Generated_Image_y72nxxy72nxxy72n.png?fit=max&auto=format&n=kgQ66U5Yt7ZcVFAM&q=85&s=b8674638b68269214e65a37c55c4f4ef" alt="Gemini_Generated_Image_y72nxxy72nxxy72n.png" width="2752" height="1536" data-path="images/Gemini_Generated_Image_y72nxxy72nxxy72n.png" />

**Use Cases**

* Extracting all matching data from a full page at once (e.g., comment lists, article lists, product info).
* Extracting a set of fields from a detail page (e.g., multiple specification fields on a product detail page).
* Accumulating data from multiple pages when combined with [Pagination](/learn/basics/node-types/pagination-node) inside a [Loop](/learn/basics/node-types/loop-node).

**Core Capabilities**

* Supports two extraction scopes: **Full Page** or **Visible Area**.
* Uses natural-language descriptions of the fields to extract and where they appear on the page.
* Supports filtering conditions (filter by time, quantity, attributes, etc.).
* Can output results in formats such as JSON, CSV, XML, and Markdown (via the Output Data node).
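
To make the output formats concrete, here is a minimal Python sketch of how the same extracted records might look as JSON and as CSV. It does not call BrowserAct; in a workflow the Output Data node handles this conversion. The field names (`product_name`, `current_price`, `rating`) and the helper functions are hypothetical and chosen only for illustration.

```python
import csv
import io
import json

# Hypothetical records; the field names are illustrative, not a BrowserAct schema.
records = [
    {"product_name": "Desk Lamp", "current_price": 129.0, "rating": 4.5},
    {"product_name": "USB Hub", "current_price": 24.9, "rating": 4.1},
]


def to_json(rows):
    """Serialize the rows as a JSON array of objects."""
    return json.dumps(rows, indent=2)


def to_csv(rows):
    """Serialize the rows as CSV, using the first row's keys as the header."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=list(rows[0].keys()), lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


json_text = to_json(records)
csv_text = to_csv(records)
```

JSON keeps value types (numbers stay numbers), while CSV flattens every value to text, which is worth keeping in mind when choosing a format for downstream tools.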

**Configuration Steps**

1. Add an **Extract Data** node.
2. Select the extraction scope:
   * **Full Page:** Collect data from the entire page.
   * **Visible Area:** Collect data only from the current viewport.
3. In the field description, state clearly which content to extract. For example:
   * "Extract from product detail page: Product Name, Current Price, Original Price, Stock Status, Rating, Review Count, Seller Name, Shipping Info."
4. (Optional) Configure filtering criteria, for example:
   * "Only keep reviews from the last 30 days."
   * "Only extract products with a price greater than 100."
5. For cross-page extraction, place Extract Data inside a [Loop](/learn/basics/node-types/loop-node) and combine it with [Pagination](/learn/basics/node-types/pagination-node).

<img src="https://mintcdn.com/browseract/6AEAYv_wQ0nrrJJQ/images/PixPin_2025-12-04_14-18-10.png?fit=max&auto=format&n=6AEAYv_wQ0nrrJJQ&q=85&s=2c2dce6d424806fd33a90f92142da2fb" alt="PixPin_2025-12-04_14-18-10.png" width="607" height="657" data-path="images/PixPin_2025-12-04_14-18-10.png" />
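
The filtering criteria in step 4 are written in natural language, and this page does not specify how the node interprets them. As a rough illustration only, the two example filters could be read as the following Python predicates. The record fields (`date`, `price`), the function names, and the boundary choices (an inclusive 30-day window, a strict "greater than") are all assumptions made for this sketch.

```python
from datetime import date, timedelta


def keep_recent(reviews, today, days=30):
    """Keep reviews dated within the last `days` days (cutoff day included)."""
    cutoff = today - timedelta(days=days)
    return [review for review in reviews if review["date"] >= cutoff]


def keep_price_above(products, minimum=100):
    """Keep products whose price is strictly greater than `minimum`."""
    return [product for product in products if product["price"] > minimum]


# Hypothetical extracted records.
reviews = [
    {"text": "Works well", "date": date(2025, 11, 20)},
    {"text": "Stopped charging", "date": date(2025, 9, 1)},
]
products = [
    {"name": "Desk Lamp", "price": 129.0},
    {"name": "USB Hub", "price": 24.9},
    {"name": "Monitor Arm", "price": 100.0},
]

recent = keep_recent(reviews, today=date(2025, 12, 4))
expensive = keep_price_above(products)
```

Spelling out boundaries like these in the filter text itself (for example, "price strictly greater than 100") leaves less room for interpretation.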

**Recommendations**

* For pages with complex structures, consider using multiple Extract Data nodes to process different areas separately.
* Each Extract Data node performs a single extraction when it runs. For multi-page extraction, combine it with [Loop](/learn/basics/node-types/loop-node) + [Pagination](/learn/basics/node-types/pagination-node).
* Consider adding a [**Wait**](/learn/basics/node-types/wait-node) node before extraction to ensure dynamic content has loaded.
* Write detailed field descriptions that include each field's name and meaning, so the extracted data is easier to use and maintain later.
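
Because each Extract Data node extracts only once per run, multi-page collection comes from repeating that run inside a loop. The Python sketch below mirrors that pattern conceptually. It is not BrowserAct code: `extract_once`, `collect_all`, and the `pages` structure are hypothetical stand-ins for one Extract Data run, the Loop + Pagination combination, and the paginated site.

```python
def extract_once(page):
    """Stand-in for one Extract Data run: return the records found on one page."""
    return list(page["items"])


def collect_all(pages):
    """Stand-in for Loop + Pagination: extract once per page and accumulate."""
    collected = []
    for page in pages:  # each iteration mimics moving to the next page
        collected.extend(extract_once(page))
    return collected


# Hypothetical paginated content: two pages of article titles.
pages = [
    {"items": [{"title": "Article 1"}, {"title": "Article 2"}]},
    {"items": [{"title": "Article 3"}]},
]

all_records = collect_all(pages)
```

The single-page extraction stays simple, and the loop is the only part that knows about multiple pages.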
