Scrape SERP Block

Perform web searches and scrape search engine results pages (SERP)

The Scrape SERP block enables users to perform web searches and retrieve data from search engine results pages (SERP). This block is particularly useful for data collection and analysis tasks, allowing for the extraction of information based on specific search terms.

Configuration (Required)

Web Page Limit
integer

Defines the maximum number of web pages to retrieve from the search results. Adjust this limit based on the volume of data required. The default value is 1.

Chunk Limit
integer

Specifies the maximum number of data chunks to retrieve from each web page. This helps manage the amount of data processed. The default value is 1.

Search Term
stringRequired

The search term used to query the search engine. Ensure the term is well-defined to obtain relevant results. This input supports Jinja templating for dynamic query generation.

See Workflow Logic & State > State Management for details on using dynamic variables in this block.

Outputs

The Scrape SERP block outputs a list of search results obtained from the web, formatted as JSON. This structured output allows for further processing and integration within the workflow.

Usage Context

Use this block to perform web searches and retrieve search results for analysis or further processing. It is particularly useful when you need to gather data from search engines programmatically.

Best Practices

  • Ensure that the search term is well-defined to get the most relevant results.
  • Adjust the page result and chunk limits to balance between data volume and performance.
  • Use Jinja templating to dynamically construct search queries based on workflow state.
Built with