Apache Nutch · JSON Structure

Apache Nutch Child Node Structure

A child (outlink) of a fetched node.

Type: object Properties: 2
Web CrawlerIndexingSearchApacheJavaHadoopOpen Source

ChildNode is a JSON Structure definition published by Apache Nutch, describing 2 properties. It conforms to the https://json-structure.org/meta/core/v0/# meta-schema.

Properties

childUrl anchorText

Meta-schema: https://json-structure.org/meta/core/v0/#

JSON Structure

Raw ↑
{
  "$schema": "https://json-structure.org/meta/core/v0/#",
  "$id": "https://raw.githubusercontent.com/api-evangelist/apache-nutch/refs/heads/main/json-structure/apache-nutch-child-node-structure.json",
  "name": "ChildNode",
  "description": "A child (outlink) of a fetched node.",
  "type": "object",
  "properties": {
    "childUrl": {
      "type": "string",
      "description": "The URL of the child node."
    },
    "anchorText": {
      "type": "string",
      "description": "The anchor text of the link."
    }
  }
}