Apache Nutch · JSON Structure

Apache Nutch Node Schema Structure

Schema describing the fields in a node reader response.

Type: object Properties: 6
Web CrawlerIndexingSearchApacheJavaHadoopOpen Source

NodeSchema is a JSON Structure definition published by Apache Nutch, describing 6 properties. It conforms to the https://json-structure.org/meta/core/v0/# meta-schema.

Properties

key_url num_inlinks num_outlinks inlink_score outlink_score metadata

Meta-schema: https://json-structure.org/meta/core/v0/#

JSON Structure

Raw ↑
{
  "$schema": "https://json-structure.org/meta/core/v0/#",
  "$id": "https://raw.githubusercontent.com/api-evangelist/apache-nutch/refs/heads/main/json-structure/apache-nutch-node-schema-structure.json",
  "name": "NodeSchema",
  "description": "Schema describing the fields in a node reader response.",
  "type": "object",
  "properties": {
    "key_url": {
      "type": "string",
      "example": "string"
    },
    "num_inlinks": {
      "type": "string",
      "example": "int"
    },
    "num_outlinks": {
      "type": "string",
      "example": "int"
    },
    "inlink_score": {
      "type": "string",
      "example": "float"
    },
    "outlink_score": {
      "type": "string",
      "example": "float"
    },
    "metadata": {
      "type": "string",
      "example": "string"
    }
  }
}