Apache Nutch · JSON Structure

Apache Nutch Nutch Config Structure

Configuration for creating a new Nutch configuration.

Type: object Properties: 3
Web CrawlerIndexingSearchApacheJavaHadoopOpen Source

NutchConfig is a JSON Structure definition published by Apache Nutch, describing 3 properties. It conforms to the https://json-structure.org/meta/core/v0/# meta-schema.

Properties

configId force params

Meta-schema: https://json-structure.org/meta/core/v0/#

JSON Structure

Raw ↑
{
  "$schema": "https://json-structure.org/meta/core/v0/#",
  "$id": "https://raw.githubusercontent.com/api-evangelist/apache-nutch/refs/heads/main/json-structure/apache-nutch-nutch-config-structure.json",
  "name": "NutchConfig",
  "description": "Configuration for creating a new Nutch configuration.",
  "type": "object",
  "properties": {
    "configId": {
      "type": "string",
      "description": "The identifier for this configuration."
    },
    "force": {
      "type": "boolean",
      "description": "If true, overwrites an existing configuration with the same ID.",
      "default": false
    },
    "params": {
      "type": "object",
      "additionalProperties": {
        "type": "string"
      },
      "description": "Key-value pairs of Nutch configuration properties."
    }
  },
  "example": {
    "configId": "my-config",
    "force": false,
    "params": {
      "http.agent.name": "MyBot",
      "http.robots.agents": "MyBot,*"
    }
  }
}