Apache Nutch · JSON Structure

Apache Nutch Service Config Structure

Configuration for service operations such as CommonCrawl data dumps.

Type: object · Properties: 3 · Required: 1
Tags: Web Crawler, Indexing, Search, Apache, Java, Hadoop, Open Source

ServiceConfig is a JSON Structure definition published by Apache Nutch. It describes 3 properties, of which only crawlId is required, and it conforms to the https://json-structure.org/meta/core/v0/# meta-schema.

Properties

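crawlId (string, required): The crawl identifier.
confId (string): The configuration ID.
args (object): Additional arguments for the service operation. Arbitrary additional properties are allowed.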
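
As a rough illustration, the three properties could be mirrored in client code by a small Python dataclass. The class name `ServiceConfig` is taken from the schema's `name`. The snake_case field names, the `to_payload` helper, and the choice to omit an unset `confId` are assumptions made for this sketch and are not part of Apache Nutch.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Optional


@dataclass
class ServiceConfig:
    """Hypothetical in-memory mirror of the ServiceConfig structure."""

    crawl_id: str                  # "crawlId": the only required property
    conf_id: Optional[str] = None  # "confId": optional configuration ID
    args: Dict[str, Any] = field(default_factory=dict)  # "args": free-form object

    def to_payload(self) -> Dict[str, Any]:
        """Build a JSON-ready dict using the property names from the schema."""
        payload: Dict[str, Any] = {
            "crawlId": self.crawl_id,
            "args": dict(self.args),
        }
        # Assumption: leave "confId" out entirely when it was never set.
        if self.conf_id is not None:
            payload["confId"] = self.conf_id
        return payload
```

Calling `ServiceConfig(crawl_id="crawl-01", conf_id="default").to_payload()` yields a dict with the same three keys as the schema's own example.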

Meta-schema: https://json-structure.org/meta/core/v0/#

JSON Structure

{
  "$schema": "https://json-structure.org/meta/core/v0/#",
  "$id": "https://raw.githubusercontent.com/api-evangelist/apache-nutch/refs/heads/main/json-structure/apache-nutch-service-config-structure.json",
  "name": "ServiceConfig",
  "description": "Configuration for service operations such as CommonCrawl data dumps.",
  "type": "object",
  "properties": {
    "crawlId": {
      "type": "string",
      "description": "The crawl identifier."
    },
    "confId": {
      "type": "string",
      "description": "The configuration ID."
    },
    "args": {
      "type": "object",
      "additionalProperties": true,
      "description": "Additional arguments for the service operation."
    }
  },
  "required": [
    "crawlId"
  ],
  "example": {
    "crawlId": "crawl-01",
    "confId": "default",
    "args": {}
  }
}
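
The constraints in this definition are simple enough to check by hand. The following is a minimal sketch using only the Python standard library; it is not a JSON Structure validator, and the function name `check_service_config` is hypothetical. It checks only what the definition states: `crawlId` is required, `crawlId` and `confId` are strings, and `args` is an object. Because the definition does not say whether unknown top-level properties are allowed, the sketch ignores them.

```python
import json
from typing import Any, List


def check_service_config(payload: Any) -> List[str]:
    """Return a list of problems; an empty list means no problem was found."""
    if not isinstance(payload, dict):
        return ["payload must be a JSON object"]
    errors: List[str] = []
    # "crawlId" is the only entry in the schema's "required" array.
    if "crawlId" not in payload:
        errors.append("missing required property: crawlId")
    # Both identifiers are declared with type "string".
    for key in ("crawlId", "confId"):
        if key in payload and not isinstance(payload[key], str):
            errors.append(f"{key} must be a string")
    # "args" is an object that accepts arbitrary additional properties,
    # so only its outer type is checked.
    if "args" in payload and not isinstance(payload["args"], dict):
        errors.append("args must be an object")
    return errors


# The example document embedded in the schema passes the check.
example = json.loads('{"crawlId": "crawl-01", "confId": "default", "args": {}}')
print(check_service_config(example))        # []
# A payload without crawlId and with a numeric confId reports two problems.
print(check_service_config({"confId": 7}))
```

Returning a list of messages instead of raising on the first failure lets a caller report every problem in one pass.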