Apache PDFBox · JSON Structure

Apache Pdfbox Document Info Structure

DocumentInfo schema from Apache PDFBox

Type: object Properties: 5
Document ProcessingJavaPDFText ExtractionApacheOpen Source

DocumentInfo is a JSON Structure definition published by Apache PDFBox, describing 5 properties. It conforms to the https://json-structure.org/meta/core/v0/# meta-schema.

Properties

documentId title pageCount fileSize version

Meta-schema: https://json-structure.org/meta/core/v0/#

JSON Structure

Raw ↑
{
  "$schema": "https://json-structure.org/meta/core/v0/#",
  "$id": "https://raw.githubusercontent.com/api-evangelist/apache-pdfbox/refs/heads/main/json-structure/apache-pdfbox-document-info-structure.json",
  "description": "DocumentInfo schema from Apache PDFBox",
  "type": "object",
  "properties": {
    "documentId": {
      "type": "string",
      "example": "doc-abc123"
    },
    "title": {
      "type": "string",
      "example": "My Document"
    },
    "pageCount": {
      "type": "int32",
      "example": 10
    },
    "fileSize": {
      "type": "int32",
      "example": 204800
    },
    "version": {
      "type": "string",
      "example": "1.4"
    }
  },
  "name": "DocumentInfo"
}