confluent-skill-creator

Create Confluent-specific skills for external users. Use this skill when users want to create, build, or author a new skill related to Confluent Cloud, Confluent Platform, Apache Kafka, WarpStream, Flink, Connectors, Schema Registry, Tableflow, CDC pipelines, or any Confluent product. Skills can be use-case focused (like data enrichment, CDC to Tableflow, stream processing workflows) or component-specific (like a Flink skill, Schema Registry skill, or Connector skill). Do NOT use this skill when users want to directly use Confluent products (e.g., build a pipeline, write a producer, deploy Flink SQL) — use the appropriate product-specific skill instead. This skill is specifically for creating new skills, not for using existing ones.

Provider: Confluent | the Data Streaming Platform Path in repo: skills/confluent-skill-creator/SKILL.md

Skill body

Confluent Skill Creator

A specialized skill creator for building Confluent product skills that are tested against real Confluent environments (local, platform, cloud, or WarpStream).

This skill extends the base skill-creator workflow with Confluent-specific requirements:

When to use this skill

Use this skill when a user wants to create a skill for external users that involves:

Core Workflow

  1. Understand scope and platform targeting (Confluent-specific)
  2. Capture intent and interview
  3. Write the skill draft
  4. Validate spec compliance (Confluent-specific)
  5. Set up test environment and credentials (Confluent-specific)
  6. Create test cases and run E2E tests (Confluent-specific)
  7. Evaluate with viewer
  8. Iterate until satisfied
  9. Optimize description
  10. Package and deliver

Skill Anatomy and Progressive Disclosure

Apply these structural principles to every skill you create. They extend the base skill-creator guidance; the spec-compliance checks in Step 4 enforce the specifics, while this section is the mental model.

Anatomy of a skill

skill-name/
├── SKILL.md (required)
│   ├── YAML frontmatter (name, description required)
│   └── Markdown instructions
└── Bundled resources (optional)
    ├── scripts/     — executable code for deterministic/repetitive tasks
    ├── references/  — docs loaded into context as needed
    └── assets/      — files used in output (templates, icons, fonts)

Progressive disclosure — a three-level loading system

Skills load context in three tiers, so the model only pays for what it needs:

  1. Metadata (name + description) — always in context (~100 words). This is the trigger; invest here.
  2. SKILL.md body — loaded whenever the skill triggers (<500 lines ideal).
  3. Bundled resources — loaded only as needed (unlimited; scripts can execute without being read into context).

Word counts are approximate — go longer when a skill genuinely needs it.

Key patterns:

Domain organization

When a skill spans multiple domains/frameworks, organize by variant and let the model read only the relevant file — the same pattern as cross-platform Confluent skills (references/confluent-cloud.md, references/warpstream.md, …):

cloud-deploy/
├── SKILL.md          # workflow + variant selection
└── references/
    ├── aws.md
    ├── gcp.md
    └── azure.md

Principle of lack of surprise

A skill’s behavior must not surprise a user who has read its description. Skills must never contain malware, exploit code, or anything that could compromise system security, and must not facilitate unauthorized access, data exfiltration, or other malicious activity. Benign creative framing (e.g. “roleplay as X”) is fine — misleading or malicious skills are not.


Step 1: Understand Scope and Platform Targeting

Skill scope

Understand what the user wants to build. Skills can be either use-case focused or component-specific — both are valid.

Use-case focused — solves a specific end-to-end problem:

Component-specific — covers a Confluent component’s capabilities:

If the user’s request is ambiguous, ask:

“What should this skill cover? It can be a specific use case (e.g., ‘CDC pipeline from Postgres to Iceberg’) or focused on a component (e.g., ‘Flink SQL skill’). What’s the scope you have in mind?”

Platform targeting

After understanding scope, determine platform targeting:

“Which platforms should this skill support?

  1. All supported platforms (Confluent Cloud, Confluent Platform, Apache Kafka, WarpStream)
  2. Specific platform only (e.g., Confluent Cloud only, WarpStream only)”

If platform-specific:

If cross-platform:

Once you understand scope and platform, proceed to the interview.


Step 2-3: Capture Intent and Write Skill

Follow the base skill-creator process for capturing intent and writing the skill draft. Ask about:

  1. What should this skill enable users to do?
  2. When should it trigger? (phrases users would say)
  3. What components are involved? (topics, schemas, Flink, connectors, etc.)
  4. How should the skill interact with Confluent? Ask the user:

    “How should this skill interact with Confluent?

    1. MCP tools (via Confluent MCP server — discover, manage, operate Confluent resources)
    2. CLI (Confluent CLI — confluent kafka topic create, confluent flink statement create, etc.)
    3. REST APIs (Confluent Cloud API, Schema Registry API, Kafka REST Proxy)
    4. Client libraries (confluent-kafka-python, Java clients, etc.)
    5. Combination (e.g., MCP for discovery + CLI for provisioning + client libs for data)”

    Based on the answer, the created skill must include:

    • The specific tool/CLI/API calls required, in execution order
    • Any prerequisite setup (MCP server config, CLI login, API key creation)
    • Error handling for each interaction method
  5. What’s the expected output? (working pipeline, sample data, SQL statements, etc.)

When writing the skill:

Plan-before-execute requirement

The created skill MUST include a plan-before-execute step. Before the skill performs any operations (creating topics, registering schemas, deploying Flink statements, etc.), it must:

  1. Present a numbered plan of all operations to the user
  2. List which tools/CLI commands/API calls will be made and in what order
  3. Wait for explicit user confirmation before proceeding
  4. Never execute resource-modifying operations without approval

See plan-template.md for the plan format with interaction method examples.


Step 4: Validate Spec Compliance

Before proceeding to testing, validate the created skill against the Agent Skills specification.

Frontmatter validation

Trigger overlap check

Before finalizing the description, check for unintended overlap with existing skills in the repo:

  1. Scan skills/*/SKILL.md in the target repo and extract each skill’s description: field
  2. Compare the new skill’s description keywords against existing skills
  3. If significant overlap exists (≥2 non-generic keywords shared), verify the boundary is clear — a user prompt should unambiguously trigger only one skill, not both
  4. If the boundary is ambiguous, refine the new skill’s description to make the scope distinction clear, or add explicit exclusions where needed

Structure validation

Required artifacts

Run validation

# If skills-ref is available
skills-ref validate ./created-skill

If validation fails: fix issues and re-validate. Do not proceed to testing until the skill passes.


Step 5: Set Up Test Environment and Credentials

Choose test environment

Ask the user which environment to test against:

“Which Confluent environment should we test against?

  1. Local Confluent (Docker, for development)
  2. Confluent Platform (self-managed on-prem)
  3. Confluent Cloud (managed SaaS)
  4. WarpStream (Kafka-compatible, object-storage-backed)

Note: If this skill targets a specific platform, we must test against that platform.”

Important rule: If the skill is designed for a specific platform, tests MUST run against that platform.

Security: credential file handling

Credentials may live in a .env file or a YAML file (credentials.yaml) — the rules below apply to whichever format the skill uses.

CRITICAL: Never read, display, or log credential files (.env or credentials.yaml) or their contents. See confluent-best-practices.md for the full security policy. Key rules:

Set up credentials

See credentials reference for environment-specific credential templates — available in both .env and YAML (credentials.yaml) form — covering Kafka, Schema Registry, Flink, Tableflow, and Connectors. Pick whichever format fits the skill; YAML is convenient for nested/structured config, .env for flat shell-style variables.


Step 6: Create Test Cases and Run E2E Tests

Test case structure

Each test case should exercise the full use case end-to-end. See plan-template.md for the CRUD operations plan format. Present the plan and wait for user confirmation before creating any resources.

Schema Registry defaults

Before running Flink operations, verify a compute pool is available:

python scripts/check_compute_pool.py --pool-id $FLINK_COMPUTE_POOL_ID

If no pool is available or capacity is full, notify the user and skip Flink tests.

Running the tests

Use the base skill-creator workflow for running tests (spawn subagents with skill vs without skill), but adapt for Confluent:

With-skill run:

Execute this task:
- Skill path: <path-to-skill>
- Task: <eval prompt>
- Environment: .env file at <workspace>/.env
- Save outputs to: <workspace>/iteration-<N>/eval-<ID>/with_skill/outputs/
- Outputs to save:
  - Consumed data from output topic
  - SQL statements executed (if applicable)
  - Schema definitions
  - Verification logs

Baseline run: Same prompt, no skill, same environment.

Bundled scripts

For common operations, use the bundled scripts in scripts/:

Include relevant scripts in the created skill’s scripts/ directory when packaging.


Step 7-8: Evaluate and Iterate

Follow the base skill-creator evaluation workflow:

  1. While tests run, draft assertions (if applicable)
  2. Capture timing data when runs complete
  3. Grade each run
  4. Aggregate benchmark
  5. Launch eval viewer
  6. Review feedback with user
  7. Improve skill based on feedback

Confluent-specific assertions — good assertions check:


Step 9-10: Optimize Description and Package

Description optimization

After the skill works well, optimize the triggering description. Focus on:

Packaging

Bundle the skill with:

Validation gate: Before delivering, run skills-ref validate ./created-skill one final time. Do not deliver a skill that fails validation.

PR submission checklist

When the skill is being added to a Confluent skills repo (e.g., confluentinc/agent-skills), ensure:

  1. README updated — add the new skill to the repo’s skill table in README.md
  2. Evals pass at 90%+ — run evals and include the score in the PR description
  3. SME reviewer — identify and assign a subject-matter expert for the skill’s domain
  4. DTX/DevRel reviewer — assign a reviewer from the DTX or Developer Advocates team

These are required gates in the PR template and will be checked by the confluent-skill-reviewer.


Reference Files

Skill frontmatter

compatibility: Requires Python 3.9+, confluent-kafka, fastavro, requests. Needs access to a Confluent environment (Cloud, Platform, local Docker, or WarpStream) for E2E testing. metadata: {"author" => "confluent", "version" => "1.0", "last_updated" => "2026-05-12"}