Skip to main content

Custom schema storage

By default, Pulsar stores data type schemas in Apache BookKeeper (which is deployed alongside Pulsar). You can, however, use another storage system if you wish. This doc walks you through creating your own schema storage implementation.

In order to use a non-default (i.e. non-BookKeeper) storage system for Pulsar schemas, you need to implement two Java interfaces: SchemaStorage and SchemaStorageFactory.

SchemaStorage interface

The SchemaStorage interface has the following methods:


public interface SchemaStorage {
// How schemas are updated
CompletableFuture<SchemaVersion> put(String key, byte[] value, byte[] hash);

// How schemas are fetched from storage
CompletableFuture<StoredSchema> get(String key, SchemaVersion version);

// How schemas are deleted
CompletableFuture<SchemaVersion> delete(String key);

// Utility method for converting a schema version byte array to a SchemaVersion object
SchemaVersion versionFromBytes(byte[] version);

// Startup behavior for the schema storage client
void start() throws Exception;

// Shutdown behavior for the schema storage client
void close() throws Exception;
}

For a full-fledged example schema storage implementation, see the BookKeeperSchemaStorage class.

SchemaStorageFactory interface


public interface SchemaStorageFactory {
@NotNull
SchemaStorage create(PulsarService pulsar) throws Exception;
}

For a full-fledged example schema storage factory implementation, see the BookKeeperSchemaStorageFactory class.

Deployment

In order to use your custom schema storage implementation, you'll need to:

  1. Package the implementation in a JAR file.
  2. Add that jar to the lib folder in your Pulsar binary or source distribution.
  3. Change the schemaRegistryStorageClassName configuration in broker.conf to your custom factory class (i.e. the SchemaStorageFactory implementation, not the SchemaStorage implementation).
  4. Start up Pulsar.