GCS Delete Blobs

Tools

Description

Use this tool to delete blobs inside a Google Cloud Storage bucket.

This tool can delete multiple blobs at the same time. It can filter blobs with include rules, exclude rules, or metadata filtering.

To use this tool, define a Google Cloud Storage source from which you want to delete blobs.

Usage

  1. Add the process action TOOL GCS Delete Blobs from the Process Palette, under the Tools section.

  2. Select a source:

    • Drag and drop one of the following Google Cloud Storage metadata nodes onto the <SOURCE> field of the tool:

      • Storage

      • Bucket

      • Folder

  3. Set other tool parameters as needed.

The tool inherits parameters from the metadata node you drag onto it.

Parameters

Name Default Description

XPath Expression For Source

$SOURCE

A valid XPath expression referencing a Bucket to use as a source location. The expression can return a storage, bucket, or folder node from a Google Cloud Storage metadata object.

Source Bucket Name

Manual entry of the source bucket name.

You can omit this parameter if XPath Expression For Source returns a valid reference to a bucket or one of its children.

Source Directory Path

Manual entry of the source directory. You can omit this parameter if XPath Expression For Source returns a valid reference to a directory or one of its children, or if the bucket itself is the root directory.

For better performance, use a directory as the source, or set this parameter for any static subdirectories. For example, specify this:

  • Source Directory Path → tmp

  • Source Blob Includes → *.txt

instead of this:

  • Source Directory Path → <empty>

  • Source Blob Includes → tmp/*.txt

Source Blob Includes

A list of blobs to include in the operation, as a semicolon-separated list of blob masks. An empty value matches all blobs.

When the source is a directory, or if you set the Source Directory Path, the blob mask evaluates inside this directory.

The following wildcard characters are supported:

?

Matches one character in a segment of the blob’s path.

*

Matches zero or more characters in a segment of the blob’s path.

**

Matches zero or more segments of the blob’s path of the blob.

Examples:

  • to retrieve XML and JSON blobs in the current directory: *.xml;*.json

  • to retrieve XML blobs in any test subdirectory: **/test/*.xml

Source Blob Excludes

A list of blobs to exclude from the operation, as a semicolon-separated list of blob masks. An empty value matches all blobs.

When the source is a directory, or if you set the Source Directory Path, the blob mask evaluates inside this directory.

The following wildcard characters are supported:

?

Matches one character in a segment of the blob’s path.

*

Matches zero or more characters in a segment of the blob’s path.

**

Matches zero or more segments of the blob’s path of the blob.

Examples:

  • to ignore XML and JSON blobs in the current directory: *.xml;*.json

  • to ignore XML blobs in any test subdirectory: **/test/*.xml

Source Metadata

One or more key-value pairs to filter blobs based on their metadata in Google Cloud Storage. The tool only processes source blobs that match these values. You can set this parameter in the form of Java properties.

For instance:

metadata1=value1
metadata2=value2
#comment
metadata3=value3

For information about metadata in Google Cloud Storage, see the official documentation.