As of January 1, 2020 this library no longer supports Python 2 on the latest released version. Library versions released prior to that date will continue to be available. For more information please visit Python 2 support on Google Cloud.

Types for Google Cloud Dataflow v1beta3 API¶

class google.cloud.dataflow_v1beta3.types.AutoscalingAlgorithm(value)[source]¶

Bases: proto.enums.Enum

Specifies the algorithm used to determine the number of worker processes to run at any given point in time, based on the amount of data left to process, the number of workers, and how quickly existing workers are processing data.

Values:

AUTOSCALING_ALGORITHM_UNKNOWN (0):: The algorithm is unknown, or unspecified.
AUTOSCALING_ALGORITHM_NONE (1):: Disable autoscaling.
AUTOSCALING_ALGORITHM_BASIC (2):: Increase worker count over time to reduce job execution time.

class google.cloud.dataflow_v1beta3.types.AutoscalingEvent(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

A structured message reporting an autoscaling decision made by the Dataflow service.

current_num_workers¶

The current number of workers the job has.

Type: int

target_num_workers¶

The target number of workers the worker pool wants to resize to use.

Type: int

event_type¶

The type of autoscaling event to report.

Type: google.cloud.dataflow_v1beta3.types.AutoscalingEvent.AutoscalingEventType

description¶

A message describing why the system decided to adjust the current number of workers, why it failed, or why the system decided to not make any changes to the number of workers.

Type: google.cloud.dataflow_v1beta3.types.StructuredMessage

time¶

The time this event was emitted to indicate a new target or current num_workers value.

Type: google.protobuf.timestamp_pb2.Timestamp

worker_pool¶

A short and friendly name for the worker pool this event refers to.

Type: str

class AutoscalingEventType(value)[source]¶

Bases: proto.enums.Enum

Indicates the type of autoscaling event.

Values:

TYPE_UNKNOWN (0):: Default type for the enum. Value should never be returned.
TARGET_NUM_WORKERS_CHANGED (1):: The TARGET_NUM_WORKERS_CHANGED type should be used when the target worker pool size has changed at the start of an actuation. An event should always be specified as TARGET_NUM_WORKERS_CHANGED if it reflects a change in the target_num_workers.
CURRENT_NUM_WORKERS_CHANGED (2):: The CURRENT_NUM_WORKERS_CHANGED type should be used when actual worker pool size has been changed, but the target_num_workers has not changed.
ACTUATION_FAILURE (3):: The ACTUATION_FAILURE type should be used when we want to report an error to the user indicating why the current number of workers in the pool could not be changed. Displayed in the current status and history widgets.
NO_CHANGE (4):: Used when we want to report to the user a reason why we are not currently adjusting the number of workers. Should specify both target_num_workers, current_num_workers and a decision_message.

class google.cloud.dataflow_v1beta3.types.AutoscalingSettings(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Settings for WorkerPool autoscaling.

algorithm¶

The algorithm to use for autoscaling.

Type: google.cloud.dataflow_v1beta3.types.AutoscalingAlgorithm

max_num_workers¶

The maximum number of workers to cap scaling at.

Type: int

class google.cloud.dataflow_v1beta3.types.BigQueryIODetails(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Metadata for a BigQuery connector used by the job.

table¶

Table accessed in the connection.

Type: str

dataset¶

Dataset accessed in the connection.

Type: str

project_id¶

Project accessed in the connection.

Type: str

query¶

Query used to access data in the connection.

Type: str

class google.cloud.dataflow_v1beta3.types.BigTableIODetails(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Metadata for a Cloud Bigtable connector used by the job.

project_id¶

ProjectId accessed in the connection.

Type: str

instance_id¶

InstanceId accessed in the connection.

Type: str

table_id¶

TableId accessed in the connection.

Type: str

class google.cloud.dataflow_v1beta3.types.CheckActiveJobsRequest(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Request to check is active jobs exists for a project

project_id¶

The project which owns the jobs.

Type: str

class google.cloud.dataflow_v1beta3.types.CheckActiveJobsResponse(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Response for CheckActiveJobsRequest.

active_jobs_exist¶

If True, active jobs exists for project. False otherwise.

Type: bool

class google.cloud.dataflow_v1beta3.types.ComputationTopology(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

All configuration data for a particular Computation.

system_stage_name¶

The system stage name.

Type: str

computation_id¶

The ID of the computation.

Type: str

key_ranges¶

The key ranges processed by the computation.

Type: MutableSequence[google.cloud.dataflow_v1beta3.types.KeyRangeLocation]

inputs¶

The inputs to the computation.

Type: MutableSequence[google.cloud.dataflow_v1beta3.types.StreamLocation]

outputs¶

The outputs from the computation.

Type: MutableSequence[google.cloud.dataflow_v1beta3.types.StreamLocation]

state_families¶

The state family values.

Type: MutableSequence[google.cloud.dataflow_v1beta3.types.StateFamilyConfig]

class google.cloud.dataflow_v1beta3.types.ContainerSpec(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Container Spec.

image¶

Name of the docker container image. E.g., gcr.io/project/some-image

Type: str

metadata¶

Metadata describing a template including description and validation rules.

Type: google.cloud.dataflow_v1beta3.types.TemplateMetadata

sdk_info¶

Required. SDK info of the Flex Template.

Type: google.cloud.dataflow_v1beta3.types.SDKInfo

default_environment¶

Default runtime environment for the job.

Type: google.cloud.dataflow_v1beta3.types.FlexTemplateRuntimeEnvironment

class google.cloud.dataflow_v1beta3.types.CreateJobFromTemplateRequest(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

A request to create a Cloud Dataflow job from a template.

project_id¶

Required. The ID of the Cloud Platform project that the job belongs to.

Type: str

job_name¶

Required. The job name to use for the created job.

Type: str

gcs_path¶

Required. A Cloud Storage path to the template from which to create the job. Must be a valid Cloud Storage URL, beginning with gs://.

This field is a member of oneof template.

Type: str

parameters¶

The runtime parameters to pass to the job.

Type: MutableMapping[str, str]

environment¶

The runtime environment for the job.

Type: google.cloud.dataflow_v1beta3.types.RuntimeEnvironment

location¶

The [regional endpoint] (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) to which to direct the request.

Type: str

class ParametersEntry(mapping=None, *, ignore_unknown_fields=False, **kwargs)¶: Bases: proto.message.Message

class google.cloud.dataflow_v1beta3.types.CreateJobRequest(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Request to create a Cloud Dataflow job.

project_id¶

The ID of the Cloud Platform project that the job belongs to.

Type: str

job¶

The job to create.

Type: google.cloud.dataflow_v1beta3.types.Job

view¶

The level of information requested in response.

Type: google.cloud.dataflow_v1beta3.types.JobView

replace_job_id¶

Deprecated. This field is now in the Job message.

Type: str

location¶

The [regional endpoint] (https://cloud.google.com/dataflow/docs/concepts/regional-endpoints) that contains this job.

Type: str

class google.cloud.dataflow_v1beta3.types.CustomSourceLocation(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Identifies the location of a custom souce.

stateful¶

Whether this source is stateful.

Type: bool

class google.cloud.dataflow_v1beta3.types.DataDiskAssignment(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Data disk assignment for a given VM instance.

vm_instance¶

VM instance name the data disks mounted to, for example “myproject-1014-104817-4c2-harness-0”.

Type: str

data_disks¶

Mounted data disks. The order is important a data disk’s 0-based index in this list defines which persistent directory the disk is mounted to, for example the list of { “myproject-1014-104817-4c2-harness-0-disk-0” }, { “myproject-1014-104817-4c2-harness-0-disk-1” }.

Type: MutableSequence[str]

class google.cloud.dataflow_v1beta3.types.DatastoreIODetails(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Metadata for a Datastore connector used by the job.

namespace¶

Namespace used in the connection.

Type: str

project_id¶

ProjectId accessed in the connection.

Type: str

class google.cloud.dataflow_v1beta3.types.DebugOptions(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Describes any options that have an effect on the debugging of pipelines.

enable_hot_key_logging¶

When true, enables the logging of the literal hot key to the user’s Cloud Logging.

Type: bool

class google.cloud.dataflow_v1beta3.types.DefaultPackageSet(value)[source]¶

Bases: proto.enums.Enum

The default set of packages to be staged on a pool of workers.

Values:

DEFAULT_PACKAGE_SET_UNKNOWN (0):: The default set of packages to stage is unknown, or unspecified.
DEFAULT_PACKAGE_SET_NONE (1):: Indicates that no packages should be staged at the worker unless explicitly specified by the job.
DEFAULT_PACKAGE_SET_JAVA (2):: Stage packages typically useful to workers written in Java.
DEFAULT_PACKAGE_SET_PYTHON (3):: Stage packages typically useful to workers written in Python.

class google.cloud.dataflow_v1beta3.types.DeleteSnapshotRequest(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Request to delete a snapshot.

project_id¶

The ID of the Cloud Platform project that the snapshot belongs to.

Type: str

snapshot_id¶

The ID of the snapshot.

Type: str

location¶

The location that contains this snapshot.

Type: str

class google.cloud.dataflow_v1beta3.types.DeleteSnapshotResponse(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Response from deleting a snapshot.

class google.cloud.dataflow_v1beta3.types.Disk(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Describes the data disk used by a workflow job.

size_gb¶

Size of disk in GB. If zero or unspecified, the service will attempt to choose a reasonable default.

Type: int

disk_type¶

Disk storage type, as defined by Google Compute Engine. This must be a disk type appropriate to the project and zone in which the workers will run. If unknown or unspecified, the service will attempt to choose a reasonable default.

For example, the standard persistent disk type is a resource name typically ending in “pd-standard”. If SSD persistent disks are available, the resource name typically ends with “pd-ssd”. The actual valid values are defined the Google Compute Engine API, not by the Cloud Dataflow API; consult the Google Compute Engine documentation for more information about determining the set of available disk types for a particular project and zone.

Google Compute Engine Disk types are local to a particular project in a particular zone, and so the resource name will typically look something like this:

compute.googleapis.com/projects/project-id/zones/zone/diskTypes/pd-standard

Type: str

mount_point¶

Directory in a VM where disk is mounted.

Type: str

class google.cloud.dataflow_v1beta3.types.DisplayData(mapping=None, *, ignore_unknown_fields=False, **kwargs)[source]¶

Bases: proto.message.Message

Data provided with a pipeline or transform to provide descriptive info.

This message has oneof fields (mutually exclusive fields). For each oneof, at most one member field can be set at the same time. Setting any member of the oneof automatically clears all other members.

key¶

The key identifying the display data. This is intended to be used as a label for the display data when viewed in a dax monitoring system.

Type: str

namespace¶

The namespace for the key. This is usually a class name or programming language namespace (i.e. python module) which defines the display data. This allows a dax monitoring system to specially handle the data and perform custom rendering.

Type: str

str_value¶

Contains value if the data is of string type.