Class: Google::Cloud::AIPlatform::V1::ResourcePool::AutoscalingSpec

Inherits:
Object
Extended by:
Protobuf::MessageExts::ClassMethods
Includes:
Protobuf::MessageExts
Defined in:
proto_docs/google/cloud/aiplatform/v1/persistent_resource.rb

Overview

The min/max number of replicas allowed if enabling autoscaling

Instance Attribute Summary

Instance Attribute Details

#max_replica_count::Integer

Returns Optional. The maximum number of replicas in the node pool. Must be ≥ replica_count and > min_replica_count; otherwise an error is thrown.

Returns:

  • (::Integer)

    Optional. The maximum number of replicas in the node pool. Must be ≥ replica_count and > min_replica_count; otherwise an error is thrown.



# File 'proto_docs/google/cloud/aiplatform/v1/persistent_resource.rb', line 189

class AutoscalingSpec
  include ::Google::Protobuf::MessageExts
  extend ::Google::Protobuf::MessageExts::ClassMethods
end

#min_replica_count::Integer

Returns Optional. The minimum number of replicas in the node pool. Must be ≤ replica_count and < max_replica_count; otherwise an error is thrown. For autoscaling-enabled Ray on Vertex, the min_replica_count of a resource_pool may be 0, to match OSS Ray behavior (https://docs.ray.io/en/latest/cluster/vms/user-guides/configuring-autoscaling.html#cluster-config-parameters). For a Persistent Resource, min_replica_count must be > 0; a corresponding validation is implemented in CreatePersistentResourceRequestValidator.java.

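Returns:

  • (::Integer)

    Optional. The minimum number of replicas in the node pool. Must be ≤ replica_count and < max_replica_count; otherwise an error is thrown. For autoscaling-enabled Ray on Vertex, the min_replica_count of a resource_pool may be 0, to match OSS Ray behavior (https://docs.ray.io/en/latest/cluster/vms/user-guides/configuring-autoscaling.html#cluster-config-parameters). For a Persistent Resource, min_replica_count must be > 0; a corresponding validation is implemented in CreatePersistentResourceRequestValidator.java.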


# File 'proto_docs/google/cloud/aiplatform/v1/persistent_resource.rb', line 189

class AutoscalingSpec
  include ::Google::Protobuf::MessageExts
  extend ::Google::Protobuf::MessageExts::ClassMethods
end
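
Example

The constraints documented for the two attributes can be checked locally before a request is sent. The following is a minimal sketch in plain Ruby, not part of this library: the helper name autoscaling_violations and the allow_zero_min flag are hypothetical, and the authoritative validation is whatever the service performs. The sketch assumes that replica_count is the value from the enclosing resource pool and that a caller sets allow_zero_min only for autoscaling-enabled Ray on Vertex.

```ruby
# Hypothetical client-side helper; it is NOT provided by the library.
# Returns an array of violation messages (empty when the values are consistent
# with the documented constraints).
def autoscaling_violations(replica_count:, min_replica_count:, max_replica_count:,
                           allow_zero_min: false)
  violations = []

  # Documented: min_replica_count <= replica_count <= max_replica_count.
  if min_replica_count > replica_count
    violations << "min_replica_count must be <= replica_count"
  end
  if max_replica_count < replica_count
    violations << "max_replica_count must be >= replica_count"
  end

  # Documented: min_replica_count is strictly less than max_replica_count.
  unless min_replica_count < max_replica_count
    violations << "min_replica_count must be < max_replica_count"
  end

  # Documented: 0 is allowed only for autoscaling-enabled Ray on Vertex.
  # allow_zero_min is an assumed flag that models that case.
  lower_bound = allow_zero_min ? 0 : 1
  if min_replica_count < lower_bound
    violations << "min_replica_count must be >= #{lower_bound}"
  end

  violations
end

p autoscaling_violations(replica_count: 2, min_replica_count: 1, max_replica_count: 4)
p autoscaling_violations(replica_count: 0, min_replica_count: 0, max_replica_count: 3)
```

Returning a list of messages instead of raising on the first failure lets a caller report every inconsistent field at once; this is a design choice of the sketch, not behavior of the API.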