Class: Google::Cloud::AIPlatform::V1::ResourcePool::AutoscalingSpec
- Inherits:
-
Object
- Object
- Google::Cloud::AIPlatform::V1::ResourcePool::AutoscalingSpec
- Extended by:
- Protobuf::MessageExts::ClassMethods
- Includes:
- Protobuf::MessageExts
- Defined in:
- proto_docs/google/cloud/aiplatform/v1/persistent_resource.rb
Overview
The min/max number of replicas allowed if enabling autoscaling
Instance Attribute Summary collapse
-
#max_replica_count ⇒ ::Integer
Optional.
-
#min_replica_count ⇒ ::Integer
Optional.
Instance Attribute Details
#max_replica_count ⇒ ::Integer
Returns Optional. max replicas in the node pool, must be ≥ replica_count and > min_replica_count or will throw error.
189 190 191 192 |
# File 'proto_docs/google/cloud/aiplatform/v1/persistent_resource.rb', line 189 class AutoscalingSpec include ::Google::Protobuf::MessageExts extend ::Google::Protobuf::MessageExts::ClassMethods end |
#min_replica_count ⇒ ::Integer
Returns Optional. min replicas in the node pool, must be ≤ replica_count and < max_replica_count or will throw error. For autoscaling enabled Ray-on-Vertex, we allow min_replica_count of a resource_pool to be 0 to match the OSS Ray behavior(https://docs.ray.io/en/latest/cluster/vms/user-guides/configuring-autoscaling.html#cluster-config-parameters). As for Persistent Resource, the min_replica_count must be > 0, we added a corresponding validation inside CreatePersistentResourceRequestValidator.java.
189 190 191 192 |
# File 'proto_docs/google/cloud/aiplatform/v1/persistent_resource.rb', line 189 class AutoscalingSpec include ::Google::Protobuf::MessageExts extend ::Google::Protobuf::MessageExts::ClassMethods end |