Class: Google::Cloud::AIPlatform::V1::Featurestore::OnlineServingConfig

Inherits:
Object
  • Object
show all
Extended by:
Protobuf::MessageExts::ClassMethods
Includes:
Protobuf::MessageExts
Defined in:
proto_docs/google/cloud/aiplatform/v1/featurestore.rb

Overview

OnlineServingConfig specifies the details for provisioning online serving resources.

Defined Under Namespace

Classes: Scaling

Instance Attribute Summary collapse

Instance Attribute Details

#fixed_node_count::Integer

Returns The number of nodes for the online store. The number of nodes doesn't scale automatically, but you can manually update the number of nodes. If set to 0, the featurestore will not have an online store and cannot be used for online serving.

Returns:

  • (::Integer)

    The number of nodes for the online store. The number of nodes doesn't scale automatically, but you can manually update the number of nodes. If set to 0, the featurestore will not have an online store and cannot be used for online serving.



99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
# File 'proto_docs/google/cloud/aiplatform/v1/featurestore.rb', line 99

class OnlineServingConfig
  include ::Google::Protobuf::MessageExts
  extend ::Google::Protobuf::MessageExts::ClassMethods

  # Online serving scaling configuration. If min_node_count and
  # max_node_count are set to the same value, the cluster will be configured
  # with the fixed number of node (no auto-scaling).
  # @!attribute [rw] min_node_count
  #   @return [::Integer]
  #     Required. The minimum number of nodes to scale down to. Must be greater
  #     than or equal to 1.
  # @!attribute [rw] max_node_count
  #   @return [::Integer]
  #     The maximum number of nodes to scale up to. Must be greater than
  #     min_node_count, and less than or equal to 10 times of 'min_node_count'.
  # @!attribute [rw] cpu_utilization_target
  #   @return [::Integer]
  #     Optional. The cpu utilization that the Autoscaler should be trying to
  #     achieve. This number is on a scale from 0 (no utilization) to 100
  #     (total utilization), and is limited between 10 and 80. When a cluster's
  #     CPU utilization exceeds the target that you have set, Bigtable
  #     immediately adds nodes to the cluster. When CPU utilization is
  #     substantially lower than the target, Bigtable removes nodes. If not set
  #     or set to 0, default to 50.
  class Scaling
    include ::Google::Protobuf::MessageExts
    extend ::Google::Protobuf::MessageExts::ClassMethods
  end
end

#scaling::Google::Cloud::AIPlatform::V1::Featurestore::OnlineServingConfig::Scaling

Returns Online serving scaling configuration. Only one of fixed_node_count and scaling can be set. Setting one will reset the other.

Returns:



99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
# File 'proto_docs/google/cloud/aiplatform/v1/featurestore.rb', line 99

class OnlineServingConfig
  include ::Google::Protobuf::MessageExts
  extend ::Google::Protobuf::MessageExts::ClassMethods

  # Online serving scaling configuration. If min_node_count and
  # max_node_count are set to the same value, the cluster will be configured
  # with the fixed number of node (no auto-scaling).
  # @!attribute [rw] min_node_count
  #   @return [::Integer]
  #     Required. The minimum number of nodes to scale down to. Must be greater
  #     than or equal to 1.
  # @!attribute [rw] max_node_count
  #   @return [::Integer]
  #     The maximum number of nodes to scale up to. Must be greater than
  #     min_node_count, and less than or equal to 10 times of 'min_node_count'.
  # @!attribute [rw] cpu_utilization_target
  #   @return [::Integer]
  #     Optional. The cpu utilization that the Autoscaler should be trying to
  #     achieve. This number is on a scale from 0 (no utilization) to 100
  #     (total utilization), and is limited between 10 and 80. When a cluster's
  #     CPU utilization exceeds the target that you have set, Bigtable
  #     immediately adds nodes to the cluster. When CPU utilization is
  #     substantially lower than the target, Bigtable removes nodes. If not set
  #     or set to 0, default to 50.
  class Scaling
    include ::Google::Protobuf::MessageExts
    extend ::Google::Protobuf::MessageExts::ClassMethods
  end
end