Class: Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SchemaModelevaluationMetricsPairwiseTextGenerationEvaluationMetrics

Inherits:

Object

Object
Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SchemaModelevaluationMetricsPairwiseTextGenerationEvaluationMetrics

show all

Includes:: Core::Hashable, Core::JsonObjectSupport

Defined in:: lib/google/apis/aiplatform_v1/classes.rb,
lib/google/apis/aiplatform_v1/representations.rb,
lib/google/apis/aiplatform_v1/representations.rb

Overview

Metrics for general pairwise text generation evaluation results.

Instance Attribute Summary collapse

#accuracy ⇒ Float
Fraction of cases where the autorater agreed with the human raters.
#baseline_model_win_rate ⇒ Float
Percentage of time the autorater decided the baseline model had the better response.
#cohens_kappa ⇒ Float
A measurement of agreement between the autorater and human raters that takes the likelihood of random agreement into account.
#f1_score ⇒ Float
Harmonic mean of precision and recall.
#false_negative_count ⇒ Fixnum
Number of examples where the autorater chose the baseline model, but humans preferred the model.
#false_positive_count ⇒ Fixnum
Number of examples where the autorater chose the model, but humans preferred the baseline model.
#human_preference_baseline_model_win_rate ⇒ Float
Percentage of time humans decided the baseline model had the better response.
#human_preference_model_win_rate ⇒ Float
Percentage of time humans decided the model had the better response.
#model_win_rate ⇒ Float
Percentage of time the autorater decided the model had the better response.
#precision ⇒ Float
Fraction of cases where the autorater and humans thought the model had a better response out of all cases where the autorater thought the model had a better response.
#recall ⇒ Float
Fraction of cases where the autorater and humans thought the model had a better response out of all cases where the humans thought the model had a better response.
#true_negative_count ⇒ Fixnum
Number of examples where both the autorater and humans decided that the model had the worse response.
#true_positive_count ⇒ Fixnum
Number of examples where both the autorater and humans decided that the model had the better response.

Instance Method Summary collapse

#initialize(**args) ⇒ GoogleCloudAiplatformV1SchemaModelevaluationMetricsPairwiseTextGenerationEvaluationMetrics constructor
A new instance of GoogleCloudAiplatformV1SchemaModelevaluationMetricsPairwiseTextGenerationEvaluationMetrics.
#update!(**args) ⇒ Object
Update properties of this object.

Constructor Details

#initialize(**args) ⇒ `GoogleCloudAiplatformV1SchemaModelevaluationMetricsPairwiseTextGenerationEvaluationMetrics`

Returns a new instance of GoogleCloudAiplatformV1SchemaModelevaluationMetricsPairwiseTextGenerationEvaluationMetrics.



25619
25620
25621

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25619

def initialize(**args)
   update!(**args)
end

Instance Attribute Details

#accuracy ⇒ `Float`

Fraction of cases where the autorater agreed with the human raters. Corresponds to the JSON property accuracy

Returns:

(Float)



25547
25548
25549

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25547

def accuracy
  @accuracy
end

#baseline_model_win_rate ⇒ `Float`

Percentage of time the autorater decided the baseline model had the better response. Corresponds to the JSON property baselineModelWinRate

Returns:

(Float)



25553
25554
25555

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25553

def baseline_model_win_rate
  @baseline_model_win_rate
end

#cohens_kappa ⇒ `Float`

A measurement of agreement between the autorater and human raters that takes the likelihood of random agreement into account. Corresponds to the JSON property cohensKappa

Returns:

(Float)



25559
25560
25561

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25559

def cohens_kappa
  @cohens_kappa
end

#f1_score ⇒ `Float`

Harmonic mean of precision and recall. Corresponds to the JSON property f1Score

Returns:

(Float)



25564
25565
25566

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25564

def f1_score
  @f1_score
end

#false_negative_count ⇒ `Fixnum`

Number of examples where the autorater chose the baseline model, but humans preferred the model. Corresponds to the JSON property falseNegativeCount

Returns:

(Fixnum)



25570
25571
25572

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25570

def false_negative_count
  @false_negative_count
end

#false_positive_count ⇒ `Fixnum`

Number of examples where the autorater chose the model, but humans preferred the baseline model. Corresponds to the JSON property falsePositiveCount

Returns:

(Fixnum)



25576
25577
25578

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25576

def false_positive_count
  @false_positive_count
end

#human_preference_baseline_model_win_rate ⇒ `Float`

Percentage of time humans decided the baseline model had the better response. Corresponds to the JSON property humanPreferenceBaselineModelWinRate

Returns:

(Float)



25581
25582
25583

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25581

def human_preference_baseline_model_win_rate
  @human_preference_baseline_model_win_rate
end

#human_preference_model_win_rate ⇒ `Float`

Percentage of time humans decided the model had the better response. Corresponds to the JSON property humanPreferenceModelWinRate

Returns:

(Float)



25586
25587
25588

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25586

def human_preference_model_win_rate
  @human_preference_model_win_rate
end

#model_win_rate ⇒ `Float`

Percentage of time the autorater decided the model had the better response. Corresponds to the JSON property modelWinRate

Returns:

(Float)



25591
25592
25593

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25591

def model_win_rate
  @model_win_rate
end

#precision ⇒ `Float`

Fraction of cases where the autorater and humans thought the model had a better response out of all cases where the autorater thought the model had a better response. True positive divided by all positive. Corresponds to the JSON property precision

Returns:

(Float)



25598
25599
25600

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25598

def precision
  @precision
end

#recall ⇒ `Float`

Fraction of cases where the autorater and humans thought the model had a better response out of all cases where the humans thought the model had a better response. Corresponds to the JSON property recall

Returns:

(Float)



25605
25606
25607

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25605

def recall
  @recall
end

#true_negative_count ⇒ `Fixnum`

Number of examples where both the autorater and humans decided that the model had the worse response. Corresponds to the JSON property trueNegativeCount

Returns:

(Fixnum)



25611
25612
25613

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25611

def true_negative_count
  @true_negative_count
end

#true_positive_count ⇒ `Fixnum`

Number of examples where both the autorater and humans decided that the model had the better response. Corresponds to the JSON property truePositiveCount

Returns:

(Fixnum)



25617
25618
25619

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25617

def true_positive_count
  @true_positive_count
end

Instance Method Details

#update!(**args) ⇒ `Object`

Update properties of this object

# File 'lib/google/apis/aiplatform_v1/classes.rb', line 25624

def update!(**args)
  @accuracy = args[:accuracy] if args.key?(:accuracy)
  @baseline_model_win_rate = args[:baseline_model_win_rate] if args.key?(:baseline_model_win_rate)
  @cohens_kappa = args[:cohens_kappa] if args.key?(:cohens_kappa)
  @f1_score = args[:f1_score] if args.key?(:f1_score)
  @false_negative_count = args[:false_negative_count] if args.key?(:false_negative_count)
  @false_positive_count = args[:false_positive_count] if args.key?(:false_positive_count)
  @human_preference_baseline_model_win_rate = args[:human_preference_baseline_model_win_rate] if args.key?(:human_preference_baseline_model_win_rate)
  @human_preference_model_win_rate = args[:human_preference_model_win_rate] if args.key?(:human_preference_model_win_rate)
  @model_win_rate = args[:model_win_rate] if args.key?(:model_win_rate)
  @precision = args[:precision] if args.key?(:precision)
  @recall = args[:recall] if args.key?(:recall)
  @true_negative_count = args[:true_negative_count] if args.key?(:true_negative_count)
  @true_positive_count = args[:true_positive_count] if args.key?(:true_positive_count)
end

Class: Google::Apis::AiplatformV1::GoogleCloudAiplatformV1SchemaModelevaluationMetricsPairwiseTextGenerationEvaluationMetrics

Overview

Instance Attribute Summary collapse

Instance Method Summary collapse

Constructor Details

#initialize(**args) ⇒ GoogleCloudAiplatformV1SchemaModelevaluationMetricsPairwiseTextGenerationEvaluationMetrics

Instance Attribute Details

#accuracy ⇒ Float

#baseline_model_win_rate ⇒ Float

#cohens_kappa ⇒ Float

#f1_score ⇒ Float

#false_negative_count ⇒ Fixnum

#false_positive_count ⇒ Fixnum

#human_preference_baseline_model_win_rate ⇒ Float

#human_preference_model_win_rate ⇒ Float

#model_win_rate ⇒ Float

#precision ⇒ Float

#recall ⇒ Float

#true_negative_count ⇒ Fixnum

#true_positive_count ⇒ Fixnum

Instance Method Details

#update!(**args) ⇒ Object

#initialize(**args) ⇒ `GoogleCloudAiplatformV1SchemaModelevaluationMetricsPairwiseTextGenerationEvaluationMetrics`

#accuracy ⇒ `Float`

#baseline_model_win_rate ⇒ `Float`

#cohens_kappa ⇒ `Float`

#f1_score ⇒ `Float`

#false_negative_count ⇒ `Fixnum`

#false_positive_count ⇒ `Fixnum`

#human_preference_baseline_model_win_rate ⇒ `Float`

#human_preference_model_win_rate ⇒ `Float`

#model_win_rate ⇒ `Float`

#precision ⇒ `Float`

#recall ⇒ `Float`

#true_negative_count ⇒ `Fixnum`

#true_positive_count ⇒ `Fixnum`

#update!(**args) ⇒ `Object`