Show / Hide Table of Contents

Class WordInfo

Word-specific information for recognized words.

Inheritance
object
WordInfo
Implements
IDirectResponseSchema
Inherited Members
object.Equals(object)
object.Equals(object, object)
object.GetHashCode()
object.GetType()
object.MemberwiseClone()
object.ReferenceEquals(object, object)
object.ToString()
Namespace: Google.Apis.Speech.v1p1beta1.Data
Assembly: Google.Apis.Speech.v1p1beta1.dll
Syntax
public class WordInfo : IDirectResponseSchema

Properties

Confidence

The confidence estimate between 0.0 and 1.0. A higher number indicates an estimated greater likelihood that the recognized words are correct. This field is set only for the top alternative of a non-streaming result or, of a streaming result where is_final=true. This field is not guaranteed to be accurate and users should not rely on it to be always provided. The default of 0.0 is a sentinel value indicating confidence was not set.

Declaration
[JsonProperty("confidence")]
public virtual float? Confidence { get; set; }
Property Value
Type Description
float?

ETag

The ETag of the item.

Declaration
public virtual string ETag { get; set; }
Property Value
Type Description
string

EndTime

Time offset relative to the beginning of the audio, and corresponding to the end of the spoken word. This field is only set if enable_word_time_offsets=true and only in the top hypothesis. This is an experimental feature and the accuracy of the time offset can vary.

Declaration
[JsonProperty("endTime")]
public virtual object EndTime { get; set; }
Property Value
Type Description
object

SpeakerLabel

Output only. A label value assigned for every unique speaker within the audio. This field specifies which speaker was detected to have spoken this word. For some models, like medical_conversation this can be actual speaker role, for example "patient" or "provider", but generally this would be a number identifying a speaker. This field is only set if enable_speaker_diarization = 'true' and only for the top alternative.

Declaration
[JsonProperty("speakerLabel")]
public virtual string SpeakerLabel { get; set; }
Property Value
Type Description
string

SpeakerTag

Output only. A distinct integer value is assigned for every speaker within the audio. This field specifies which one of those speakers was detected to have spoken this word. Value ranges from '1' to diarization_speaker_count. speaker_tag is set if enable_speaker_diarization = 'true' and only for the top alternative. Note: Use speaker_label instead.

Declaration
[JsonProperty("speakerTag")]
public virtual int? SpeakerTag { get; set; }
Property Value
Type Description
int?

StartTime

Time offset relative to the beginning of the audio, and corresponding to the start of the spoken word. This field is only set if enable_word_time_offsets=true and only in the top hypothesis. This is an experimental feature and the accuracy of the time offset can vary.

Declaration
[JsonProperty("startTime")]
public virtual object StartTime { get; set; }
Property Value
Type Description
object

Word

The word corresponding to this set of information.

Declaration
[JsonProperty("word")]
public virtual string Word { get; set; }
Property Value
Type Description
string

Implements

IDirectResponseSchema
In this article
Back to top Generated by DocFX