Class WordInfo
Word-specific information for recognized words.
Implements
Inherited Members
Namespace: Google.Apis.Speech.v1p1beta1.Data
Assembly: Google.Apis.Speech.v1p1beta1.dll
Syntax
public class WordInfo : IDirectResponseSchema
Properties
Confidence
The confidence estimate between 0.0 and 1.0. A higher number indicates an estimated greater likelihood that
the recognized words are correct. This field is set only for the top alternative of a non-streaming result
or, of a streaming result where is_final=true
. This field is not guaranteed to be accurate and users
should not rely on it to be always provided. The default of 0.0 is a sentinel value indicating confidence
was not set.
Declaration
[JsonProperty("confidence")]
public virtual float? Confidence { get; set; }
Property Value
Type | Description |
---|---|
float? |
ETag
The ETag of the item.
Declaration
public virtual string ETag { get; set; }
Property Value
Type | Description |
---|---|
string |
EndTime
Time offset relative to the beginning of the audio, and corresponding to the end of the spoken word. This
field is only set if enable_word_time_offsets=true
and only in the top hypothesis. This is an experimental
feature and the accuracy of the time offset can vary.
Declaration
[JsonProperty("endTime")]
public virtual object EndTime { get; set; }
Property Value
Type | Description |
---|---|
object |
SpeakerLabel
Output only. A label value assigned for every unique speaker within the audio. This field specifies which speaker was detected to have spoken this word. For some models, like medical_conversation this can be actual speaker role, for example "patient" or "provider", but generally this would be a number identifying a speaker. This field is only set if enable_speaker_diarization = 'true' and only for the top alternative.
Declaration
[JsonProperty("speakerLabel")]
public virtual string SpeakerLabel { get; set; }
Property Value
Type | Description |
---|---|
string |
SpeakerTag
Output only. A distinct integer value is assigned for every speaker within the audio. This field specifies which one of those speakers was detected to have spoken this word. Value ranges from '1' to diarization_speaker_count. speaker_tag is set if enable_speaker_diarization = 'true' and only for the top alternative. Note: Use speaker_label instead.
Declaration
[JsonProperty("speakerTag")]
public virtual int? SpeakerTag { get; set; }
Property Value
Type | Description |
---|---|
int? |
StartTime
Time offset relative to the beginning of the audio, and corresponding to the start of the spoken word. This
field is only set if enable_word_time_offsets=true
and only in the top hypothesis. This is an experimental
feature and the accuracy of the time offset can vary.
Declaration
[JsonProperty("startTime")]
public virtual object StartTime { get; set; }
Property Value
Type | Description |
---|---|
object |
Word
The word corresponding to this set of information.
Declaration
[JsonProperty("word")]
public virtual string Word { get; set; }
Property Value
Type | Description |
---|---|
string |