DocumentService¶
- class google.cloud.documentai_v1beta3.services.document_service.DocumentServiceAsyncClient(*, credentials: typing.Optional[google.auth.credentials.Credentials] = None, transport: typing.Optional[typing.Union[str, google.cloud.documentai_v1beta3.services.document_service.transports.base.DocumentServiceTransport, typing.Callable[[...], google.cloud.documentai_v1beta3.services.document_service.transports.base.DocumentServiceTransport]]] = 'grpc_asyncio', client_options: typing.Optional[google.api_core.client_options.ClientOptions] = None, client_info: google.api_core.gapic_v1.client_info.ClientInfo = <google.api_core.gapic_v1.client_info.ClientInfo object>)[source]¶
Service to call Cloud DocumentAI to manage document collection (dataset).
Instantiates the document service async client.
- Parameters
credentials (Optional[google.auth.credentials.Credentials]) – The authorization credentials to attach to requests. These credentials identify the application to the service; if none are specified, the client will attempt to ascertain the credentials from the environment.
transport (Optional[Union[str,DocumentServiceTransport,Callable[..., DocumentServiceTransport]]]) – The transport to use, or a Callable that constructs and returns a new transport to use. If a Callable is given, it will be called with the same set of initialization arguments as used in the DocumentServiceTransport constructor. If set to None, a transport is chosen automatically.
client_options (Optional[Union[google.api_core.client_options.ClientOptions, dict]]) –
Custom options for the client.
1. The
api_endpoint
property can be used to override the default endpoint provided by the client whentransport
is not explicitly provided. Only if this property is not set andtransport
was not explicitly provided, the endpoint is determined by the GOOGLE_API_USE_MTLS_ENDPOINT environment variable, which have one of the following values: “always” (always use the default mTLS endpoint), “never” (always use the default regular endpoint) and “auto” (auto-switch to the default mTLS endpoint if client certificate is present; this is the default value).2. If the GOOGLE_API_USE_CLIENT_CERTIFICATE environment variable is “true”, then the
client_cert_source
property can be used to provide a client certificate for mTLS transport. If not provided, the default SSL client certificate will be used if present. If GOOGLE_API_USE_CLIENT_CERTIFICATE is “false” or not set, no client certificate will be used.3. The
universe_domain
property can be used to override the default “googleapis.com” universe. Note thatapi_endpoint
property still takes precedence; anduniverse_domain
is currently not supported for mTLS.client_info (google.api_core.gapic_v1.client_info.ClientInfo) – The client info used to send a user-agent string along with API requests. If
None
, then default info will be used. Generally, you only need to set this if you’re developing your own client library.
- Raises
google.auth.exceptions.MutualTlsChannelError – If mutual TLS transport creation failed for any reason.
- property api_endpoint¶
Return the API endpoint used by the client instance.
- Returns
The API endpoint used by the client instance.
- Return type
- async batch_delete_documents(request: Optional[Union[google.cloud.documentai_v1beta3.types.document_service.BatchDeleteDocumentsRequest, dict]] = None, *, dataset: Optional[str] = None, retry: Optional[Union[google.api_core.retry.retry_unary_async.AsyncRetry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.api_core.operation_async.AsyncOperation [source]¶
Deletes a set of documents.
# This snippet has been automatically generated and should be regarded as a # code template only. # It will require modifications to work: # - It may require correct/in-range values for request initialization. # - It may require specifying regional endpoints when creating the service # client as shown in: # https://googleapis.dev/python/google-api-core/latest/client_options.html from google.cloud import documentai_v1beta3 async def sample_batch_delete_documents(): # Create a client client = documentai_v1beta3.DocumentServiceAsyncClient() # Initialize request argument(s) dataset_documents = documentai_v1beta3.BatchDatasetDocuments() dataset_documents.individual_document_ids.document_ids.gcs_managed_doc_id.gcs_uri = "gcs_uri_value" request = documentai_v1beta3.BatchDeleteDocumentsRequest( dataset="dataset_value", dataset_documents=dataset_documents, ) # Make the request operation = client.batch_delete_documents(request=request) print("Waiting for operation to complete...") response = (await operation).result() # Handle the response print(response)
- Parameters
request (Optional[Union[google.cloud.documentai_v1beta3.types.BatchDeleteDocumentsRequest, dict]]) – The request object.
dataset (
str
) –Required. The dataset resource name. Format:
projects/{project}/locations/{location}/processors/{processor}/dataset
This corresponds to the
dataset
field on therequest
instance; ifrequest
is provided, this should not be set.retry (google.api_core.retry_async.AsyncRetry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
An object representing a long-running operation.
The result type for the operation will be
google.cloud.documentai_v1beta3.types.BatchDeleteDocumentsResponse
Response of the delete documents operation.- Return type
- async cancel_operation(request: Optional[google.longrunning.operations_pb2.CancelOperationRequest] = None, *, retry: Optional[Union[google.api_core.retry.retry_unary_async.AsyncRetry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) None [source]¶
Starts asynchronous cancellation on a long-running operation.
The server makes a best effort to cancel the operation, but success is not guaranteed. If the server doesn’t support this method, it returns google.rpc.Code.UNIMPLEMENTED.
- Parameters
request (
CancelOperationRequest
) – The request object. Request message for CancelOperation method.retry (google.api_core.retry_async.AsyncRetry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
None
- static common_billing_account_path(billing_account: str) str ¶
Returns a fully-qualified billing_account string.
- static common_location_path(project: str, location: str) str ¶
Returns a fully-qualified location string.
- static common_organization_path(organization: str) str ¶
Returns a fully-qualified organization string.
- static dataset_path(project: str, location: str, processor: str) str ¶
Returns a fully-qualified dataset string.
- static dataset_schema_path(project: str, location: str, processor: str) str ¶
Returns a fully-qualified dataset_schema string.
- classmethod from_service_account_file(filename: str, *args, **kwargs)[source]¶
- Creates an instance of this client using the provided credentials
file.
- Parameters
filename (str) – The path to the service account private key json file.
args – Additional arguments to pass to the constructor.
kwargs – Additional arguments to pass to the constructor.
- Returns
The constructed client.
- Return type
- classmethod from_service_account_info(info: dict, *args, **kwargs)[source]¶
- Creates an instance of this client using the provided credentials
info.
- Parameters
info (dict) – The service account private key info.
args – Additional arguments to pass to the constructor.
kwargs – Additional arguments to pass to the constructor.
- Returns
The constructed client.
- Return type
- classmethod from_service_account_json(filename: str, *args, **kwargs)¶
- Creates an instance of this client using the provided credentials
file.
- Parameters
filename (str) – The path to the service account private key json file.
args – Additional arguments to pass to the constructor.
kwargs – Additional arguments to pass to the constructor.
- Returns
The constructed client.
- Return type
- async get_dataset_schema(request: Optional[Union[google.cloud.documentai_v1beta3.types.document_service.GetDatasetSchemaRequest, dict]] = None, *, name: Optional[str] = None, retry: Optional[Union[google.api_core.retry.retry_unary_async.AsyncRetry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.cloud.documentai_v1beta3.types.dataset.DatasetSchema [source]¶
Gets the
DatasetSchema
of aDataset
.# This snippet has been automatically generated and should be regarded as a # code template only. # It will require modifications to work: # - It may require correct/in-range values for request initialization. # - It may require specifying regional endpoints when creating the service # client as shown in: # https://googleapis.dev/python/google-api-core/latest/client_options.html from google.cloud import documentai_v1beta3 async def sample_get_dataset_schema(): # Create a client client = documentai_v1beta3.DocumentServiceAsyncClient() # Initialize request argument(s) request = documentai_v1beta3.GetDatasetSchemaRequest( name="name_value", ) # Make the request response = await client.get_dataset_schema(request=request) # Handle the response print(response)
- Parameters
request (Optional[Union[google.cloud.documentai_v1beta3.types.GetDatasetSchemaRequest, dict]]) – The request object. Request for
GetDatasetSchema
.name (
str
) –Required. The dataset schema resource name. Format:
projects/{project}/locations/{location}/processors/{processor}/dataset/datasetSchema
This corresponds to the
name
field on therequest
instance; ifrequest
is provided, this should not be set.retry (google.api_core.retry_async.AsyncRetry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
Dataset Schema.
- Return type
- async get_document(request: Optional[Union[google.cloud.documentai_v1beta3.types.document_service.GetDocumentRequest, dict]] = None, *, dataset: Optional[str] = None, retry: Optional[Union[google.api_core.retry.retry_unary_async.AsyncRetry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.cloud.documentai_v1beta3.types.document_service.GetDocumentResponse [source]¶
Returns relevant fields present in the requested document.
# This snippet has been automatically generated and should be regarded as a # code template only. # It will require modifications to work: # - It may require correct/in-range values for request initialization. # - It may require specifying regional endpoints when creating the service # client as shown in: # https://googleapis.dev/python/google-api-core/latest/client_options.html from google.cloud import documentai_v1beta3 async def sample_get_document(): # Create a client client = documentai_v1beta3.DocumentServiceAsyncClient() # Initialize request argument(s) document_id = documentai_v1beta3.DocumentId() document_id.gcs_managed_doc_id.gcs_uri = "gcs_uri_value" request = documentai_v1beta3.GetDocumentRequest( dataset="dataset_value", document_id=document_id, ) # Make the request response = await client.get_document(request=request) # Handle the response print(response)
- Parameters
request (Optional[Union[google.cloud.documentai_v1beta3.types.GetDocumentRequest, dict]]) – The request object.
dataset (
str
) –Required. The resource name of the dataset that the document belongs to . Format:
projects/{project}/locations/{location}/processors/{processor}/dataset
This corresponds to the
dataset
field on therequest
instance; ifrequest
is provided, this should not be set.retry (google.api_core.retry_async.AsyncRetry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Return type
- async get_location(request: Optional[google.cloud.location.locations_pb2.GetLocationRequest] = None, *, retry: Optional[Union[google.api_core.retry.retry_unary_async.AsyncRetry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.cloud.location.locations_pb2.Location [source]¶
Gets information about a location.
- Parameters
request (
GetLocationRequest
) – The request object. Request message for GetLocation method.retry (google.api_core.retry_async.AsyncRetry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
Location object.
- Return type
Location
- classmethod get_mtls_endpoint_and_cert_source(client_options: Optional[google.api_core.client_options.ClientOptions] = None)[source]¶
Return the API endpoint and client cert source for mutual TLS.
The client cert source is determined in the following order: (1) if GOOGLE_API_USE_CLIENT_CERTIFICATE environment variable is not “true”, the client cert source is None. (2) if client_options.client_cert_source is provided, use the provided one; if the default client cert source exists, use the default one; otherwise the client cert source is None.
The API endpoint is determined in the following order: (1) if client_options.api_endpoint if provided, use the provided one. (2) if GOOGLE_API_USE_CLIENT_CERTIFICATE environment variable is “always”, use the default mTLS endpoint; if the environment variable is “never”, use the default API endpoint; otherwise if client cert source exists, use the default mTLS endpoint, otherwise use the default API endpoint.
More details can be found at https://google.aip.dev/auth/4114.
- Parameters
client_options (google.api_core.client_options.ClientOptions) – Custom options for the client. Only the api_endpoint and client_cert_source properties may be used in this method.
- Returns
- returns the API endpoint and the
client cert source to use.
- Return type
- Raises
google.auth.exceptions.MutualTLSChannelError – If any errors happen.
- async get_operation(request: Optional[google.longrunning.operations_pb2.GetOperationRequest] = None, *, retry: Optional[Union[google.api_core.retry.retry_unary_async.AsyncRetry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.longrunning.operations_pb2.Operation [source]¶
Gets the latest state of a long-running operation.
- Parameters
request (
GetOperationRequest
) – The request object. Request message for GetOperation method.retry (google.api_core.retry_async.AsyncRetry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
An
Operation
object.- Return type
Operation
- classmethod get_transport_class(label: Optional[str] = None) Type[google.cloud.documentai_v1beta3.services.document_service.transports.base.DocumentServiceTransport] ¶
Returns an appropriate transport class.
- Parameters
label – The name of the desired transport. If none is provided, then the first transport in the registry is used.
- Returns
The transport class to use.
- async import_documents(request: Optional[Union[google.cloud.documentai_v1beta3.types.document_service.ImportDocumentsRequest, dict]] = None, *, dataset: Optional[str] = None, retry: Optional[Union[google.api_core.retry.retry_unary_async.AsyncRetry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.api_core.operation_async.AsyncOperation [source]¶
Import documents into a dataset.
# This snippet has been automatically generated and should be regarded as a # code template only. # It will require modifications to work: # - It may require correct/in-range values for request initialization. # - It may require specifying regional endpoints when creating the service # client as shown in: # https://googleapis.dev/python/google-api-core/latest/client_options.html from google.cloud import documentai_v1beta3 async def sample_import_documents(): # Create a client client = documentai_v1beta3.DocumentServiceAsyncClient() # Initialize request argument(s) batch_documents_import_configs = documentai_v1beta3.BatchDocumentsImportConfig() batch_documents_import_configs.dataset_split = "DATASET_SPLIT_UNASSIGNED" request = documentai_v1beta3.ImportDocumentsRequest( dataset="dataset_value", batch_documents_import_configs=batch_documents_import_configs, ) # Make the request operation = client.import_documents(request=request) print("Waiting for operation to complete...") response = (await operation).result() # Handle the response print(response)
- Parameters
request (Optional[Union[google.cloud.documentai_v1beta3.types.ImportDocumentsRequest, dict]]) – The request object.
dataset (
str
) –Required. The dataset resource name. Format:
projects/{project}/locations/{location}/processors/{processor}/dataset
This corresponds to the
dataset
field on therequest
instance; ifrequest
is provided, this should not be set.retry (google.api_core.retry_async.AsyncRetry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
An object representing a long-running operation.
The result type for the operation will be
google.cloud.documentai_v1beta3.types.ImportDocumentsResponse
Response of the import document operation.- Return type
- async list_documents(request: Optional[Union[google.cloud.documentai_v1beta3.types.document_service.ListDocumentsRequest, dict]] = None, *, dataset: Optional[str] = None, retry: Optional[Union[google.api_core.retry.retry_unary_async.AsyncRetry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.cloud.documentai_v1beta3.services.document_service.pagers.ListDocumentsAsyncPager [source]¶
Returns a list of documents present in the dataset.
# This snippet has been automatically generated and should be regarded as a # code template only. # It will require modifications to work: # - It may require correct/in-range values for request initialization. # - It may require specifying regional endpoints when creating the service # client as shown in: # https://googleapis.dev/python/google-api-core/latest/client_options.html from google.cloud import documentai_v1beta3 async def sample_list_documents(): # Create a client client = documentai_v1beta3.DocumentServiceAsyncClient() # Initialize request argument(s) request = documentai_v1beta3.ListDocumentsRequest( dataset="dataset_value", ) # Make the request page_result = client.list_documents(request=request) # Handle the response async for response in page_result: print(response)
- Parameters
request (Optional[Union[google.cloud.documentai_v1beta3.types.ListDocumentsRequest, dict]]) – The request object.
dataset (
str
) –Required. The resource name of the dataset to be listed. Format:
projects/{project}/locations/{location}/processors/{processor}/dataset
This corresponds to the
dataset
field on therequest
instance; ifrequest
is provided, this should not be set.retry (google.api_core.retry_async.AsyncRetry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
Iterating over this object will yield results and resolve additional pages automatically.
- Return type
google.cloud.documentai_v1beta3.services.document_service.pagers.ListDocumentsAsyncPager
- async list_locations(request: Optional[google.cloud.location.locations_pb2.ListLocationsRequest] = None, *, retry: Optional[Union[google.api_core.retry.retry_unary_async.AsyncRetry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.cloud.location.locations_pb2.ListLocationsResponse [source]¶
Lists information about the supported locations for this service.
- Parameters
request (
ListLocationsRequest
) – The request object. Request message for ListLocations method.retry (google.api_core.retry_async.AsyncRetry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
Response message for
ListLocations
method.- Return type
ListLocationsResponse
- async list_operations(request: Optional[google.longrunning.operations_pb2.ListOperationsRequest] = None, *, retry: Optional[Union[google.api_core.retry.retry_unary_async.AsyncRetry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.longrunning.operations_pb2.ListOperationsResponse [source]¶
Lists operations that match the specified filter in the request.
- Parameters
request (
ListOperationsRequest
) – The request object. Request message for ListOperations method.retry (google.api_core.retry_async.AsyncRetry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
Response message for
ListOperations
method.- Return type
ListOperationsResponse
- static parse_common_billing_account_path(path: str) Dict[str, str] ¶
Parse a billing_account path into its component segments.
- static parse_common_folder_path(path: str) Dict[str, str] ¶
Parse a folder path into its component segments.
- static parse_common_location_path(path: str) Dict[str, str] ¶
Parse a location path into its component segments.
- static parse_common_organization_path(path: str) Dict[str, str] ¶
Parse a organization path into its component segments.
- static parse_common_project_path(path: str) Dict[str, str] ¶
Parse a project path into its component segments.
- static parse_dataset_path(path: str) Dict[str, str] ¶
Parses a dataset path into its component segments.
- static parse_dataset_schema_path(path: str) Dict[str, str] ¶
Parses a dataset_schema path into its component segments.
- static parse_schema_path(path: str) Dict[str, str] ¶
Parses a schema path into its component segments.
- static schema_path(project: str, location: str, schema: str) str ¶
Returns a fully-qualified schema string.
- property transport: google.cloud.documentai_v1beta3.services.document_service.transports.base.DocumentServiceTransport¶
Returns the transport used by the client instance.
- Returns
The transport used by the client instance.
- Return type
DocumentServiceTransport
- property universe_domain: str¶
Return the universe domain used by the client instance.
- Returns
- The universe domain used
by the client instance.
- Return type
- async update_dataset(request: Optional[Union[google.cloud.documentai_v1beta3.types.document_service.UpdateDatasetRequest, dict]] = None, *, dataset: Optional[google.cloud.documentai_v1beta3.types.dataset.Dataset] = None, update_mask: Optional[google.protobuf.field_mask_pb2.FieldMask] = None, retry: Optional[Union[google.api_core.retry.retry_unary_async.AsyncRetry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.api_core.operation_async.AsyncOperation [source]¶
Updates metadata associated with a dataset. Note that this method requires the
documentai.googleapis.com/datasets.update
permission on the project, which is highly privileged. A user or service account with this permission can create new processors that can interact with any gcs bucket in your project.# This snippet has been automatically generated and should be regarded as a # code template only. # It will require modifications to work: # - It may require correct/in-range values for request initialization. # - It may require specifying regional endpoints when creating the service # client as shown in: # https://googleapis.dev/python/google-api-core/latest/client_options.html from google.cloud import documentai_v1beta3 async def sample_update_dataset(): # Create a client client = documentai_v1beta3.DocumentServiceAsyncClient() # Initialize request argument(s) dataset = documentai_v1beta3.Dataset() dataset.state = "INITIALIZED" request = documentai_v1beta3.UpdateDatasetRequest( dataset=dataset, ) # Make the request operation = client.update_dataset(request=request) print("Waiting for operation to complete...") response = (await operation).result() # Handle the response print(response)
- Parameters
request (Optional[Union[google.cloud.documentai_v1beta3.types.UpdateDatasetRequest, dict]]) – The request object.
dataset (
google.cloud.documentai_v1beta3.types.Dataset
) –Required. The
name
field of theDataset
is used to identify the resource to be updated.This corresponds to the
dataset
field on therequest
instance; ifrequest
is provided, this should not be set.update_mask (
google.protobuf.field_mask_pb2.FieldMask
) –The update mask applies to the resource.
This corresponds to the
update_mask
field on therequest
instance; ifrequest
is provided, this should not be set.retry (google.api_core.retry_async.AsyncRetry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
An object representing a long-running operation.
- The result type for the operation will be
google.cloud.documentai_v1beta3.types.Dataset
A singleton resource under a [Processor][google.cloud.documentai.v1beta3.Processor] which configures a collection of documents.
- The result type for the operation will be
- Return type
- async update_dataset_schema(request: Optional[Union[google.cloud.documentai_v1beta3.types.document_service.UpdateDatasetSchemaRequest, dict]] = None, *, dataset_schema: Optional[google.cloud.documentai_v1beta3.types.dataset.DatasetSchema] = None, update_mask: Optional[google.protobuf.field_mask_pb2.FieldMask] = None, retry: Optional[Union[google.api_core.retry.retry_unary_async.AsyncRetry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.cloud.documentai_v1beta3.types.dataset.DatasetSchema [source]¶
Updates a
DatasetSchema
.# This snippet has been automatically generated and should be regarded as a # code template only. # It will require modifications to work: # - It may require correct/in-range values for request initialization. # - It may require specifying regional endpoints when creating the service # client as shown in: # https://googleapis.dev/python/google-api-core/latest/client_options.html from google.cloud import documentai_v1beta3 async def sample_update_dataset_schema(): # Create a client client = documentai_v1beta3.DocumentServiceAsyncClient() # Initialize request argument(s) request = documentai_v1beta3.UpdateDatasetSchemaRequest( ) # Make the request response = await client.update_dataset_schema(request=request) # Handle the response print(response)
- Parameters
request (Optional[Union[google.cloud.documentai_v1beta3.types.UpdateDatasetSchemaRequest, dict]]) – The request object. Request for
UpdateDatasetSchema
.dataset_schema (
google.cloud.documentai_v1beta3.types.DatasetSchema
) –Required. The name field of the
DatasetSchema
is used to identify the resource to be updated.This corresponds to the
dataset_schema
field on therequest
instance; ifrequest
is provided, this should not be set.update_mask (
google.protobuf.field_mask_pb2.FieldMask
) –The update mask applies to the resource.
This corresponds to the
update_mask
field on therequest
instance; ifrequest
is provided, this should not be set.retry (google.api_core.retry_async.AsyncRetry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
Dataset Schema.
- Return type
- class google.cloud.documentai_v1beta3.services.document_service.DocumentServiceClient(*, credentials: typing.Optional[google.auth.credentials.Credentials] = None, transport: typing.Optional[typing.Union[str, google.cloud.documentai_v1beta3.services.document_service.transports.base.DocumentServiceTransport, typing.Callable[[...], google.cloud.documentai_v1beta3.services.document_service.transports.base.DocumentServiceTransport]]] = None, client_options: typing.Optional[typing.Union[google.api_core.client_options.ClientOptions, dict]] = None, client_info: google.api_core.gapic_v1.client_info.ClientInfo = <google.api_core.gapic_v1.client_info.ClientInfo object>)[source]¶
Service to call Cloud DocumentAI to manage document collection (dataset).
Instantiates the document service client.
- Parameters
credentials (Optional[google.auth.credentials.Credentials]) – The authorization credentials to attach to requests. These credentials identify the application to the service; if none are specified, the client will attempt to ascertain the credentials from the environment.
transport (Optional[Union[str,DocumentServiceTransport,Callable[..., DocumentServiceTransport]]]) – The transport to use, or a Callable that constructs and returns a new transport. If a Callable is given, it will be called with the same set of initialization arguments as used in the DocumentServiceTransport constructor. If set to None, a transport is chosen automatically.
client_options (Optional[Union[google.api_core.client_options.ClientOptions, dict]]) –
Custom options for the client.
1. The
api_endpoint
property can be used to override the default endpoint provided by the client whentransport
is not explicitly provided. Only if this property is not set andtransport
was not explicitly provided, the endpoint is determined by the GOOGLE_API_USE_MTLS_ENDPOINT environment variable, which have one of the following values: “always” (always use the default mTLS endpoint), “never” (always use the default regular endpoint) and “auto” (auto-switch to the default mTLS endpoint if client certificate is present; this is the default value).2. If the GOOGLE_API_USE_CLIENT_CERTIFICATE environment variable is “true”, then the
client_cert_source
property can be used to provide a client certificate for mTLS transport. If not provided, the default SSL client certificate will be used if present. If GOOGLE_API_USE_CLIENT_CERTIFICATE is “false” or not set, no client certificate will be used.3. The
universe_domain
property can be used to override the default “googleapis.com” universe. Note that theapi_endpoint
property still takes precedence; anduniverse_domain
is currently not supported for mTLS.client_info (google.api_core.gapic_v1.client_info.ClientInfo) – The client info used to send a user-agent string along with API requests. If
None
, then default info will be used. Generally, you only need to set this if you’re developing your own client library.
- Raises
google.auth.exceptions.MutualTLSChannelError – If mutual TLS transport creation failed for any reason.
- __exit__(type, value, traceback)[source]¶
Releases underlying transport’s resources.
Warning
ONLY use as a context manager if the transport is NOT shared with other clients! Exiting the with block will CLOSE the transport and may cause errors in other clients!
- property api_endpoint¶
Return the API endpoint used by the client instance.
- Returns
The API endpoint used by the client instance.
- Return type
- batch_delete_documents(request: Optional[Union[google.cloud.documentai_v1beta3.types.document_service.BatchDeleteDocumentsRequest, dict]] = None, *, dataset: Optional[str] = None, retry: Optional[Union[google.api_core.retry.retry_unary.Retry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.api_core.operation.Operation [source]¶
Deletes a set of documents.
# This snippet has been automatically generated and should be regarded as a # code template only. # It will require modifications to work: # - It may require correct/in-range values for request initialization. # - It may require specifying regional endpoints when creating the service # client as shown in: # https://googleapis.dev/python/google-api-core/latest/client_options.html from google.cloud import documentai_v1beta3 def sample_batch_delete_documents(): # Create a client client = documentai_v1beta3.DocumentServiceClient() # Initialize request argument(s) dataset_documents = documentai_v1beta3.BatchDatasetDocuments() dataset_documents.individual_document_ids.document_ids.gcs_managed_doc_id.gcs_uri = "gcs_uri_value" request = documentai_v1beta3.BatchDeleteDocumentsRequest( dataset="dataset_value", dataset_documents=dataset_documents, ) # Make the request operation = client.batch_delete_documents(request=request) print("Waiting for operation to complete...") response = operation.result() # Handle the response print(response)
- Parameters
request (Union[google.cloud.documentai_v1beta3.types.BatchDeleteDocumentsRequest, dict]) – The request object.
dataset (str) –
Required. The dataset resource name. Format:
projects/{project}/locations/{location}/processors/{processor}/dataset
This corresponds to the
dataset
field on therequest
instance; ifrequest
is provided, this should not be set.retry (google.api_core.retry.Retry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
An object representing a long-running operation.
The result type for the operation will be
google.cloud.documentai_v1beta3.types.BatchDeleteDocumentsResponse
Response of the delete documents operation.- Return type
- cancel_operation(request: Optional[google.longrunning.operations_pb2.CancelOperationRequest] = None, *, retry: Optional[Union[google.api_core.retry.retry_unary.Retry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) None [source]¶
Starts asynchronous cancellation on a long-running operation.
The server makes a best effort to cancel the operation, but success is not guaranteed. If the server doesn’t support this method, it returns google.rpc.Code.UNIMPLEMENTED.
- Parameters
request (
CancelOperationRequest
) – The request object. Request message for CancelOperation method.retry (google.api_core.retry.Retry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
None
- static common_billing_account_path(billing_account: str) str [source]¶
Returns a fully-qualified billing_account string.
- static common_location_path(project: str, location: str) str [source]¶
Returns a fully-qualified location string.
- static common_organization_path(organization: str) str [source]¶
Returns a fully-qualified organization string.
- static dataset_path(project: str, location: str, processor: str) str [source]¶
Returns a fully-qualified dataset string.
- static dataset_schema_path(project: str, location: str, processor: str) str [source]¶
Returns a fully-qualified dataset_schema string.
- classmethod from_service_account_file(filename: str, *args, **kwargs)[source]¶
- Creates an instance of this client using the provided credentials
file.
- Parameters
filename (str) – The path to the service account private key json file.
args – Additional arguments to pass to the constructor.
kwargs – Additional arguments to pass to the constructor.
- Returns
The constructed client.
- Return type
- classmethod from_service_account_info(info: dict, *args, **kwargs)[source]¶
- Creates an instance of this client using the provided credentials
info.
- Parameters
info (dict) – The service account private key info.
args – Additional arguments to pass to the constructor.
kwargs – Additional arguments to pass to the constructor.
- Returns
The constructed client.
- Return type
- classmethod from_service_account_json(filename: str, *args, **kwargs)¶
- Creates an instance of this client using the provided credentials
file.
- Parameters
filename (str) – The path to the service account private key json file.
args – Additional arguments to pass to the constructor.
kwargs – Additional arguments to pass to the constructor.
- Returns
The constructed client.
- Return type
- get_dataset_schema(request: Optional[Union[google.cloud.documentai_v1beta3.types.document_service.GetDatasetSchemaRequest, dict]] = None, *, name: Optional[str] = None, retry: Optional[Union[google.api_core.retry.retry_unary.Retry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.cloud.documentai_v1beta3.types.dataset.DatasetSchema [source]¶
Gets the
DatasetSchema
of aDataset
.# This snippet has been automatically generated and should be regarded as a # code template only. # It will require modifications to work: # - It may require correct/in-range values for request initialization. # - It may require specifying regional endpoints when creating the service # client as shown in: # https://googleapis.dev/python/google-api-core/latest/client_options.html from google.cloud import documentai_v1beta3 def sample_get_dataset_schema(): # Create a client client = documentai_v1beta3.DocumentServiceClient() # Initialize request argument(s) request = documentai_v1beta3.GetDatasetSchemaRequest( name="name_value", ) # Make the request response = client.get_dataset_schema(request=request) # Handle the response print(response)
- Parameters
request (Union[google.cloud.documentai_v1beta3.types.GetDatasetSchemaRequest, dict]) – The request object. Request for
GetDatasetSchema
.name (str) –
Required. The dataset schema resource name. Format:
projects/{project}/locations/{location}/processors/{processor}/dataset/datasetSchema
This corresponds to the
name
field on therequest
instance; ifrequest
is provided, this should not be set.retry (google.api_core.retry.Retry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
Dataset Schema.
- Return type
- get_document(request: Optional[Union[google.cloud.documentai_v1beta3.types.document_service.GetDocumentRequest, dict]] = None, *, dataset: Optional[str] = None, retry: Optional[Union[google.api_core.retry.retry_unary.Retry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.cloud.documentai_v1beta3.types.document_service.GetDocumentResponse [source]¶
Returns relevant fields present in the requested document.
# This snippet has been automatically generated and should be regarded as a # code template only. # It will require modifications to work: # - It may require correct/in-range values for request initialization. # - It may require specifying regional endpoints when creating the service # client as shown in: # https://googleapis.dev/python/google-api-core/latest/client_options.html from google.cloud import documentai_v1beta3 def sample_get_document(): # Create a client client = documentai_v1beta3.DocumentServiceClient() # Initialize request argument(s) document_id = documentai_v1beta3.DocumentId() document_id.gcs_managed_doc_id.gcs_uri = "gcs_uri_value" request = documentai_v1beta3.GetDocumentRequest( dataset="dataset_value", document_id=document_id, ) # Make the request response = client.get_document(request=request) # Handle the response print(response)
- Parameters
request (Union[google.cloud.documentai_v1beta3.types.GetDocumentRequest, dict]) – The request object.
dataset (str) –
Required. The resource name of the dataset that the document belongs to . Format:
projects/{project}/locations/{location}/processors/{processor}/dataset
This corresponds to the
dataset
field on therequest
instance; ifrequest
is provided, this should not be set.retry (google.api_core.retry.Retry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Return type
- get_location(request: Optional[google.cloud.location.locations_pb2.GetLocationRequest] = None, *, retry: Optional[Union[google.api_core.retry.retry_unary.Retry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.cloud.location.locations_pb2.Location [source]¶
Gets information about a location.
- Parameters
request (
GetLocationRequest
) – The request object. Request message for GetLocation method.retry (google.api_core.retry.Retry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
Location object.
- Return type
Location
- classmethod get_mtls_endpoint_and_cert_source(client_options: Optional[google.api_core.client_options.ClientOptions] = None)[source]¶
Deprecated. Return the API endpoint and client cert source for mutual TLS.
The client cert source is determined in the following order: (1) if GOOGLE_API_USE_CLIENT_CERTIFICATE environment variable is not “true”, the client cert source is None. (2) if client_options.client_cert_source is provided, use the provided one; if the default client cert source exists, use the default one; otherwise the client cert source is None.
The API endpoint is determined in the following order: (1) if client_options.api_endpoint if provided, use the provided one. (2) if GOOGLE_API_USE_CLIENT_CERTIFICATE environment variable is “always”, use the default mTLS endpoint; if the environment variable is “never”, use the default API endpoint; otherwise if client cert source exists, use the default mTLS endpoint, otherwise use the default API endpoint.
More details can be found at https://google.aip.dev/auth/4114.
- Parameters
client_options (google.api_core.client_options.ClientOptions) – Custom options for the client. Only the api_endpoint and client_cert_source properties may be used in this method.
- Returns
- returns the API endpoint and the
client cert source to use.
- Return type
- Raises
google.auth.exceptions.MutualTLSChannelError – If any errors happen.
- get_operation(request: Optional[google.longrunning.operations_pb2.GetOperationRequest] = None, *, retry: Optional[Union[google.api_core.retry.retry_unary.Retry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.longrunning.operations_pb2.Operation [source]¶
Gets the latest state of a long-running operation.
- Parameters
request (
GetOperationRequest
) – The request object. Request message for GetOperation method.retry (google.api_core.retry.Retry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
An
Operation
object.- Return type
Operation
- import_documents(request: Optional[Union[google.cloud.documentai_v1beta3.types.document_service.ImportDocumentsRequest, dict]] = None, *, dataset: Optional[str] = None, retry: Optional[Union[google.api_core.retry.retry_unary.Retry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.api_core.operation.Operation [source]¶
Import documents into a dataset.
# This snippet has been automatically generated and should be regarded as a # code template only. # It will require modifications to work: # - It may require correct/in-range values for request initialization. # - It may require specifying regional endpoints when creating the service # client as shown in: # https://googleapis.dev/python/google-api-core/latest/client_options.html from google.cloud import documentai_v1beta3 def sample_import_documents(): # Create a client client = documentai_v1beta3.DocumentServiceClient() # Initialize request argument(s) batch_documents_import_configs = documentai_v1beta3.BatchDocumentsImportConfig() batch_documents_import_configs.dataset_split = "DATASET_SPLIT_UNASSIGNED" request = documentai_v1beta3.ImportDocumentsRequest( dataset="dataset_value", batch_documents_import_configs=batch_documents_import_configs, ) # Make the request operation = client.import_documents(request=request) print("Waiting for operation to complete...") response = operation.result() # Handle the response print(response)
- Parameters
request (Union[google.cloud.documentai_v1beta3.types.ImportDocumentsRequest, dict]) – The request object.
dataset (str) –
Required. The dataset resource name. Format:
projects/{project}/locations/{location}/processors/{processor}/dataset
This corresponds to the
dataset
field on therequest
instance; ifrequest
is provided, this should not be set.retry (google.api_core.retry.Retry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
An object representing a long-running operation.
The result type for the operation will be
google.cloud.documentai_v1beta3.types.ImportDocumentsResponse
Response of the import document operation.- Return type
- list_documents(request: Optional[Union[google.cloud.documentai_v1beta3.types.document_service.ListDocumentsRequest, dict]] = None, *, dataset: Optional[str] = None, retry: Optional[Union[google.api_core.retry.retry_unary.Retry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.cloud.documentai_v1beta3.services.document_service.pagers.ListDocumentsPager [source]¶
Returns a list of documents present in the dataset.
# This snippet has been automatically generated and should be regarded as a # code template only. # It will require modifications to work: # - It may require correct/in-range values for request initialization. # - It may require specifying regional endpoints when creating the service # client as shown in: # https://googleapis.dev/python/google-api-core/latest/client_options.html from google.cloud import documentai_v1beta3 def sample_list_documents(): # Create a client client = documentai_v1beta3.DocumentServiceClient() # Initialize request argument(s) request = documentai_v1beta3.ListDocumentsRequest( dataset="dataset_value", ) # Make the request page_result = client.list_documents(request=request) # Handle the response for response in page_result: print(response)
- Parameters
request (Union[google.cloud.documentai_v1beta3.types.ListDocumentsRequest, dict]) – The request object.
dataset (str) –
Required. The resource name of the dataset to be listed. Format:
projects/{project}/locations/{location}/processors/{processor}/dataset
This corresponds to the
dataset
field on therequest
instance; ifrequest
is provided, this should not be set.retry (google.api_core.retry.Retry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
Iterating over this object will yield results and resolve additional pages automatically.
- Return type
google.cloud.documentai_v1beta3.services.document_service.pagers.ListDocumentsPager
- list_locations(request: Optional[google.cloud.location.locations_pb2.ListLocationsRequest] = None, *, retry: Optional[Union[google.api_core.retry.retry_unary.Retry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.cloud.location.locations_pb2.ListLocationsResponse [source]¶
Lists information about the supported locations for this service.
- Parameters
request (
ListLocationsRequest
) – The request object. Request message for ListLocations method.retry (google.api_core.retry.Retry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
Response message for
ListLocations
method.- Return type
ListLocationsResponse
- list_operations(request: Optional[google.longrunning.operations_pb2.ListOperationsRequest] = None, *, retry: Optional[Union[google.api_core.retry.retry_unary.Retry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.longrunning.operations_pb2.ListOperationsResponse [source]¶
Lists operations that match the specified filter in the request.
- Parameters
request (
ListOperationsRequest
) – The request object. Request message for ListOperations method.retry (google.api_core.retry.Retry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
Response message for
ListOperations
method.- Return type
ListOperationsResponse
- static parse_common_billing_account_path(path: str) Dict[str, str] [source]¶
Parse a billing_account path into its component segments.
- static parse_common_folder_path(path: str) Dict[str, str] [source]¶
Parse a folder path into its component segments.
- static parse_common_location_path(path: str) Dict[str, str] [source]¶
Parse a location path into its component segments.
- static parse_common_organization_path(path: str) Dict[str, str] [source]¶
Parse a organization path into its component segments.
- static parse_common_project_path(path: str) Dict[str, str] [source]¶
Parse a project path into its component segments.
- static parse_dataset_path(path: str) Dict[str, str] [source]¶
Parses a dataset path into its component segments.
- static parse_dataset_schema_path(path: str) Dict[str, str] [source]¶
Parses a dataset_schema path into its component segments.
- static parse_schema_path(path: str) Dict[str, str] [source]¶
Parses a schema path into its component segments.
- static schema_path(project: str, location: str, schema: str) str [source]¶
Returns a fully-qualified schema string.
- property transport: google.cloud.documentai_v1beta3.services.document_service.transports.base.DocumentServiceTransport¶
Returns the transport used by the client instance.
- Returns
- The transport used by the client
instance.
- Return type
DocumentServiceTransport
- property universe_domain: str¶
Return the universe domain used by the client instance.
- Returns
The universe domain used by the client instance.
- Return type
- update_dataset(request: Optional[Union[google.cloud.documentai_v1beta3.types.document_service.UpdateDatasetRequest, dict]] = None, *, dataset: Optional[google.cloud.documentai_v1beta3.types.dataset.Dataset] = None, update_mask: Optional[google.protobuf.field_mask_pb2.FieldMask] = None, retry: Optional[Union[google.api_core.retry.retry_unary.Retry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.api_core.operation.Operation [source]¶
Updates metadata associated with a dataset. Note that this method requires the
documentai.googleapis.com/datasets.update
permission on the project, which is highly privileged. A user or service account with this permission can create new processors that can interact with any gcs bucket in your project.# This snippet has been automatically generated and should be regarded as a # code template only. # It will require modifications to work: # - It may require correct/in-range values for request initialization. # - It may require specifying regional endpoints when creating the service # client as shown in: # https://googleapis.dev/python/google-api-core/latest/client_options.html from google.cloud import documentai_v1beta3 def sample_update_dataset(): # Create a client client = documentai_v1beta3.DocumentServiceClient() # Initialize request argument(s) dataset = documentai_v1beta3.Dataset() dataset.state = "INITIALIZED" request = documentai_v1beta3.UpdateDatasetRequest( dataset=dataset, ) # Make the request operation = client.update_dataset(request=request) print("Waiting for operation to complete...") response = operation.result() # Handle the response print(response)
- Parameters
request (Union[google.cloud.documentai_v1beta3.types.UpdateDatasetRequest, dict]) – The request object.
dataset (google.cloud.documentai_v1beta3.types.Dataset) –
Required. The
name
field of theDataset
is used to identify the resource to be updated.This corresponds to the
dataset
field on therequest
instance; ifrequest
is provided, this should not be set.update_mask (google.protobuf.field_mask_pb2.FieldMask) –
The update mask applies to the resource.
This corresponds to the
update_mask
field on therequest
instance; ifrequest
is provided, this should not be set.retry (google.api_core.retry.Retry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
An object representing a long-running operation.
- The result type for the operation will be
google.cloud.documentai_v1beta3.types.Dataset
A singleton resource under a [Processor][google.cloud.documentai.v1beta3.Processor] which configures a collection of documents.
- The result type for the operation will be
- Return type
- update_dataset_schema(request: Optional[Union[google.cloud.documentai_v1beta3.types.document_service.UpdateDatasetSchemaRequest, dict]] = None, *, dataset_schema: Optional[google.cloud.documentai_v1beta3.types.dataset.DatasetSchema] = None, update_mask: Optional[google.protobuf.field_mask_pb2.FieldMask] = None, retry: Optional[Union[google.api_core.retry.retry_unary.Retry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ()) google.cloud.documentai_v1beta3.types.dataset.DatasetSchema [source]¶
Updates a
DatasetSchema
.# This snippet has been automatically generated and should be regarded as a # code template only. # It will require modifications to work: # - It may require correct/in-range values for request initialization. # - It may require specifying regional endpoints when creating the service # client as shown in: # https://googleapis.dev/python/google-api-core/latest/client_options.html from google.cloud import documentai_v1beta3 def sample_update_dataset_schema(): # Create a client client = documentai_v1beta3.DocumentServiceClient() # Initialize request argument(s) request = documentai_v1beta3.UpdateDatasetSchemaRequest( ) # Make the request response = client.update_dataset_schema(request=request) # Handle the response print(response)
- Parameters
request (Union[google.cloud.documentai_v1beta3.types.UpdateDatasetSchemaRequest, dict]) – The request object. Request for
UpdateDatasetSchema
.dataset_schema (google.cloud.documentai_v1beta3.types.DatasetSchema) –
Required. The name field of the
DatasetSchema
is used to identify the resource to be updated.This corresponds to the
dataset_schema
field on therequest
instance; ifrequest
is provided, this should not be set.update_mask (google.protobuf.field_mask_pb2.FieldMask) –
The update mask applies to the resource.
This corresponds to the
update_mask
field on therequest
instance; ifrequest
is provided, this should not be set.retry (google.api_core.retry.Retry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- Returns
Dataset Schema.
- Return type
- class google.cloud.documentai_v1beta3.services.document_service.pagers.ListDocumentsAsyncPager(method: Callable[[...], Awaitable[google.cloud.documentai_v1beta3.types.document_service.ListDocumentsResponse]], request: google.cloud.documentai_v1beta3.types.document_service.ListDocumentsRequest, response: google.cloud.documentai_v1beta3.types.document_service.ListDocumentsResponse, *, retry: Optional[Union[google.api_core.retry.retry_unary_async.AsyncRetry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ())[source]¶
A pager for iterating through
list_documents
requests.This class thinly wraps an initial
google.cloud.documentai_v1beta3.types.ListDocumentsResponse
object, and provides an__aiter__
method to iterate through itsdocument_metadata
field.If there are more pages, the
__aiter__
method will make additionalListDocuments
requests and continue to iterate through thedocument_metadata
field on the corresponding responses.All the usual
google.cloud.documentai_v1beta3.types.ListDocumentsResponse
attributes are available on the pager. If multiple requests are made, only the most recent response is retained, and thus used for attribute lookup.Instantiates the pager.
- Parameters
method (Callable) – The method that was originally called, and which instantiated this pager.
request (google.cloud.documentai_v1beta3.types.ListDocumentsRequest) – The initial request object.
response (google.cloud.documentai_v1beta3.types.ListDocumentsResponse) – The initial response object.
retry (google.api_core.retry.AsyncRetry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.
- class google.cloud.documentai_v1beta3.services.document_service.pagers.ListDocumentsPager(method: Callable[[...], google.cloud.documentai_v1beta3.types.document_service.ListDocumentsResponse], request: google.cloud.documentai_v1beta3.types.document_service.ListDocumentsRequest, response: google.cloud.documentai_v1beta3.types.document_service.ListDocumentsResponse, *, retry: Optional[Union[google.api_core.retry.retry_unary.Retry, google.api_core.gapic_v1.method._MethodDefault]] = _MethodDefault._DEFAULT_VALUE, timeout: Union[float, object] = _MethodDefault._DEFAULT_VALUE, metadata: Sequence[Tuple[str, str]] = ())[source]¶
A pager for iterating through
list_documents
requests.This class thinly wraps an initial
google.cloud.documentai_v1beta3.types.ListDocumentsResponse
object, and provides an__iter__
method to iterate through itsdocument_metadata
field.If there are more pages, the
__iter__
method will make additionalListDocuments
requests and continue to iterate through thedocument_metadata
field on the corresponding responses.All the usual
google.cloud.documentai_v1beta3.types.ListDocumentsResponse
attributes are available on the pager. If multiple requests are made, only the most recent response is retained, and thus used for attribute lookup.Instantiate the pager.
- Parameters
method (Callable) – The method that was originally called, and which instantiated this pager.
request (google.cloud.documentai_v1beta3.types.ListDocumentsRequest) – The initial request object.
response (google.cloud.documentai_v1beta3.types.ListDocumentsResponse) – The initial response object.
retry (google.api_core.retry.Retry) – Designation of what errors, if any, should be retried.
timeout (float) – The timeout for this request.
metadata (Sequence[Tuple[str, str]]) – Strings which should be sent along with the request as metadata.