Class: Google::Apis::DataprocV1::PySparkJob

Inherits:
Object
Includes:
Core::Hashable, Core::JsonObjectSupport
Defined in:
lib/google/apis/dataproc_v1/classes.rb,
lib/google/apis/dataproc_v1/representations.rb

Overview

A Dataproc job for running Apache PySpark (https://spark.apache.org/docs/0.9.0/python-programming-guide.html) applications on YARN.

Constructor Details

#initialize(**args) ⇒ PySparkJob

Returns a new instance of PySparkJob.



# File 'lib/google/apis/dataproc_v1/classes.rb', line 4511

def initialize(**args)
  update!(**args)
end

Instance Attribute Details

#archive_uris ⇒ Array<String>

Optional. HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip. Corresponds to the JSON property archiveUris

Returns:

  • (Array<String>)


# File 'lib/google/apis/dataproc_v1/classes.rb', line 4465

def archive_uris
  @archive_uris
end
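
The supported archive types listed above can be checked before a job is submitted. The following is a minimal sketch, not part of the gem: the constant, both helper methods, and the example bucket are hypothetical, and matching is a simple case-sensitive suffix test on the URI string.

```ruby
# Hypothetical helper, not part of google-apis-dataproc_v1.
# The extension list is copied from the archive_uris description.
SUPPORTED_ARCHIVE_EXTENSIONS = %w[.jar .tar .tar.gz .tgz .zip].freeze

# Returns true when the URI ends with one of the supported extensions.
def supported_archive_uri?(uri)
  SUPPORTED_ARCHIVE_EXTENSIONS.any? { |ext| uri.to_s.end_with?(ext) }
end

# Splits a list of URIs into [supported, unsupported].
def partition_archive_uris(uris)
  uris.partition { |uri| supported_archive_uri?(uri) }
end

# Example bucket and object names are made up for illustration.
supported, unsupported = partition_archive_uris(
  [
    'gs://example-bucket/deps.tar.gz',
    'gs://example-bucket/env.zip',
    'gs://example-bucket/notes.txt'
  ]
)
```

Here `supported` holds the `.tar.gz` and `.zip` URIs and `unsupported` holds the `.txt` URI, so a caller can report the rejected entries before building the job.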

#args ⇒ Array<String>

Optional. The arguments to pass to the driver. Do not include arguments, such as --conf, that can be set as job properties, since a collision may occur that causes an incorrect job submission. Corresponds to the JSON property args

Returns:

  • (Array<String>)


# File 'lib/google/apis/dataproc_v1/classes.rb', line 4472

def args
  @args
end
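
One way to follow the advice above is to move any --conf pairs out of the driver arguments and into a properties hash before building the job. The sketch below assumes the two common spellings, "--conf key=value" and "--conf=key=value"; the method name and sample values are hypothetical and not part of the gem.

```ruby
# Hypothetical helper, not part of google-apis-dataproc_v1.
# Separates "--conf key=value" pairs from the remaining driver arguments.
# Returns [driver_args, properties].
def split_conf_args(argv)
  driver_args = []
  properties = {}
  i = 0
  while i < argv.length
    arg = argv[i]
    if arg == '--conf' && argv[i + 1].to_s.include?('=')
      # Two-token form: "--conf", "key=value"
      key, value = argv[i + 1].split('=', 2)
      properties[key] = value
      i += 2
    elsif arg.start_with?('--conf=') && arg.count('=') >= 2
      # Single-token form: "--conf=key=value"
      key, value = arg.delete_prefix('--conf=').split('=', 2)
      properties[key] = value
      i += 1
    else
      # Anything else stays a driver argument.
      driver_args << arg
      i += 1
    end
  end
  [driver_args, properties]
end

driver_args, properties = split_conf_args(
  ['--input', 'gs://example-bucket/in',
   '--conf', 'spark.executor.memory=4g',
   '--conf=spark.executor.cores=2']
)
# driver_args => ["--input", "gs://example-bucket/in"]
# properties  => {"spark.executor.memory"=>"4g", "spark.executor.cores"=>"2"}
```

The two results can then be passed as the args and properties attributes respectively, so no setting is specified in both places.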

#file_uris ⇒ Array<String>

Optional. HCFS URIs of files to be placed in the working directory of each executor. Useful for naively parallel tasks. Corresponds to the JSON property fileUris

Returns:

  • (Array<String>)


# File 'lib/google/apis/dataproc_v1/classes.rb', line 4478

def file_uris
  @file_uris
end

#jar_file_uris ⇒ Array<String>

Optional. HCFS URIs of jar files to add to the CLASSPATHs of the Python driver and tasks. Corresponds to the JSON property jarFileUris

Returns:

  • (Array<String>)


# File 'lib/google/apis/dataproc_v1/classes.rb', line 4484

def jar_file_uris
  @jar_file_uris
end

#logging_config ⇒ Google::Apis::DataprocV1::LoggingConfig

The runtime logging config of the job. Corresponds to the JSON property loggingConfig

Returns:

  • (Google::Apis::DataprocV1::LoggingConfig)



# File 'lib/google/apis/dataproc_v1/classes.rb', line 4489

def logging_config
  @logging_config
end

#main_python_file_uri ⇒ String

Required. The HCFS URI of the main Python file to use as the driver. Must be a .py file. Corresponds to the JSON property mainPythonFileUri

Returns:

  • (String)


# File 'lib/google/apis/dataproc_v1/classes.rb', line 4495

def main_python_file_uri
  @main_python_file_uri
end

#properties ⇒ Hash<String,String>

Optional. A mapping of property names to values, used to configure PySpark. Properties that conflict with values set by the Dataproc API might be overwritten. Can include properties set in /etc/spark/conf/spark-defaults.conf and classes in user code. Corresponds to the JSON property properties

Returns:

  • (Hash<String,String>)


# File 'lib/google/apis/dataproc_v1/classes.rb', line 4503

def properties
  @properties
end

#python_file_uris ⇒ Array<String>

Optional. HCFS file URIs of Python files to pass to the PySpark framework. Supported file types: .py, .egg, and .zip. Corresponds to the JSON property pythonFileUris

Returns:

  • (Array<String>)


# File 'lib/google/apis/dataproc_v1/classes.rb', line 4509

def python_file_uris
  @python_file_uris
end
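
Each attribute above names the JSON property it corresponds to, and in every case the JSON name is the lowerCamelCase form of the snake_case Ruby name. The sketch below only illustrates that naming convention; the helper and constant are hypothetical, and this is not necessarily how the gem performs the mapping internally.

```ruby
# Hypothetical helper, not part of google-apis-dataproc_v1.
# Converts a snake_case attribute name to its lowerCamelCase JSON name.
def json_property_name(attribute)
  head, *tail = attribute.to_s.split('_')
  ([head] + tail.map(&:capitalize)).join
end

# Attribute names taken from the sections above.
PY_SPARK_JOB_ATTRIBUTES = %i[
  archive_uris args file_uris jar_file_uris logging_config
  main_python_file_uri properties python_file_uris
].freeze

# Build a lookup table from Ruby attribute name to JSON property name.
json_names = PY_SPARK_JOB_ATTRIBUTES.map { |a| [a, json_property_name(a)] }.to_h

json_names.each { |ruby_name, json_name| puts "#{ruby_name} -> #{json_name}" }
```

A table like `json_names` can help when comparing a Ruby object against a raw API request or response body.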

Instance Method Details

#update!(**args) ⇒ Object

Updates the attributes of this object from the given keyword arguments. Attributes whose keys are not passed are left unchanged.



# File 'lib/google/apis/dataproc_v1/classes.rb', line 4516

def update!(**args)
  @archive_uris = args[:archive_uris] if args.key?(:archive_uris)
  @args = args[:args] if args.key?(:args)
  @file_uris = args[:file_uris] if args.key?(:file_uris)
  @jar_file_uris = args[:jar_file_uris] if args.key?(:jar_file_uris)
  @logging_config = args[:logging_config] if args.key?(:logging_config)
  @main_python_file_uri = args[:main_python_file_uri] if args.key?(:main_python_file_uri)
  @properties = args[:properties] if args.key?(:properties)
  @python_file_uris = args[:python_file_uris] if args.key?(:python_file_uris)
end
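
Because each assignment above is guarded by args.key?, an omitted key leaves the attribute alone, while a key passed explicitly as nil clears it. The sketch below demonstrates this with a stand-in class so that it runs without the gem installed; PySparkJobSketch and the example URI are hypothetical, and only two of the eight attributes are copied.

```ruby
# Stand-in for illustration only; not the real Google::Apis::DataprocV1::PySparkJob.
# It copies the initialize/update! pattern shown above for two attributes.
class PySparkJobSketch
  attr_reader :main_python_file_uri, :args

  def initialize(**args)
    update!(**args)
  end

  def update!(**args)
    @main_python_file_uri = args[:main_python_file_uri] if args.key?(:main_python_file_uri)
    @args = args[:args] if args.key?(:args)
  end
end

job = PySparkJobSketch.new(
  main_python_file_uri: 'gs://example-bucket/main.py',
  args: ['--verbose']
)

# Only :args is passed, so main_python_file_uri keeps its value.
job.update!(args: ['--quiet'])
uri_after_partial_update = job.main_python_file_uri

# The :args key is present with a nil value, so the attribute is cleared.
job.update!(args: nil)
args_after_nil_update = job.args
```

The guard tests for the presence of the key rather than the truthiness of the value, which is why passing nil is not the same as omitting the argument.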