subcategory
Compute

databricks_node_type Data Source

-> Note If you have a fully automated setup with workspaces created by databricks_mws_workspaces or azurerm_databricks_workspace, please make sure to add depends_on attribute in order to prevent default auth: cannot configure default credentials errors.

Gets the smallest node type for databricks_cluster that fits search criteria, like amount of RAM or number of cores. AWS or Azure. Internally data source fetches node types available per cloud, similar to executing databricks clusters list-node-types, and filters it to return the smallest possible node with criteria.

-> Note This is experimental functionality, which aims to simplify things. In case of wrong parameters given (e.g. min_gpus = 876) or no nodes matching, data source will return cloud-default node type, even though it doesn't match search criteria specified by data source arguments: i3.xlarge for AWS or Standard_D3_v2 for Azure.

Example Usage

data "databricks_node_type" "with_gpu" {
  local_disk  = true
  min_cores   = 16
  gb_per_core = 1
  min_gpus    = 1
}

data "databricks_spark_version" "gpu_ml" {
  gpu = true
  ml  = true
}

resource "databricks_cluster" "research" {
  cluster_name            = "Research Cluster"
  spark_version           = data.databricks_spark_version.gpu_ml.id
  node_type_id            = data.databricks_node_type.with_gpu.id
  autotermination_minutes = 20
  autoscale {
    min_workers = 1
    max_workers = 50
  }
}

Argument Reference

Data source allows you to pick groups by the following attributes

min_memory_gb - (Optional) Minimum amount of memory per node in gigabytes. Defaults to 0.
gb_per_core - (Optional) Number of gigabytes per core available on instance. Conflicts with min_memory_gb. Defaults to 0.
min_cores - (Optional) Minimum number of CPU cores available on instance. Defaults to 0.
min_gpus - (Optional) Minimum number of GPU's attached to instance. Defaults to 0.
local_disk - (Optional) Pick only nodes with local storage. Defaults to false.
local_disk_min_size - (Optional) Pick only nodes that have size local storage greater or equal to given value. Defaults to 0.
category - (Optional, case insensitive string) Node category, which can be one of (depending on the cloud environment, could be checked with databricks clusters list-node-types|jq '.node_types[]|.category'|sort |uniq):
- General Purpose (all clouds)
- General Purpose (HDD) (Azure)
- Compute Optimized (all clouds)
- Memory Optimized (all clouds)
- Memory Optimized (Remote HDD) (Azure)
- Storage Optimized (AWS, Azure)
- GPU Accelerated (AWS, Azure)
photon_worker_capable - (Optional) Pick only nodes that can run Photon workers. Defaults to false.
photon_driver_capable - (Optional) Pick only nodes that can run Photon driver. Defaults to false.
graviton - (boolean, optional) if we should limit the search only to nodes with AWS Graviton CPUs. Default to false.
is_io_cache_enabled - (Optional) . Pick only nodes that have IO Cache. Defaults to false.
support_port_forwarding - (Optional) Pick only nodes that support port forwarding. Defaults to false.

Attribute Reference

Data source exposes the following attributes:

id - node type, that can be used for databricks_job, databricks_cluster, or databricks_instance_pool.

Related Resources

The following resources are used in the same context:

End to end workspace management guide.
databricks_cluster to create Databricks Clusters.
databricks_cluster_policy to create a databricks_cluster policy, which limits the ability to create clusters based on a set of rules.
databricks_instance_pool to manage instance pools to reduce cluster start and auto-scaling times by maintaining a set of idle, ready-to-use instances.
databricks_job to manage Databricks Jobs to run non-interactive code in a databricks_cluster.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

node_type.md

node_type.md

databricks_node_type Data Source

Example Usage

Argument Reference

Attribute Reference

Related Resources

Files

node_type.md

Latest commit

History

node_type.md

File metadata and controls

databricks_node_type Data Source

Example Usage

Argument Reference

Attribute Reference

Related Resources