Skip to content

Latest commit

 

History

History
109 lines (64 loc) · 2.38 KB

File metadata and controls

109 lines (64 loc) · 2.38 KB
parent title nav_exclude
Infrastructure Components
AWS Glue-Crawler
false

AWS Glue-Crawler

source = "git::https://github.com/slalom-ggp/dataops-infra/tree/main/components/aws/glue-crawler?ref=main"

Overview

Glue is AWS's fully managed extract, transform, and load (ETL) service. A Glue crawler is used to access a data store and create table definitions. This can be used in conjuction with Amazon Athena to query flat files in S3 buckets using SQL.

Requirements

No requirements.

Providers

The following providers are used by this module:

  • aws

Required Inputs

The following input variables are required:

name_prefix

Description: Standard name_prefix module input. (Prefix counts towards 64-character max length for certain resource types.)

Type: string

environment

Description: Standard environment module input.

Type:

object({
    vpc_id          = string
    aws_region      = string
    public_subnets  = list(string)
    private_subnets = list(string)
  })

resource_tags

Description: Standard resource_tags module input.

Type: map(string)

glue_database_name

Description: Name of the Glue catalog database.

Type: string

glue_crawler_name

Description: Name of the Glue crawler.

Type: string

s3_target_bucket_name

Description: S3 target bucket for Glue crawler.

Type: string

target_path

Description: Path to crawler target file(s).

Type: string

Optional Inputs

No optional input.

Outputs

The following outputs are exported:

glue_crawler_name

Description: The name of the Glue crawler.


Source Files

Source code for this module is available using the links below.


NOTE: This documentation was auto-generated using terraform-docs and s-infra from slalom.dataops. Please do not attempt to manually update this file.