Introduction

Dagster is a system for building modern data applications.

Dagster combines a programming model with development tools so that infrastructure engineers, data engineers, and data scientists can collaborate on processing and producing trusted, reliable data.

Install

To get started:

pip install dagster dagit

This installs two modules:

dagster | The core programming model and abstraction stack; stateless, single-node, single-process and multi-process execution engines; and a CLI tool for driving those engines.
dagit | A UI and rich development environment for Dagster, including a DAG browser, a type-aware config editor, and a streaming execution interface.
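
To make the phrase "single-process execution engine" concrete, here is a toy sketch in plain Python. It is not Dagster's API or implementation: the names run_dag, steps, and deps are hypothetical, and the standard library's graphlib module (Python 3.9+) stands in for the scheduling a real engine would do. It only illustrates the general idea of running the steps of a DAG in dependency order inside one process.

```python
from graphlib import TopologicalSorter


def run_dag(steps, deps):
    """Run every step once, after all of its upstream steps, in this process.

    steps: maps a step name to a callable (hypothetical structure).
    deps:  maps a step name to the names of the steps it depends on.
    """
    results = {}
    # static_order() yields each node only after all of its predecessors.
    for name in TopologicalSorter(deps).static_order():
        upstream = [results[d] for d in deps.get(name, ())]
        results[name] = steps[name](*upstream)
    return results


# A three-step DAG: load -> double -> total
steps = {
    "load": lambda: [1, 2, 3],
    "double": lambda rows: [2 * r for r in rows],
    "total": lambda rows: sum(rows),
}
deps = {"load": (), "double": ("load",), "total": ("double",)}

results = run_dag(steps, deps)
print(results["total"])
```

Each step receives the outputs of its upstream steps as arguments, so total runs only after double, which runs only after load. A multi-process engine would follow the same ordering but could run independent steps in parallel.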

Learn

Next, jump right into our tutorial, or read our complete documentation. If you're actively using Dagster or have questions about getting started, we'd love to hear from you; come join our Slack!

Contributing

For details on contributing or running the project for development, check out our contributing guide.

Integrations

Dagster works with the tools and systems that you're already using with your data, including:

Integration | Dagster Library | Description
Apache Airflow | dagster-airflow | Allows Dagster pipelines to be scheduled and executed, either containerized or uncontainerized, as Apache Airflow DAGs.
Apache Spark | dagster-spark, dagster-pyspark | Libraries for interacting with Apache Spark and PySpark.
Dask | dagster-dask | Provides a Dagster integration with Dask / Dask.Distributed.
Datadog | dagster-datadog | Provides a Dagster resource for publishing metrics to Datadog.
Great Expectations | Expectations in Dagster | The Great Expectations framework is designed to promote data quality checks for data warehouses. In Dagster, expectations are a first-class citizen.
Jupyter / Papermill | dagstermill | Built on the papermill library, dagstermill is meant for integrating productionized Jupyter notebooks into Dagster pipelines.
PagerDuty | dagster-pagerduty | A library for creating PagerDuty alerts from Dagster workflows.
Snowflake | dagster-snowflake | A library for interacting with the Snowflake Data Warehouse.
Cloud Providers
AWS | dagster-aws | A library for interacting with Amazon Web Services. Provides integrations with S3, EMR, and (coming soon!) Redshift.
GCP | dagster-gcp | A library for interacting with Google Cloud Platform. Provides integrations with BigQuery and Cloud Dataproc.

This list is growing as we are actively building more integrations, and we welcome contributions!

Example Projects

Several example projects are provided under the examples folder demonstrating how to use Dagster, including:

examples/airline-demo: A substantial demo project illustrating how these tools can be used together to manage a realistic data pipeline.
examples/event-pipeline-demo: An example illustrating a typical web event processing pipeline with S3, Scala Spark, and Snowflake.

