This repository provides a reference architecture followed by end-to-end blueprints, scripts and a suite of Terraform modules for Google Cloud Platform (GCP), designed to automate the creation, governance and observability of a modern and robust data repository for educational institutions, looking into becoming a data-driven organization and so adhering digital education.
Some key differentiators of this approach are:
- Organization-wide landing zone used to bootstrap the deployment of data foundations, the actual infrastructure (BigQuery-based) in which the whole organization's data (structured and unstructured) will be sitting.
- Acceleration and automation of data ingestion through built-in connectors (Composer pipelines) to some of the most popular educational tools in the market, with Moodle being one of them.
- Acceleration and automation of data ingestion from Restful Open APIs via built-in connectors (Cloud Functions).
- Easily customizable via lean Terraform modules.
- Ran in a scenario where only pieces of it would be suitable? No problem. You can pick and choose whatever bits work best for you.
Important: The whole repository is meant to be cloned as a single unit to seed production usage.