Skip to content

Dorylus: Affordable, Scalable, and Accurate GNN Training

Notifications You must be signed in to change notification settings

MachineLearningSystem/dorylus

 
 

Repository files navigation

Dorylus: Affordable, Scalable, and Accurate GNN Training with Distributed CPU Servers and Serverless Threads

This is Dorylus, a Scalable, Resource-efficient & Affordable computation system for Graph Neural Networks, built upon an architecture combining cheap data servers on AWS EC2 with serverless computing on AWS Lambda Threads.

Dataserver originally is a push-based ASPIRE implementation, a cleaned up version of gift (forked on July 06, 2016). Implemented streaming-like processing as in Tornado (SIGMOD'16) paper.

Now the main logic of the engine has been completely simplified, and we integrate it with AWS Lambda threads. Ultimate goal is to achieve "Affordable AI" with the benefit of cheap scalability brought by serverless computing.

Check out our OSDI'21 paper for details of the design.

User Guide

Check our Wiki page for managing your EC2 clusters, building & running Dorylus.

About

Dorylus: Affordable, Scalable, and Accurate GNN Training

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • C++ 65.0%
  • Python 20.3%
  • Shell 6.6%
  • Cuda 6.0%
  • CMake 1.8%
  • C 0.2%
  • Makefile 0.1%