Overview

This project uses AI to generate art based on text prompts which can then be placed in your room using an AR application on android.

There are two main components to setup

The model server running the AI art generation service
The android app to get user input and render the image in AR

Architecture

Unity

The Unity application has two scenes

UI - responsible for collecting the text input. The main class which manages communication with the back end servers is the ModelService.cs file
ARArtScene - responsible for rendering the art. The main class which controls the image placement is TapToPlaceImage.cs

Model

The model service uses two models Vector Quantized Generative Adverserial Network (VQGAN) for image generation and Contrastive Language Image Pre-training (CLIP) to guide VQGAN into matching the text description. These are wrapped into a REST based service using the FastAPI framwork.

The server and model setup is handled using a Docker image based on NVIDIA's PyTorch base image and leveraging code from NerdyRodent's github VQGAN+CLIP repo.

Installation

Model Server

See model server README.md
Adjust TOKEN constant in server/app/app.py to be a unique string

Android App

Install Unity Hub (not beta version)
Then download and install Unity Version 2020.3.1 - Download Archive
Open project (adjust SERVER_TOKEN and SERVER_URL constants to match deployed back-end server configuration)
Connect your phone to your computer with USB debugging enabled and choose File->Build and Run to build and run the game on your device

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Overview

Architecture

Unity

Model

Installation

Model Server

Android App

Files

README.md

Latest commit

History

README.md

File metadata and controls

Overview

Architecture

Unity

Model

Installation

Model Server

Android App