A developer sample written in Angular demonstrating Gemini multimodal (image and audio) input and understanding. The user enters a prompt and the app generates images via Vertex AI image generation, which the user can then preview in a three-dimensional gallery. The user can also ask a question about the images in a text input, and the app reads Gemini's answer aloud using the speech synthesis interface of the Web Speech API.
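As a rough illustration of the read-aloud step, here is a minimal TypeScript sketch using the browser's built-in `SpeechSynthesis` interface. The function name and wiring are illustrative only and are not taken from the sample's actual code.

```typescript
// Minimal sketch: reading Gemini's answer aloud with the browser's built-in
// SpeechSynthesis interface (part of the Web Speech API).
// The function name and settings below are illustrative, not the sample's code.
function speakAnswer(answer: string): void {
  // Guard against browsers that do not expose speech synthesis.
  if (!('speechSynthesis' in window)) {
    console.warn('Speech synthesis is not supported in this browser.');
    return;
  }
  const utterance = new SpeechSynthesisUtterance(answer);
  utterance.lang = 'en-US'; // language of the spoken answer
  utterance.rate = 1.0;     // normal speaking rate
  window.speechSynthesis.cancel();         // stop any ongoing speech
  window.speechSynthesis.speak(utterance); // queue the new utterance
}
```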
- Node.js and npm
- Download and install Node.js and npm: https://docs.npmjs.com/downloading-and-installing-node-js-and-npm
- Gemini API key
- Launch Google AI Studio: https://aistudio.google.com/
- Click “Get API Key”
Install the dependencies and run the app:
npm i
npm start
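If the project uses the standard Angular CLI setup, `npm start` serves the app at http://localhost:4200 by default.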
In the text box with the placeholder "API key", enter your Gemini API key. Instructions on how to use the app are available under "Instructions" in the user interface.
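For context, the sketch below shows one way the API key entered in the UI could be passed to the Gemini JavaScript SDK (`@google/generative-ai`). This is an assumption for illustration; the sample may use a different SDK, model name, or call pattern.

```typescript
// Illustrative sketch (not the sample's actual code): using a Gemini API key
// entered in the UI with the @google/generative-ai SDK.
// The model name and prompt below are placeholders.
import { GoogleGenerativeAI } from '@google/generative-ai';

async function askGemini(apiKey: string, question: string): Promise<string> {
  const genAI = new GoogleGenerativeAI(apiKey);
  const model = genAI.getGenerativeModel({ model: 'gemini-1.5-flash' });
  // Send the user's question and return the text of Gemini's answer.
  const result = await model.generateContent(question);
  return result.response.text();
}
```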