Systemprompt MCP Client

Website | Documentation | Blog | Get API Key

Free and Open Source Software: A modern voice-controlled AI interface powered by Google Gemini and Anthropic's MCP (Model Context Protocol). Transform how you interact with AI through natural speech and multimodal inputs.

If you like this project, please consider starring it on GitHub and sharing it. It helps me get more visibility and support for this project and keep the lights on.

A modern Vite + TypeScript application that enables voice-controlled AI workflows through MCP (Model Context Protocol). It combines Google Gemini's multimodal capabilities with MCP's extensible tooling system, so you can drive AI tools by speech as well as text.

🎥 Demo & Showcase

Voice-controlled AI interactions
Multimodal input processing
Tool execution and workflow automation
Real-time voice synthesis

Watch our video demonstration to see Systemprompt MCP Client in action:

Voice CMS and Agent creation: Watch Demo Video
Voice Agent with Google: Watch Demo Video

🎯 Why Systemprompt MCP?

Transform your AI interactions with a powerful voice-first interface that combines the best of:

Google Gemini's Multimodal AI: Understand and process text, voice, and visual inputs naturally
MCP (Model Context Protocol): Execute complex AI workflows with a robust tooling system
Voice-First Design: Control everything through natural speech, making AI interaction more intuitive

Perfect for:

Developers building voice-controlled AI applications
Teams needing a flexible AI workflow orchestration system
Organizations wanting to leverage Google Gemini's capabilities with extensible tooling

🎯 Core Features

Voice & Multimodal Intelligence

Natural Voice Control: Speak naturally to control AI workflows and execute commands
Multimodal Understanding: Process text, voice, and visual inputs simultaneously
Real-time Voice Synthesis: Get instant audio responses from your AI interactions

AI Workflow Orchestration

Extensible Tool System: Add custom tools and workflows through MCP
Workflow Automation: Chain multiple AI operations with voice commands
State Management: Robust handling of complex, multi-step AI interactions

Developer Experience

Modern Tech Stack: Built with Vite, React, TypeScript, and NextUI
Type Safety: Full TypeScript support with comprehensive type definitions
Hot Module Replacement: Fast development with instant feedback
Comprehensive Testing: Built-in testing infrastructure with high coverage

Enterprise Ready

Secure: Built-in security best practices for API key management
Scalable: Modular architecture supporting multiple LLM providers
Configurable: Extensive configuration options for different environments

🏗️ Architecture

The system follows a modular, feature-based architecture:

```mermaid
graph TD
    A[Web Interface] --> B[Feature Modules]
    B --> C[Multimodal Agent]
    B --> D[LLM Registry]
    B --> E[Server Management]

    C --> F[Voice Control]
    C --> G[Prompt Execution]

    D --> H[Model Configuration]
    D --> I[LLM Integration]

    E --> J[Server Config]
    E --> K[Prompt Management]

    style A fill:#f9f,stroke:#333,stroke-width:2px
    style B fill:#bbf,stroke:#333,stroke-width:2px
    style C,D,E fill:#ddf,stroke:#333,stroke-width:2px
```

Key Components

Multimodal Agent: Handles voice recognition, synthesis, and multimodal processing
LLM Registry: Manages different language models and their configurations
Server Management: Handles MCP server connections and tool orchestration
Voice Control: Processes natural language commands and converts them to actions
Prompt Management: Handles system prompts and their execution
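
As a rough illustration of the LLM Registry component described above, here is a minimal sketch of a registry that stores model configurations and tracks which one is active. The `ModelEntry` fields and the `LlmRegistrySketch` class are hypothetical names chosen for this example; they are not taken from the project's source.

```typescript
// Hypothetical model entry; field names are illustrative, not the project's types.
interface ModelEntry {
  id: string; // e.g. "gemini"
  provider: string; // e.g. "google"
  settings: Record<string, unknown>; // provider-specific options
}

class LlmRegistrySketch {
  private readonly models = new Map<string, ModelEntry>();
  private activeId: string | null = null;

  // Add or replace a model configuration.
  register(entry: ModelEntry): void {
    this.models.set(entry.id, entry);
    // The first registered model becomes the default.
    if (this.activeId === null) this.activeId = entry.id;
  }

  // Switch the active model; unknown ids are rejected.
  select(id: string): void {
    if (!this.models.has(id)) throw new Error(`Unknown model: ${id}`);
    this.activeId = id;
  }

  // Return the configuration of the currently active model.
  active(): ModelEntry {
    const entry = this.activeId === null ? undefined : this.models.get(this.activeId);
    if (!entry) throw new Error("No model registered");
    return entry;
  }
}
```

Keeping the registry as a plain map keyed by model id is one simple way to support multiple LLM providers side by side; the real implementation may differ.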

🚀 Getting Started

Prerequisites

Node.js 18.x or higher (Vite 6, used by this project, does not run on Node.js 16)
npm 7.x or higher
A modern browser with Web Speech API support
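
To check the browser prerequisite before starting, a small feature-detection helper can be useful. The sketch below assumes the app relies on the standard Web Speech API globals (`SpeechRecognition` or the prefixed `webkitSpeechRecognition`, plus `speechSynthesis`); the helper name `detectSpeechSupport` and its types are hypothetical.

```typescript
// Minimal shape of the global object we inspect; only the names that
// browsers expose for the Web Speech API are listed.
interface SpeechGlobals {
  SpeechRecognition?: unknown;
  webkitSpeechRecognition?: unknown;
  speechSynthesis?: unknown;
}

interface SpeechSupport {
  recognition: boolean; // speech-to-text available
  synthesis: boolean; // text-to-speech available
}

// Hypothetical helper: report which halves of the Web Speech API exist.
function detectSpeechSupport(scope: SpeechGlobals): SpeechSupport {
  return {
    recognition:
      typeof scope.SpeechRecognition === "function" ||
      typeof scope.webkitSpeechRecognition === "function",
    synthesis:
      typeof scope.speechSynthesis === "object" && scope.speechSynthesis !== null,
  };
}

// In a browser: detectSpeechSupport(window as unknown as SpeechGlobals)
```

Passing the global object in as a parameter keeps the helper testable outside a browser.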

Development Setup

Clone the repository:

```
git clone https://github.com/Ejb503/multimodal-mcp-client.git
cd multimodal-mcp-client
```

Install dependencies:
```
npm install
```
Set up configuration files:
```
# Navigate to config directory
cd config

# Create local configuration files from templates
cp mcp.config.default.json mcp.config.json
cp agent.config.default.json agent.config.json
cp llm.config.default.json llm.config.json
```
Required Configuration:
- Get a Gemini API key from Google AI Studio
- Add it to llm.config.json in the apiKey field
- The app will not start without a valid API key in llm.config.json
Edit the other configuration files to add your specific settings:
- mcp.config.json: Configure MCP server connections
- agent.config.json: Set up agent configurations
Optional: You can get a free Systemprompt API key from systemprompt.io/console or configure any custom MCP server of your choice in mcp.config.json. With an API key, you can also use the systemprompt-mcp-core extension which provides additional agent management and prompt versioning capabilities.
Start the development server:
```
npm run dev
```
The development server will be available at http://localhost:5173

Build for production:

```
npm run build
npm run preview  # Preview the production build locally
```
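
Because the app will not start without a valid key in llm.config.json, it can help to validate the file before running `npm run dev`. The following is a minimal sketch that assumes `apiKey` is a top-level string field; the README confirms only the field name, so the rest of the shape and the function name `readGeminiApiKey` are assumptions.

```typescript
// Assumed shape: the README only confirms that llm.config.json has an
// `apiKey` field; it is treated here as a top-level string, and any
// other fields are ignored.
function readGeminiApiKey(rawJson: string): string {
  const parsed: unknown = JSON.parse(rawJson);
  if (typeof parsed !== "object" || parsed === null) {
    throw new Error("llm.config.json must contain a JSON object");
  }
  const apiKey = (parsed as Record<string, unknown>).apiKey;
  if (typeof apiKey !== "string" || apiKey.trim() === "") {
    throw new Error("llm.config.json is missing a non-empty apiKey");
  }
  // Trim stray whitespace that often sneaks in when pasting a key.
  return apiKey.trim();
}
```

This only checks that a key is present; whether the key is accepted is decided by the Gemini API at request time.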

🛠️ Tech Stack

Frontend: React 18, TypeScript, Vite 6
UI Components: NextUI, Tailwind CSS, Framer Motion
State Management: Zustand
Testing: Vitest, Testing Library
AI Integration: Google Generative AI SDK
MCP Protocol: @modelcontextprotocol/sdk
Development: ESLint, TypeScript 5.6

📦 Key Features

🎙️ Voice Control System

Natural language command processing
Real-time voice synthesis
Multi-language support
Voice activity detection

🤖 AI Integration

Google Gemini integration
Multimodal input processing
Real-time AI responses
Custom prompt management

🔧 MCP Tools

SSE and stdio server support
Custom tool creation
Workflow automation
State persistence

💼 Enterprise Features

Secure API key management
Multiple server configurations
Extensible architecture
Comprehensive logging

🧪 Testing & Quality

```
# Run tests
npm test

# Watch mode
npm run test:watch

# Coverage report
npm run test:coverage
```

📈 Version History

v0.3.6 - Current release
- Enhanced voice processing
- Updated to Vite 6
- Improved TypeScript support
- New UI components

🤝 Contributing

We welcome contributions! See our Contributing Guide for details.

🔐 Security

Secure API key handling
Environment-based configuration
Regular security updates
Protected server endpoints

📞 Support

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Google Gemini team for their powerful multimodal AI capabilities
Model Context Protocol (MCP) community
React and TypeScript communities
NextUI and Tailwind CSS teams
All contributors and maintainers

🔗 Resources

💪 Sponsored by Systemprompt

This project is proudly sponsored and maintained by Systemprompt. We're committed to advancing the field of AI tooling and making powerful AI interfaces accessible to everyone.

🚀 Extensions in Development

We're actively working on expanding the capabilities of Systemprompt MCP Client with exciting extensions:

Custom Tool Builder: Create and deploy your own MCP tools
Enterprise Workflow Templates: Pre-built workflows for common business scenarios
Advanced Voice Processing: Enhanced voice recognition and synthesis capabilities
Team Collaboration Features: Multi-user support and shared workflows

Stay tuned for updates and new releases! Follow us on GitHub or join our Discord community for the latest news.

🔌 Installing Extensions

To install extensions, follow these steps:

Navigate to the extensions folder:
```
cd extensions
```
Clone the desired extension repository:
```
git clone <repository-url>
```
Follow the installation instructions provided in the cloned repository.
Update the configuration:
- Add an entry for the extension's Node or Python server to config/mcp.config.json or config/mcp.config.default.json.
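
The configuration update in the last step might look like the sketch below. It assumes the config file follows the common `mcpServers` convention (a map from server name to a command and its arguments); check config/mcp.config.default.json for the real structure. The entry point path, the environment variable name, and the `registerExtension` helper are all illustrative.

```typescript
// Hypothetical config shape, modeled on the common `mcpServers` convention;
// check config/mcp.config.default.json for the real structure.
interface StdioServerEntry {
  command: string; // runtime to launch, e.g. "node" or "python"
  args: string[]; // path to the extension's entry point plus any flags
  env?: Record<string, string>;
}

interface McpConfigSketch {
  mcpServers: Record<string, StdioServerEntry>;
}

// Return a new config with the extension registered; the input is not mutated.
function registerExtension(
  current: McpConfigSketch,
  serverId: string,
  entry: StdioServerEntry
): McpConfigSketch {
  if (Object.prototype.hasOwnProperty.call(current.mcpServers, serverId)) {
    throw new Error(`MCP server "${serverId}" is already configured`);
  }
  return { ...current, mcpServers: { ...current.mcpServers, [serverId]: entry } };
}

const updatedConfig = registerExtension({ mcpServers: {} }, "systemprompt-agent-server", {
  command: "node",
  args: ["extensions/systemprompt-agent-server/build/index.js"], // illustrative path
  env: { SYSTEMPROMPT_API_KEY: "<your-api-key>" }, // illustrative variable name
});
console.log(JSON.stringify(updatedConfig, null, 2));
```

The printed JSON is the kind of fragment you would merge into the config file by hand; the cloned extension's own instructions take precedence over this sketch.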

Extensions

systemprompt-agent-server


A specialized Model Context Protocol (MCP) server that enables you to create, manage, and extend AI agents through a powerful prompt and tool management system. This server integrates with systemprompt.io to provide seamless creation, management, and versioning of system prompts through MCP. It works in conjunction with the multimodal-mcp-client to provide a complete voice-powered AI workflow solution.

An API key is required to use this server. It is currently free, although this may change in the future. You can get one at systemprompt.io/console.

🌐 systemprompt-mcp-google Extension

A specialized Model Context Protocol (MCP) server that gives AI agents access to Gmail, Google Calendar, and other Google services through MCP, so these services can be used in your AI workflows.

Prerequisites

Systemprompt API Key: Sign up at systemprompt.io/console and create a new API key.
MCP-Compatible Client: Use the Systemprompt MCP Client or any other MCP-compatible client.
Google Cloud Project: Set up a Google Cloud account, enable API access, and configure OAuth2 credentials.

Setup

Google Cloud Setup:
- Create a project in Google Cloud Console.
- Enable Gmail, Calendar, and Drive APIs.
- Create OAuth2 credentials and download the JSON file as credentials/google-credentials.json.
Server Configuration:
- Install the package: npm install systemprompt-mcp-google.
- Create the credentials directory: mkdir -p credentials.
- Run the authentication script: npm run auth-google.

Features

Gmail Integration: Read, send, and manage emails.
Calendar Integration: Create and manage events.
MCP Integration: Standard MCP interface with structured command responses.

Usage

Through MCP Client: Use any MCP client to send commands to this server for Gmail and Calendar operations.

For detailed setup and usage instructions, refer to the systemprompt-mcp-google documentation.
