Chris edited this page Jul 16, 2024 · 4 revisions

Overview

GPT-4o-oIP helps developers build telephony use cases for state-of-the-art multimodal AI models with Asterisk, an open-source private branch exchange (PBX). The project aims to reduce setup time and cost by providing preconfigured containers that run locally without a subscription. It currently supports Google Dialogflow ES, with plans to add OpenAI's GPT-4o and Kyutai's Moshi AI once they are released to developers.

Features

AI Agent

  • GPT-4o by OpenAI: still waiting on its release to developers
  • Moshi AI by Kyutai: still waiting on its release to developers
  • Currently configured to work with Dialogflow ES by Google, which offers:
    • Form-based bot builder
    • Natural language understanding (NLU) models
    • One-click telephony integration (but you're here to run Asterisk locally, right?)
    • Speech recognition and speech synthesis models
    • 40+ template agents for building conversations for dining out, hotel booking, navigation, IoT, and more
    • Integration into popular channels, such as Google Assistant, Slack, Twitter, and others
    • Performance and custom dashboards

Asterisk

  • Asterisk powering IP PBX systems and VoIP gateways
  • PrivateDial Lite, a customizable Asterisk configuration based on mlan's PrivateDial
  • AutoBan, a built-in intrusion detection and prevention system
  • Additionally provides the G.729 and G.723.1 audio codecs
  • Small image size based on Alpine Linux
  • Automatic integration of Let’s Encrypt TLS certificates using the reverse proxy Traefik
  • Persistent storage facilitated by configuration and run data being consolidated under /srv
  • Container audio using the pulse socket of the host
  • Uses runit for its init scheme and service supervision
  • Health check
  • Logs directed to the Docker daemon with a configurable level
  • Multi-stage build providing the images mini, base, full, and xtra