# Thalis AI Stack

The Thalis AI stack is an open-source platform designed to bring your original characters to virtual life. The platform is modular and self-hosted, allowing you to craft unique and immersive character experiences.

## Key Features

  • Self-hosting capabilities: Ensure that all data remains under your control, minimizing the risk of data breaches and unwanted surveillance.
  • Modular architecture: Enables easy updates and patches, safeguarding your system against potential vulnerabilities.
  • Custom models and workflows: Support for tailored character personalities, appearances, and traits.
  • Multi-GPU support: Faster processing and distributed deployments for availability and scalability.

## Generators and Tools

The Thalis AI stack uses a variety of open-source tools for inference, including:

  • SpeechT5 for high-quality text-to-speech synthesis
  • ComfyUI for high-resolution image generation using Stable Diffusion and Flux models
  • Ollama for streaming text generation
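
As an illustration of the streaming text generation mentioned above, the following is a minimal sketch of reading a streamed reply from a local Ollama server over its HTTP API. It is not taken from the Thalis AI codebase. It assumes Ollama is listening on its default port (11434), and the model name `llama3` is a placeholder for whatever model you have pulled. The helper names `iter_tokens` and `stream_reply` are hypothetical.

```python
import json
import urllib.request
from typing import Iterable, Iterator

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def iter_tokens(lines: Iterable[bytes]) -> Iterator[str]:
    """Yield text fragments from a newline-delimited JSON stream.

    Each line is one JSON object with a "response" fragment and a
    "done" flag that marks the end of the stream.
    """
    for raw in lines:
        raw = raw.strip()
        if not raw:
            continue  # skip blank keep-alive lines
        chunk = json.loads(raw)
        if chunk.get("response"):
            yield chunk["response"]
        if chunk.get("done"):
            break


def stream_reply(prompt: str, model: str = "llama3") -> Iterator[str]:
    """Send a prompt to Ollama and yield the reply as it is generated.

    The default model name is a placeholder, not a Thalis AI setting.
    """
    body = json.dumps({"model": model, "prompt": prompt, "stream": True}).encode()
    request = urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        # The HTTP response object iterates line by line.
        yield from iter_tokens(response)


# Usage (requires a running Ollama server):
#   for fragment in stream_reply("Introduce yourself in one sentence."):
#       print(fragment, end="", flush=True)
```

Keeping the line parser separate from the network call means the parsing logic can be exercised without a running server.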

## Getting Started

To learn more about the Thalis AI stack, including its architecture, licensing, and community support, please visit our GitHub project page. There, you’ll find detailed documentation, tutorials, and resources to help you get started with creating your own interactive character experiences.

## Comparison with Other Platforms

Compared to other platforms and the individual tools, the Thalis AI stack offers more ways to interact with your characters, while keeping your data private and secure. Support for custom LoRA networks allows you to customize your characters in ways that are not possible with other platforms.

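| Category | Thalis AI | Popular Chatbot Platforms | ComfyUI | Ollama | open-webui |
| --- | --- | --- | --- | --- | --- |
| Self-hosted | Yes | No | Yes | Yes | Yes |
| Privacy & data control | Yes | No | Yes | Yes | Yes |
| Completely open-source | Yes | No | Yes | Yes | Yes |
| Support for desktop GPUs | Yes | No | Yes | Yes | Yes |
| Text chat | Yes | Yes | No (1) | Yes (2) | Yes |
| Audio generation | Yes | Yes | Yes (5) | No | Yes |
| Image generation | Yes | Yes | Yes | No | Yes |
| Per-character image models | Yes | No | Yes (3) | No (3) | No |
| Per-character text models | Yes | Yes | No (3) | Yes (3) | Yes (4) |
| Character LoRA networks | Yes | No | Yes | Yes | No |
| Dynamic workflow generator | Yes | No | No | No | No |
| Image prompt generator | Yes | No | Yes (5) | No | No |
| Total | 12/12 | 4/12 | 8/11 | 6/11 | 6/11 |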

Notes:

  1. ComfyUI has custom nodes that support text generation, but I am not aware of any that allow bidirectional chat
  2. Ollama supports text chat through the command-line tools, but it does not include a web UI
  3. ComfyUI and Ollama do not have character profiles, but they do allow you to choose the model for each chat or image
  4. open-webui does not have character profiles, but it allows you to select the model for each chat
  5. ComfyUI supports audio generation and image prompt generation with custom nodes
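
Notes 3 and 4 distinguish choosing a model for each chat from having a character profile. To make that difference concrete, here is a minimal sketch of what a profile could look like: each character is bound once to a text model, an image model, and a set of LoRA networks, so they do not have to be re-selected for every chat or image. Every name here (`CharacterProfile`, `resolve_models`, the field names, and the model and LoRA identifiers) is hypothetical and is not the actual Thalis AI schema.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class CharacterProfile:
    """Hypothetical per-character settings; not the Thalis AI schema."""

    name: str
    text_model: str                                  # model used for chat
    image_model: str                                 # checkpoint used for images
    loras: list[str] = field(default_factory=list)   # character LoRA networks
    personality: str = ""                            # free-form description


def resolve_models(
    profiles: dict[str, CharacterProfile], character: str
) -> tuple[str, str, list[str]]:
    """Look up the text model, image model, and LoRAs bound to a character."""
    profile = profiles[character]
    return profile.text_model, profile.image_model, list(profile.loras)


# Example data: all identifiers below are placeholders.
profiles = {
    "aria": CharacterProfile(
        name="Aria",
        text_model="example-text-model",
        image_model="example-image-checkpoint",
        loras=["aria-face-lora"],
        personality="Curious and upbeat.",
    )
}

text_model, image_model, loras = resolve_models(profiles, "aria")
```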

If you have another multi-modal AI stack that you would like to have included in this comparison, please let us know. Other self-hosted stacks are especially welcome; we would love to support other privacy-conscious open-source projects.