Open WebUI + Hermes Agent: Build Your Own Self-Hosted ChatGPT (2026)

Q: What is Hermes Agent?

Hermes Agent is a self-improving AI agent framework built by Nous Research. It has 118,000+ GitHub stars and features persistent memory, autonomous skill creation via GEPA, scheduled automation, and support for 200+ AI models. Unlike basic LLM tools, Hermes learns from every interaction and gets better over time.

Q: Is Open WebUI + Hermes Agent free?

Yes. Both Open WebUI and Hermes Agent are free and open-source under MIT license. You can run them on your own hardware at zero cost using local models via Ollama. If you use cloud models through OpenRouter, API costs are typically $15-80/month depending on usage.

Q: How do I connect Hermes Agent to Open WebUI?

Add API_SERVER_ENABLED=true to your Hermes Agent environment file, set an API key, run 'hermes gateway', then add the connection in Open WebUI under Admin Settings → Connections using the URL http://host.docker.internal:8642/v1 and your API key.

Q: What is GEPA in Hermes Agent?

GEPA (Genetic-Pareto Prompt Evolution) is Hermes Agent's self-improvement system. It rewrites its own prompting strategies every 15 tool calls, has been verified by ETH Zurich to deliver 33-38% speedup, and is the key feature that separates Hermes from other AI agents.

Q: Can I use Hermes Agent with local models?

Yes. Hermes Agent supports local models through Ollama, and you can switch between local and cloud models directly within the Open WebUI interface. This means you can run the entire stack privately on your own hardware with zero API costs.

Key Takeaways

Hermes Agent is the fastest-growing open-source AI agent with 118K+ GitHub stars — it learns from every task and improves itself via GEPA
Open WebUI is the #1 self-hosted AI chat interface with 134K+ stars and 290M+ downloads — a polished ChatGPT-style frontend
Together they give you a free, self-hosted ChatGPT replacement with persistent memory, 200+ models, and scheduled automation
Setup takes under 15 minutes with Docker — three config lines connect the two tools
Works on mobile, supports voice/video calls, file uploads, web search, image generation, and code preview

What Is Hermes Agent?
What Is Open WebUI?
Why This Combo Is the Best Self-Hosted ChatGPT
Step-by-Step Setup Guide
Advanced Features You Get for Free
Hermes Agent vs OpenClaw vs Claude Code
Pros and Cons
FAQ

If you've been paying $20 a month for ChatGPT Plus and wondering whether there's a better way, there is — and it's completely free. Hermes Agent and Open WebUI are two open-source projects that, when combined, give you something ChatGPT can't: a self-hosted AI that remembers everything, improves itself over time, and runs on your own hardware with zero subscription fees.

The problem with Hermes Agent on its own is the interface — you're stuck in a terminal or the official dashboard, which doesn't even have a chat option. Open WebUI solves that completely. It wraps Hermes in a polished, ChatGPT-style interface with file uploads, conversation management, model switching, and even voice calls. Together, these two projects have 250,000+ combined GitHub stars and they're both growing faster than anything else in the AI open-source space right now.

For AI Builders

Get Your AI Tool in Front of 50,000+ Monthly Readers

PopularAiTools.ai reaches developers, founders, and AI buyers actively searching for their next tool.

50K+

Monthly Visitors

1,000+

Tools Listed

8,500+

AI Resources

Submit Your AI Tool →

Infographic overview — Open WebUI + Hermes Agent self-hosted ChatGPT setup — Visual overview: how Open WebUI and Hermes Agent combine into a self-hosted ChatGPT replacement

Open WebUI + Hermes Agent key features — self-improving AI, persistent memory, 200+ models, Docker ready — Key features at a glance: what the Open WebUI + Hermes Agent stack brings to the table

What Is Hermes Agent?

Hermes Agent is a self-improving AI agent built by Nous Research — the same team behind the Hermes family of large language models. But Hermes Agent isn't just a model. It's a full autonomous agent framework that hit 100,000 GitHub stars in just 7 weeks, making it the fastest-growing open-source AI project of 2026.

What separates Hermes from tools like Claude Code or OpenClaw is its learning loop. Hermes uses a system called GEPA (Genetic-Pareto Prompt Evolution) that rewrites its own prompting strategies every 15 tool calls. ETH Zurich verified a 33-38% speedup from this self-improvement. In plain terms: Hermes gets better at your specific tasks the more you use it.

Hermes Agent GitHub repository — 118K+ stars, self-improving AI agent by Nous Research — Hermes Agent on GitHub — 118K+ stars and the fastest-growing open-source AI agent in 2026

The feature list is stacked. Persistent memory across sessions with FTS5 search. Built-in cron scheduler for automated tasks. Parallel subagent processing. Support for 200+ models via OpenRouter, Nous Portal, OpenAI, and local Ollama. 47 built-in tools including web search, code execution, and file management. And it runs on hardware as modest as a $5 VPS.

The catch? Hermes Agent's native interface is a terminal CLI. The official dashboard lets you manage skills, sessions, and scheduled tasks — but it doesn't actually let you chat with the agent. That's where Open WebUI comes in.

FLASHCARDS Test Your Knowledge: Hermes Agent & Open WebUI

Q: What is Hermes Agent?

A self-improving AI agent by Nous Research with 118K+ GitHub stars, persistent memory, and GEPA self-optimization.

Q: What is GEPA?

Genetic-Pareto Prompt Evolution — rewrites prompting strategies every 15 tool calls for 33-38% speedup (verified by ETH Zurich).

Q: How many GitHub stars does Hermes Agent have?

118,000+ stars — hit 100K in just 7 weeks, the fastest growth of any AI project in 2026.

Q: What is Open WebUI?

The #1 self-hosted AI chat interface with 134K+ GitHub stars and 290M+ downloads — a polished ChatGPT-style frontend.

Q: How many community members does Open WebUI have?

385,000+ members in the Open WebUI community.

Q: Are both tools free to use?

Yes — both are 100% free and open-source under MIT license. You only pay for cloud model API calls if you choose to use them.

What Is Open WebUI?

Open WebUI — formerly known as Ollama WebUI — is a self-hosted AI platform with 134,000+ GitHub stars and over 290 million downloads. It's the most popular open-source ChatGPT-style interface in existence, and it's designed to work entirely offline if you want it to.

Open WebUI homepage — the freedom AI stack, self-hosted chat interface — Open WebUI — "The freedom AI stack" — 134K+ GitHub stars and 385K+ community members

Think of it as the interface layer. On its own, Open WebUI connects to Ollama for running local models. But when you plug in Hermes Agent as the backend instead, you unlock an entirely different level of capability. Suddenly your chat interface has persistent memory across sessions, self-improving prompt strategies, scheduled automation, and access to 200+ cloud models — all while keeping the clean, familiar ChatGPT-style experience.

The feature set is comprehensive: multiple user accounts, conversation management, MCP app support, file and knowledge uploads, saved prompts, web search, image generation, code interpreter with live preview, and even voice and video calls. It's available on mobile too, so you can manage your agents from anywhere.

Why This Combo Is the Best Self-Hosted ChatGPT Alternative

Most Open WebUI guides tell you to connect it to Ollama and call it a day. That works — but you're leaving 90% of what's possible on the table. Ollama gives you local model inference. Hermes Agent gives you an autonomous AI that remembers, learns, automates, and connects to 200+ models. The difference isn't incremental. It's a different category of tool.

Feature	ChatGPT Plus	Open WebUI + Ollama	Open WebUI + Hermes
Monthly Cost	$20/month	Free	Free
Self-Hosted	No	Yes	Yes
Persistent Memory	Limited	No	Full (FTS5)
Self-Improving	No	No	Yes (GEPA)
Models Available	GPT-4/5 only	Local only	200+ (local + cloud)
Scheduled Tasks	No	No	Built-in cron
Data Privacy	Cloud (OpenAI servers)	100% local	100% local

Self-hosted AI comparison — ChatGPT vs Open WebUI + Ollama vs Open WebUI + Hermes Agent — Side-by-side: why the Open WebUI + Hermes Agent stack outperforms every alternative

FLASHCARDS Test Your Knowledge: Integration & Setup

Q: What port does the Hermes API server use?

Port 8642 — the OpenAI-compatible API endpoint.

Q: What env variable enables the Hermes API server?

API_SERVER_ENABLED=true in the Hermes Agent environment file.

Q: What command starts the Hermes gateway?

hermes gateway — this launches the API server that Open WebUI connects to.

Q: Can you use Docker to set up both tools?

Yes — both Open WebUI and Hermes Agent fully support Docker deployment.

Q: Where do you add the Hermes connection in Open WebUI?

Admin Settings → Connections → add the Hermes API URL and key.

Q: What API standard does Hermes use for integration?

OpenAI-compatible API — meaning any tool that speaks OpenAI's format can connect to Hermes.

Step-by-Step Setup Guide

The entire setup takes under 15 minutes if you have Docker installed. Here's the complete walkthrough, based on the official documentation and the video tutorial above.

Prerequisites:

Docker Desktop (Windows/Mac) or Docker Engine (Linux)
Hermes Agent installed on your machine (quickstart guide)
Open WebUI running via Docker (one command — shown below)
~15 minutes of setup time

Step 1: Install Hermes Agent

If you haven't already, install Hermes Agent from the official repo. Once installed, verify it's running — you should be able to interact with it from the terminal. The key is that Hermes needs to be operational before we connect Open WebUI to it.

Step 2: Enable the API Server

Add the following to your Hermes Agent environment file. This exposes an OpenAI-compatible API that Open WebUI can talk to:

    API_SERVER_ENABLED=true
API_SERVER_KEY=your-random-api-key
API_SERVER_PORT=8642

Replace the API key with any strong, random string. This becomes the authentication key you'll enter in Open WebUI. Then start the gateway:

    hermes gateway

Step 3: Deploy Open WebUI with Docker

One command launches Open WebUI in Docker. It'll be accessible at localhost:3000:

    docker run -d -p 3000:8080 --add-host=host.docker.internal:host-gateway -v open-webui:/app/backend/data --name open-webui --restart always ghcr.io/open-webui/open-webui:main

Step 4: Connect Hermes to Open WebUI

Open localhost:3000 in your browser, create an admin account, then navigate to Admin Settings → Connections. Add a new connection:

    URL: http://host.docker.internal:8642/v1
API Key: your-random-api-key

Open WebUI + Hermes Agent integration documentation — step-by-step API configuration — The official integration docs — three config lines connect Hermes Agent to Open WebUI

Step 5: Start Chatting

Select "hermes-agent" from the model dropdown in Open WebUI and you're done. You now have a self-hosted ChatGPT replacement with persistent memory, self-improving capabilities, and access to every model Hermes supports. The interface looks and feels exactly like ChatGPT — but it's yours, running on your hardware, with zero subscription fees.

Advanced Features You Get for Free

Once you're running, the real power starts showing. Here's what's available through the Open WebUI + Hermes stack that you can't get from a basic Ollama setup:

Model Switching

Switch between local Ollama models, Hermes Agent, DeepSeek, OpenClaw, and any OpenRouter model — all from the same dropdown. No config changes needed.

File & Knowledge Uploads

Attach files, notes, and knowledge bases directly in chat. Way easier than typing file paths in a terminal — drag, drop, and query.

Custom Agent Profiles

Create different Hermes profiles with unique system prompts, tools, and knowledge. Think of them as specialized GPTs — but self-hosted and self-improving.

Code Preview

When Hermes builds code, Open WebUI renders a live preview right in the chat — similar to ChatGPT's canvas but for your self-hosted agent.

Voice & Video Calls

Talk to your agent using voice or video — available on desktop and mobile. Perfect for hands-free brainstorming sessions.

Web Search & Image Gen

Built-in web search and image generation tools — configure once in Open WebUI and every agent can use them.

Five-step setup pipeline — Docker, Open WebUI, Hermes Agent, API server, connect and chat — The five-step pipeline: from Docker install to a fully operational self-hosted ChatGPT

Hermes Agent vs OpenClaw vs Claude Code

If you're evaluating AI agents in 2026, these are the three names that keep coming up. Here's how they compare at a high level:

	Hermes Agent	OpenClaw	Claude Code
Focus	General-purpose AI agent	IDE coding agent	CLI coding agent
Self-Improving	Yes (GEPA)	No	No
Chat UI	Via Open WebUI	Built-in IDE	Terminal + VS Code
Price	Free (OSS)	Free (OSS)	$20/mo (Max plan)
Best For	Research, automation, multi-platform	Building software in IDE	Complex coding, agentic workflows

The short version: if you're primarily coding, Claude Code or OpenClaw are purpose-built for that. If you want a general-purpose AI assistant that handles research, automation, messaging, scheduling, and coding — and improves itself every time you use it — Hermes Agent with Open WebUI is the play.

FLASHCARDS Test Your Knowledge: Features & Comparison

Q: What advantage does Hermes have over plain Ollama?

Persistent memory, self-improving GEPA, built-in automation, 200+ model access, and skills that evolve over time.

Q: How many models can Hermes access via OpenRouter?

200+ models — including DeepSeek, Claude, GPT, Kimi, and local Ollama models.

Q: Can Hermes Agent run scheduled tasks?

Yes — built-in cron scheduler for automating repetitive tasks without manual intervention.

Q: What messaging platforms does Hermes support?

15+ platforms — CLI, Telegram, Discord, Slack, WhatsApp, Signal, and more.

Q: Does Open WebUI support file uploads?

Yes — files, notes, knowledge bases, and reference chats can be attached directly in the chat interface.

Q: Can you use voice/video with this setup?

Yes — Open WebUI supports voice and video calls on both desktop and mobile.

Key statistics — 118K+ Hermes stars, 134K+ Open WebUI stars, 200+ models, $0 cost — The numbers that matter: 250K+ combined GitHub stars and zero subscription fees

Self-Hosted AI Stack Comparison

	Open WebUI + Hermes	LibreChat	LobeChat	Jan.ai	AnythingLLM
Cost	Free	Free	Free	Free	Free
Memory	Persistent (FTS5)	None	None	Basic	RAG-based
Self-Improving	Yes (GEPA)	No	No	No	No
Best For	General AI + automation	Multi-provider chat	Best UI + plugins	Privacy-first desktop	RAG + documents
Docker	Yes	Yes	Yes	Desktop app	Yes

Pros and Cons

Strengths

✓ Completely free. Both tools are open-source with MIT license. Zero subscription fees ever.
✓ Self-improving. GEPA means the agent gets measurably better the more you use it — no other self-hosted tool does this.
✓ Complete privacy. Everything runs locally. Your data never leaves your machine unless you choose cloud models.
✓ Setup in 15 minutes. Docker handles everything. Three config lines connect the two tools.

Limitations

✗ Docker required. You need Docker installed, which can be intimidating for non-technical users.
✗ Self-hosting responsibility. You manage updates, backups, and security — no vendor handles it for you.
✗ Local models need GPU. Running large models locally requires decent hardware. Cloud models via OpenRouter solve this but cost money.
✗ Two moving parts. Debugging issues means checking both Open WebUI and Hermes Agent — more complexity than a single-tool setup.

Related Articles

Frequently Asked Questions

What is Hermes Agent?

Hermes Agent is a self-improving AI agent framework by Nous Research with 118K+ GitHub stars. It features persistent memory, GEPA self-optimization (33-38% speedup verified by ETH Zurich), built-in cron scheduling, 47 tools, and support for 200+ models. Unlike basic chatbots, Hermes learns from every interaction and gets better over time.

Is Open WebUI + Hermes Agent free?

Yes — both are 100% free and open-source under MIT license. The only costs are optional: cloud model API calls through OpenRouter (typically $15-80/month), or VPS hosting if you don't run it on your own machine. You can run the entire stack on local hardware with Ollama models for truly zero cost.

How do I connect Hermes Agent to Open WebUI?

Three steps: (1) Add API_SERVER_ENABLED=true to your Hermes environment file and set an API key. (2) Run hermes gateway. (3) In Open WebUI, go to Admin Settings → Connections and add the URL http://host.docker.internal:8642/v1 with your API key.

What is GEPA in Hermes Agent?

GEPA stands for Genetic-Pareto Prompt Evolution. It's Hermes Agent's self-improvement engine that rewrites its own prompting strategies every 15 tool calls. ETH Zurich independently verified a 33-38% speedup from this process. It's what makes Hermes fundamentally different from other AI agents — it actually gets better with use.

Can I use Hermes Agent with local models?

Absolutely. Hermes works with local Ollama models, and you can switch between local and cloud models directly in the Open WebUI interface. This means you can run the entire stack privately on your own hardware with zero API costs — perfect for sensitive data or air-gapped environments.

Are You Building an AI Tool? Get your tool listed in front of thousands of developers, creators, and businesses searching for AI solutions

Submit Your AI Tool — Free Listing →

Discover More

Explore 1,000+ AI Tools on PopularAiTools.ai

From coding assistants to self-hosted AI stacks — find the perfect tool for your next project.

Browse All AI Tools →