🧩 One Address → Free AI From Many Providers, Auto-Failover

BCBC · June 13, 2026, 8:17am

Milk Multiple Free AI Tiers Through One Local API — Apps Hit a Single Endpoint While FreeLLMAPI Spreads Requests Across Free Providers and Auto-Recovers; Docker Compose Deploy With Logs and Checks

Point any ChatGPT app at YOUR box. It juggles multiple free AI backends behind the scenes.

homelab · ai · self-hosting

Big thanks to SRZ from OneHack for the original FreeLLMAPI thread and bringing it to the community — this is my BCBC lab writeup of getting it running.

The whole idea in one breath: FreeLLMAPI is a little middleman (a forwarder that sits between your app and the AI) that pretends to be ChatGPT’s API (the address apps send AI requests to). So any tool built for ChatGPT talks to it without changes — but behind the curtain it routes to a bunch of free AI providers, and if one’s down or rate-limited, it auto-jumps to the next. One address, many free brains, zero app rewrites.

Why You’d Want This

One endpoint, swap brains freely — apps point at your box, you change providers behind it whenever.
Auto-fallback — provider rate-limits you or dies? It silently switches to the next. No babysitting.
Runs local on your own server — your traffic, your rules.
Stack free providers — milk multiple free AI tiers through one door.

Not hype — repeatable. The point isn’t a flashy demo. It’s infrastructure you can stand up the same way twice and actually rely on.

The Lab Setup

Proxmox CT: 109            (a mini-computer on the server)
Service:    FreeLLMAPI
Runtime:    Docker / docker compose
Port:       3001           (the "door number" it listens on)
Endpoint:   /v1/chat/completions
Database:   SQLite         (tiny built-in data file)

How a request flows:

your app
  ↓
fake-OpenAI address  (looks like ChatGPT's API)
  ↓
FreeLLMAPI gateway   (the middleman)
  ↓
provider routing / fallback
  ↓
multiple free AI backends

Current Status — All Green

Docker container starts clean
SQLite database initializes
Model + fallback seed runs (loads which providers to try and in what order)
API listening on port 3001
Proxy endpoint live at /v1/chat/completions

🔍 Confirm It Yourself (logs)

pct exec 109 -- bash -lc "docker logs --tail 100 freellmapi-freellmapi-1"

You want to see:

Database initialized
Server running on http://[::]:3001
Proxy endpoint: http://[::]:3001/v1/chat/completions

Container up, database seeded, API listening, proxy live.

🛠️ Next Upgrades (my to-do)

nginx reverse proxy — a doorman in front so you hit a clean URL, not a raw port.
Health/status page — see at a glance if it’s alive.
Test the fallback — kill one provider on purpose, confirm it jumps.
Hook in Ollama — (runs AI models locally on your own machine) so you’ve got an offline backend too.
Document safe config examples — copy-paste ready for the next person.

Real homelab truth: the AI service booted faster than the network plumbing did. The full stack dragged in Proxmox, Docker, pfSense, ClouDNS, BIND split DNS (your own private phonebook for internal names), nginx, and browser-cache gremlins. Totally normal — the app is only one piece; the system is the work.

One door, many free AI brains, auto-failover. Who’s spinning this up on their own box — and which providers are you stacking behind it?

Topic		Replies	Views
🔀 [FREE FOREVER] FreeLLMAPI — One Key + 1B Tokens/Month + 14 Providers Stacked Tools & Scripts programming , tips-tricks	6	2314	May 25, 2026
Free GPT-4 Without Paying a Dime (Seriously) Tutorials & Methods cloud-storage , ai	2	2028	July 6, 2025
⚡ Stop Paying for Slow AI — These Free APIs Are 20x Faster Tutorials & Methods tools , freebies	0	832	February 5, 2026
Free GPU Access for Local AI – No Login, No Bill Tutorials & Methods freebies , networking , ai	1	1006	July 1, 2025
🔑 Claude Code for $0 — and the "AUTH Error" That Was Never the API Key Tutorials & Methods programming , tips-tricks , ai	0	605	July 26, 2026

🧩 One Address → Free AI From Many Providers, Auto-Failover

Milk Multiple Free AI Tiers Through One Local API — Apps Hit a Single Endpoint While FreeLLMAPI Spreads Requests Across Free Providers and Auto-Recovers; Docker Compose Deploy With Logs and Checks

Why You’d Want This

The Lab Setup

Current Status — All Green

Related topics