Install Trapetum as a background service on your own GPU machine. You get a private ChatGPT-style chat, an OpenAI-compatible API, admin controls and a usage dashboard, all on localhost. The model runs on your hardware. The compression engine is source available, on GitHub, so your security team can read every line.
For sensitive or regulated data, a local plus source-available LLM is the only setup where you can actually prove what happens to your data. Control and auditability are not features here, they are the point.
One small service on port 8088. Everything below runs locally, served from your own machine.
A ChatGPT-style interface to talk to your compressed models. Pick a model, add more, all on localhost.
/v1/chat/completions and /v1/models. Point any OpenAI client at your own server. Swagger docs at /docs.
Generate and revoke Bearer tokens from the admin console. Lock the API to your apps only.
Port, network binding, CORS, rate limits, default model, prompt logging on or off, all admin-only behind a password.
Graphs of tokens per model, requests, compression rate and energy plus CO2 saved versus fp16, in real time.
Live grid carbon intensity at your location. Compressed 4-bit decode uses about 2.1x less energy than fp16.
Drop-in OpenAI-compatible. Create a token in the admin console, then call your own machine.
curl http://localhost:8088/v1/chat/completions \
-H "Authorization: Bearer trp_your_token" \
-H "Content-Type: application/json" \
-d '{"model":"qwen25-7b","messages":[{"role":"user","content":"Hello"}]}'
| Endpoint | What it does |
|---|---|
GET / | Chat interface |
GET /docs | Swagger API documentation |
POST /v1/chat/completions | Chat completion (OpenAI-compatible) |
GET /v1/models | List installed models |
GET /admin | Admin settings (password protected) |
GET /admin/dashboard | Usage and CO2 dashboard (admin only) |
Requires an NVIDIA GPU with the CUDA runtime. The installer sets up a background service that starts on boot and serves the web UI on http://localhost:8088. During install you set an admin password that locks the settings.
tar xzf trapetum-linux.tar.gz sudo ./trapetum-linux/install-linux.sh # you are prompted for an admin password # manage: systemctl status|restart trapetum
powershell -ExecutionPolicy Bypass ` -File install-windows.ps1 # elevated PowerShell, prompts for an # admin password. Manage in services.msc
After install, open http://localhost:8088, click the + to add a model, and the gear to open the admin settings. Models are compressed ahead of time, so the install stays light.