On-premises AI for regulated professions.
Saklam Bridge masks PII inside your network and forwards only tokens to Anthropic, OpenAI, Google, Azure, AWS, Mistral, or your own inference cluster. Client data never leaves your premises in plaintext.
How it works
What Saklam sees: nothing. What the LLM provider sees: masked tokens.
Workstation
Claude Code, Cursor, or your own apps point at bridge.intern.firm.com
Saklam Bridge (Docker)
Runs on-prem inside your firm network. Masks PII (453 patterns) before requests leave your network.
LLM Provider
Anthropic, OpenAI, Google, Azure, AWS Bedrock, Mistral, or your self-hosted endpoint see only tokens like [PER_a1b2]. The response is unmasked on the way back.
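The mask-and-unmask round trip described above can be sketched in a few lines of Python. This is an illustration only, not Bridge's actual implementation: the single e-mail regex, the EMAIL token type, and the in-memory vault dictionary are assumptions made for the example. Only the token shape, modeled on [PER_a1b2], comes from the description above.

```python
import re
import secrets

# Hypothetical single pattern for illustration; the product ships its own pattern set.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
# Token shape modeled on the [PER_a1b2] example: a type label plus 4 hex characters.
TOKEN_RE = re.compile(r"\[[A-Z]+_[0-9a-f]{4}\]")


def mask(text, vault):
    """Replace each e-mail address with a token and record token -> value in vault."""
    reverse = {value: token for token, value in vault.items()}

    def substitute(match):
        value = match.group(0)
        if value not in reverse:
            token = f"[EMAIL_{secrets.token_hex(2)}]"
            while token in vault:  # retry on the rare token collision
                token = f"[EMAIL_{secrets.token_hex(2)}]"
            vault[token] = value
            reverse[value] = token
        return reverse[value]

    return EMAIL_RE.sub(substitute, text)


def unmask(text, vault):
    """Swap known tokens back to their original values; unknown tokens stay as-is."""
    return TOKEN_RE.sub(lambda m: vault.get(m.group(0), m.group(0)), text)
```

In this sketch the vault never leaves the process that did the masking, which mirrors the idea that only tokens travel to the provider while the mapping stays on-prem.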
Three properties, three sales arguments
Zero-Knowledge
Saklam servers are not in the data path. The container runs on-prem and is the only component that sees plaintext; Saklam itself sees nothing. This is real technical separation, not just contractual assurance.
§203-compliant
Client data never leaves your premises as plaintext. No DPA with Saklam is needed; you sign a DPA directly with your LLM provider (Anthropic, OpenAI, or Mistral).
BYOK
Anthropic, OpenAI, Google, Azure OpenAI, AWS Bedrock, Mistral, or your own inference cluster (Ollama/vLLM) — your direct provider relationship. We don't mark up your AI usage; we license you the software.
Live in 5 minutes
Linux x86_64 or ARM64, 2 vCPU + 4 GB RAM minimum. Docker Engine ≥ 24.
1. Create a working directory
mkdir -p /opt/saklam-bridge && cd /opt/saklam-bridge
2. Download docker-compose.yml + .env template
curl -fsSL https://raw.githubusercontent.com/saklam/bridge/main/docker/docker-compose.yml -o docker-compose.yml
curl -fsSL https://raw.githubusercontent.com/saklam/bridge/main/docker/.env.example -o .env
3. Enter ANTHROPIC_API_KEY and BRIDGE_MASTER_KEY into .env
$EDITOR .env
4. Start
docker compose pull && docker compose up -d
5. Smoke test
curl -fsS http://localhost:4000/health/readiness
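The container may need a few seconds after startup before the readiness endpoint answers, so a single curl call can fail even on a healthy install. A minimal polling sketch, assuming the endpoint returns HTTP 200 once Bridge is ready; the helper names, the attempt count, and the delay are illustrative choices, not part of the product:

```python
import http.client
import time
import urllib.error
import urllib.request


def http_ok(url, timeout=2.0):
    """Return True if a GET on url answers with HTTP 200, False on any failure."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        return False


def wait_until_ready(url, attempts=30, delay=2.0, probe=http_ok, sleep=time.sleep):
    """Poll url until probe(url) succeeds; True on success, False when attempts run out."""
    for attempt in range(attempts):
        if probe(url):
            return True
        if attempt < attempts - 1:
            sleep(delay)  # wait before the next attempt, but not after the last one
    return False
```

Calling wait_until_ready("http://localhost:4000/health/readiness") from a deployment script then blocks until the service is up or the attempts are exhausted.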
Pricing
Licensed per Named User (= staff regularly using Bridge). Quarterly cutoff.
Pilot
- Unlimited volume
- Setup support directly with Stefan
- Feedback flows into the roadmap
Pro
- Auto-updates
- Email support within 2 business days
- DPA addendum as PDF
Enterprise
- Priority support (4h within business hours)
- Custom PII patterns
- Audit trail export
Cost example: 60-staff firm on Pro tier = €17,940/year. Compare: 60 × Libra Pro at €200/mo = €144,000/year.
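The arithmetic behind the cost example can be checked directly. The per-user figure below is derived from the stated total and is not a published list price:

```python
STAFF = 60
PRO_TOTAL_PER_YEAR_EUR = 17_940       # total from the cost example
LIBRA_PER_USER_PER_MONTH_EUR = 200    # comparison price from the cost example

# Derived figure, not a quoted price: what the Pro total implies per Named User.
pro_per_user_per_year = PRO_TOTAL_PER_YEAR_EUR / STAFF

# Comparison product: per-user monthly price times headcount times 12 months.
libra_total_per_year = STAFF * LIBRA_PER_USER_PER_MONTH_EUR * 12

difference_per_year = libra_total_per_year - PRO_TOTAL_PER_YEAR_EUR

print(pro_per_user_per_year, libra_total_per_year, difference_per_year)
```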
Frequently asked questions
Do we need a DPA (Auftragsverarbeitungsvertrag) with Saklam?
No. Saklam Bridge runs on-prem inside your infrastructure, so Saklam receives no personal data and a DPA under Art. 28 GDPR would have nothing to cover. Instead we provide a site-license agreement plus a data protection addendum for your DPO documentation. A DPA with the LLM provider (Anthropic, OpenAI, Mistral) is still required; you sign it directly with the provider, and Saklam does not act as a reseller or intermediary.
Which LLM providers are supported?
Anthropic (Claude), OpenAI (GPT), Azure OpenAI (same GPT models with EU data residency + Microsoft DPA), Google Gemini (Vertex AI EU or AI Studio), AWS Bedrock (Claude/Llama/Mistral via AWS, EU Frankfurt), Mistral La Plateforme, and self-hosted via Ollama/vLLM/TGI inside your own network. Customer-provider relationship is direct — Saklam is not a reseller.
We run our own inference cluster (Ollama/vLLM). Does Bridge add value there?
Yes. Point Bridge at your cluster via OLLAMA_API_BASE (OpenAI-compatible API). The PII masking layer stays active — even with on-prem inference you get consistent token formats, an audit trail, and unified routing logic for both your local models and cloud fallbacks. Belt-and-suspenders.
We use Claude Pro / Anthropic Team / ChatGPT Plus. Does that work?
Unfortunately not. Bridge only works with classic API keys (pay-as-you-go, Console → API Keys). Subscription OAuth tokens are not accepted.
What hardware do we need?
Linux x86_64 or ARM64. 2 vCPU + 4 GB RAM minimum, 4 vCPU + 8 GB recommended for multi-user. ~3 GB storage for container + models.
What is the latency overhead?
Typically ~70 ms of PII detection overhead per request (Linux CPU). Structured PII (email, IBAN, phone) stays under 100 ms. PERSON/ORG detection (GLiNER + Presidio) adds 1-2 s on the first request and is cached afterwards.
How does unmasking work with streaming responses?
Bridge unmasks chunk-by-chunk, including tokens that split across chunks. Works for both OpenAI and Anthropic SSE streams.
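One way to handle tokens that split across chunks is to hold back any trailing text that could still grow into a token and release it once the next chunk settles the question. The sketch below shows that buffering idea under stated assumptions: it is not Bridge's actual code, the StreamUnmasker class and the vault dictionary are hypothetical, the token shape is modeled on [PER_a1b2], and it works on plain text deltas rather than raw SSE frames.

```python
import re

# Complete token, modeled on the [PER_a1b2] example.
TOKEN_RE = re.compile(r"\[[A-Z]+_[0-9a-f]{4}\]")
# Any proper prefix of a token: "[", "[PER", "[PER_", "[PER_a1b2" (missing "]").
PARTIAL_RE = re.compile(r"\[(?:[A-Z]+(?:_[0-9a-f]{0,4})?)?")
MAX_HOLD = 32  # never hold back more than this many characters


class StreamUnmasker:
    """Restores tokens in a chunked text stream, including tokens split across chunks."""

    def __init__(self, vault):
        self.vault = vault   # maps token -> original value
        self.pending = ""    # trailing text that may still become a token

    def _restore(self, text):
        return TOKEN_RE.sub(lambda m: self.vault.get(m.group(0), m.group(0)), text)

    def feed(self, chunk):
        """Accept one chunk and return the text that is safe to emit now."""
        text = self.pending + chunk
        start = text.rfind("[")
        tail_is_partial = (
            start != -1
            and len(text) - start <= MAX_HOLD
            and PARTIAL_RE.fullmatch(text, start) is not None
        )
        if tail_is_partial:
            self.pending = text[start:]
            return self._restore(text[:start])
        self.pending = ""
        return self._restore(text)

    def flush(self):
        """Emit whatever is still held back once the stream has ended."""
        rest, self.pending = self.pending, ""
        return rest
```

Feeding the chunks "Dear [PE", "R_a1", "b2]," in order emits "Dear " first, nothing for the middle chunk, and the restored name once the closing bracket arrives. The cost of this approach is a small delay on text that merely looks like the start of a token.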
Can we add custom PII patterns for our client workflows?
Enterprise tier includes this. Pro tier on request.
How do we update the image?
Run: docker compose pull && docker compose up -d
Auto-updates are off; you control what runs in your network.
Is Saklam Bridge open source?
No. Saklam Bridge is closed-source and proprietary. The image is published at hub.docker.com/r/saklam/bridge, the setup docs are public, and it is licensed as a site license per Named User.
Looking for pilot customers
We're looking for 1-3 mid-sized, IT-savvy law firms (30-100 staff) for a 30-day pilot. Unlimited use, personal setup support from the founder, and feedback that flows directly into the roadmap. Afterwards: standard site license.
Book a 30-min demo
Direct questions? bridge@saklam.com