Our Mission

Securing & Observing Modern Infrastructure

O11yConsult was founded by SREs who realized that visibility is only half the battle. To build truly resilient systems, you need automated security and transparent telemetry. We specialize in the "Golden Stack": OpenTelemetry for data, Caddy for edge routing, and Smallstep for identity.

Impact by the numbers

60% MTTR Reduction
45% O11y Cost Savings
85+ Caddy/mTLS Deployments
250K+ Certificates Automated

Our Specialist Areas

We bridge the gap between infrastructure security and operational visibility.

OpenTelemetry Architecture

Building vendor-neutral pipelines that collect traces, metrics, and logs without proprietary overhead.

Automated PKI & mTLS

Implementing Smallstep CA to provide every microservice with a secure, auto-renewing identity.

Edge Performance (Caddy)

Replacing rigid, complex proxies with Caddy for native observability and zero-touch HTTPS.

The Principles of Reliable Systems

Our consulting philosophy is built on three fundamental engineering pillars.

Identity-Based Security

IP addresses are not identities. We use Smallstep to ensure every internal request is authenticated via mTLS.
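
As a rough illustration of authorizing on certificate identity rather than source address, the sketch below checks the subjectAltName entries of a peer certificate in the dict shape returned by Python's `ssl.SSLSocket.getpeercert()`. The allow-list, the `spiffe://` names, and the helper names are hypothetical; this is not the configuration we deploy, only a minimal picture of the idea.

```python
from __future__ import annotations

from typing import Iterable

# Hypothetical allow-list; the URI scheme and service names are illustrative only.
ALLOWED_CALLERS = {
    "spiffe://example.internal/payments",
    "spiffe://example.internal/orders",
}


def peer_identities(peer_cert: dict) -> set[str]:
    """Collect URI and DNS subjectAltName values from a getpeercert()-style dict."""
    sans = peer_cert.get("subjectAltName", ())
    return {value for kind, value in sans if kind in ("URI", "DNS")}


def is_authorized(peer_cert: dict, allowed: Iterable[str] = ALLOWED_CALLERS) -> bool:
    """Authorize on certificate identity; the caller's source IP is never consulted."""
    return not peer_identities(peer_cert).isdisjoint(allowed)
```

A request is accepted only when the verified client certificate carries one of the expected names, so a workload that moves to a new IP address keeps its access and a stranger on a trusted subnet does not gain any.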

Observability as Code

Dashboards and telemetry pipelines should be reproducible. We manage Caddy and OTel configurations as code, not through manual UI clicks.

Zero-Touch Operations

If you are manually renewing certificates, you are building debt. We automate the toil away.
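
To make the automation concrete, here is a minimal sketch of the timing decision a renewal daemon has to make: renew once a fixed fraction of the certificate's lifetime has elapsed. The two-thirds threshold and the function names are assumptions chosen for illustration, not a documented setting of any particular tool.

```python
from datetime import datetime, timedelta, timezone


def renew_at(not_before: datetime, not_after: datetime, fraction: float = 2 / 3) -> datetime:
    """Return the moment at which `fraction` of the certificate lifetime has elapsed."""
    if not 0 < fraction < 1:
        raise ValueError("fraction must be between 0 and 1")
    return not_before + (not_after - not_before) * fraction


def needs_renewal(
    not_before: datetime,
    not_after: datetime,
    now: datetime,
    fraction: float = 2 / 3,  # assumed threshold, for illustration only
) -> bool:
    """True once the renewal point has passed, leaving the remainder as a safety margin."""
    return now >= renew_at(not_before, not_after, fraction)


if __name__ == "__main__":
    issued = datetime(2024, 1, 1, tzinfo=timezone.utc)
    expires = issued + timedelta(hours=24)
    print(needs_renewal(issued, expires, issued + timedelta(hours=17)))
```

Renewing well before expiry leaves room for retries if the CA is briefly unreachable, which is what makes short-lived certificates practical without a human in the loop.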

The O11yConsult Approach

Step 1: Audit & Baseline

We analyze your current telemetry spend, MTTR, and the manual toil in your certificate management.

Step 2: Architecture Design

We design a vendor-neutral Golden Stack tailor-made for your specific cloud environment.

Step 3: Implementation & Training

We don't just hand over code; we embed with your team to ensure operational readiness.

Consulting Milestones

Solving high-stakes infrastructure challenges for modern engineering teams.

Zero-Trust mTLS Rollout

Architected automated PKI for a fintech client using Smallstep, securing 4,000+ microservice connections.

Edge Proxy Modernization

Migrated a global e-commerce platform from Nginx to Caddy, reducing config complexity by 70%.

Telemetry Cost Governance

Reduced Prometheus metric cardinality for a gaming platform, saving $800k/year.

Frequently Asked Questions

Addressing the technical hurdles and strategic concerns of modern infrastructure.

Does mTLS add significant latency to my services?

Modern TLS handshakes are efficient, and the handshake runs once per connection, so with connection reuse its cost is spread across many requests. With Smallstep automating short-lived certificates and Caddy handling high-performance proxying, the added latency per request is typically sub-millisecond, a negligible trade-off for zero-trust security.
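
The sub-millisecond figure depends on connection reuse: the handshake is paid once per connection and then amortized over every request sent on it. The arithmetic can be sketched as below; the function name and the sample numbers are hypothetical and are not measurements from any deployment.

```python
def amortized_handshake_ms(handshake_ms: float, requests_per_connection: int) -> float:
    """Spread a one-time handshake cost evenly over the requests sharing a connection."""
    if requests_per_connection < 1:
        raise ValueError("a connection must carry at least one request")
    return handshake_ms / requests_per_connection


if __name__ == "__main__":
    # Hypothetical inputs: a 2 ms handshake on a connection reused for 100 requests.
    print(amortized_handshake_ms(2.0, 100))
```

The same function shows the failure mode to watch for: when every request opens a fresh connection, the full handshake cost lands on each request, so keep-alive and pooling settings matter more than the choice of cipher.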

Why use Caddy instead of Nginx or HAProxy?

While Nginx and HAProxy are capable, Caddy is written in memory-safe Go and features a native ACME client and built-in OpenTelemetry tracing. This allows us to automate TLS certificates and export telemetry without external plugins or complex Lua scripts.

How do you handle vendor lock-in with OpenTelemetry?

The primary goal of OTel is to eliminate lock-in. By instrumenting with OTel SDKs and using OTel Collectors, you can route your data to Datadog today and ClickHouse or Honeycomb tomorrow by changing the exporter section of the Collector's YAML configuration, with no changes to application code.
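
To show how small that change is, the sketch below models a Collector configuration as a Python dict using the Collector's top-level sections (receivers, processors, exporters, service pipelines) and swaps the exporter. The exporter names, the example.com endpoints, and the `switch_exporter` helper are hypothetical; a real configuration would live in a YAML file and carry backend-specific settings.

```python
import copy
import json

# Illustrative configuration; endpoints and exporter names are placeholders.
BASE_CONFIG = {
    "receivers": {"otlp": {"protocols": {"grpc": {}}}},
    "processors": {"batch": {}},
    "exporters": {"otlp/current": {"endpoint": "current-backend.example.com:4317"}},
    "service": {
        "pipelines": {
            "traces": {
                "receivers": ["otlp"],
                "processors": ["batch"],
                "exporters": ["otlp/current"],
            }
        }
    },
}


def switch_exporter(config: dict, name: str, settings: dict) -> dict:
    """Return a copy of `config` whose pipelines all send to the single exporter `name`."""
    updated = copy.deepcopy(config)  # leave the original configuration untouched
    updated["exporters"] = {name: settings}
    for pipeline in updated["service"]["pipelines"].values():
        pipeline["exporters"] = [name]
    return updated


if __name__ == "__main__":
    new_config = switch_exporter(
        BASE_CONFIG, "otlp/next", {"endpoint": "next-backend.example.com:4317"}
    )
    print(json.dumps(new_config, indent=2))
```

Receivers and processors are untouched by the swap, which is the point: the instrumented services keep emitting OTLP and never learn which backend sits behind the Collector.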

Can we migrate to VictoriaMetrics without losing historical data?

Yes. VictoriaMetrics supports data ingestion from Prometheus, InfluxDB, and OpenTSDB. We provide migration paths that backfill your historical data so your Grafana dashboards maintain continuity from day one.

What is the "Agent Tax" and how do you reduce it?

Proprietary agents (like Datadog's) often consume significant CPU and RAM. We replace these with the lightweight OpenTelemetry Collector, using tail-based sampling to drop redundant traces before they ever leave your network, saving on compute, network egress, and SaaS ingestion costs.
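
Tail-based sampling means the keep-or-drop decision is made after a trace has finished, when its errors and latency are known. The sketch below illustrates one such policy: keep every trace with an error or a slow span, and keep a fixed share of the rest. The `Span` type, the thresholds, and the hashing scheme are assumptions for illustration and are not the Collector's `tail_sampling` processor.

```python
from __future__ import annotations

import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    trace_id: str
    duration_ms: float
    is_error: bool = False


def keep_trace(spans: list[Span], slow_ms: float = 500.0, baseline_percent: int = 10) -> bool:
    """Decide on a completed trace: keep errors and slow traces, sample the remainder."""
    if not spans:
        return False
    if any(span.is_error for span in spans):
        return True  # failures are always worth keeping
    if max(span.duration_ms for span in spans) >= slow_ms:
        return True  # so are latency outliers
    # Deterministic baseline: hash the trace ID into one of 100 buckets so that
    # every collector instance reaches the same decision for the same trace.
    digest = hashlib.sha256(spans[0].trace_id.encode("utf-8")).hexdigest()
    return int(digest, 16) % 100 < baseline_percent
```

Healthy, fast traces are the bulk of most traffic and the least informative, so dropping most of them is where the savings come from, while the interesting traces are retained in full.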

Is this stack compatible with legacy on-premise hardware?

Absolutely. While the "Golden Stack" is cloud-native by design, Caddy and Smallstep are distributed as standalone binaries and run on bare metal, VMs, or even resource-constrained edge devices.

Get Started

Work With Us

Infrastructure Audit

A 2-week deep dive into your telemetry gaps, proxy performance, and internal security posture.

Implementation Retainer

Direct access to SRE experts for deploying OpenTelemetry, Smallstep CA, or Caddy across your fleet.

Ready to Modernize Your Edge?

Stop managing certificates and start collecting data. Schedule a Caddy migration audit today.