Consulting Expertise

Specialized support to transform your engineering operations

🔥

Production Readiness Assessment

With Focus on Incident Response & Operational Resilience

Before your service goes live, ensure your team is ready when (not if) things go wrong. Comprehensive assessment of your production readiness—covering traditional services and AI systems—with deep focus on incident response capabilities.

What's Included:

Review of 5-10 recent incidents
Interviews with 6-8 team members
Operational maturity scorecard
Gap analysis with root causes
Prioritized improvement roadmap
Smart automation & AI recommendations
Quick wins you can implement immediately
90-min findings workshop with leadership

Fixed Price: €32,000 | 2-month engagement

Learn More

☁️

Cloud Governance & FinOps

Optimize cloud spend, improve governance, and build sustainable infrastructure practices

What's Included:

Cloud cost optimization & FinOps strategy
Multi-cloud governance frameworks
Infrastructure efficiency assessments
Cost allocation & chargeback models
Cloud policy & compliance design

Fixed Price: €32,000 | 2-month engagement

Learn More

About Me

Since 2011, I've been building production readiness and operational resilience into some of the world's most critical systems. From incident response processes to operational standards, I've seen what works when systems fail and pressure is high.

🏦 Enterprise Incident Response (Dell/EMC)

Customer Engineer managing SAN/NAS storage incidents for major banks and financial institutions where downtime wasn't an option.

🖥️ Hosted Infrastructure (Rackspace)

Managed incident response for Rackspace's largest accounts across Europe from London—handling data-related incidents where every minute of downtime mattered. Ensured capacity planning and operational resiliency of the hosted solutions.

🎯 High-Scale Consumer Systems (Paddypower Betfair)

Handled incident response for one of Europe's largest online betting platforms—systems that needed to stay up during major sporting events with millions of users. Developed observability and resiliency for the betting exchange and private OpenStack cloud.

🎮 Building from Scratch (Unity)

Designed the incident management process from the ground up for Unity in Finland, taking them from ad-hoc firefighting to a structured, scalable approach.

⚡ Scaling Operational Standards (Zapier)

Created standards for service owner teams on-call and operational readiness, establishing frameworks that enabled dozens of teams to own their services effectively at scale.

🤝 Community Leadership

Co-founder of the SRE Finland meetup group, regularly presenting on incident management and post-mortem practices.

What I Bring:

13+ years of production readiness and operational resilience across enterprise, cloud, and high-scale consumer systems
Experience with regulated industries (financial services), high-availability requirements (gaming/betting), and AI systems
Track record of building operational processes from scratch and transforming existing ones
Real-world experience preparing teams for production and managing incidents at companies where downtime has massive business impact
Pragmatic approach that balances process, people, and technology

Free Resources

Practice incident response and improve operational readiness

🛡️

CDN Resilience Worksheet

Practical steps to improve your CDN resilience without expensive multi-CDN setups. Lessons from the Cloudflare outage.

Open Worksheet

📋

Black Friday Operational Playbook

Your guide to managing extreme load and operational stress during high-traffic events.

Read the Playbook

🎮

Incident Commander Game

Navigate realistic incident scenarios and make decisions under pressure without real-world consequences.

Play the Game

Engineering operations.
Minus the chaos.

Consulting Expertise

Production Readiness Assessment

What's Included:

Cloud Governance & FinOps

What's Included:

About Me

🏦 Enterprise Incident Response (Dell/EMC)

🖥️ Hosted Infrastructure (Rackspace)

🎯 High-Scale Consumer Systems (Paddypower Betfair)

🎮 Building from Scratch (Unity)

⚡ Scaling Operational Standards (Zapier)

🤝 Community Leadership

What I Bring:

Free Resources

CDN Resilience Worksheet

Black Friday Operational Playbook

Incident Commander Game

Ready to Transform Your Operations?

Engineering operations.Minus the chaos.

Consulting Expertise

Production Readiness Assessment

What's Included:

Cloud Governance & FinOps

What's Included:

About Me

🏦 Enterprise Incident Response (Dell/EMC)

🖥️ Hosted Infrastructure (Rackspace)

🎯 High-Scale Consumer Systems (Paddypower Betfair)

🎮 Building from Scratch (Unity)

⚡ Scaling Operational Standards (Zapier)

🤝 Community Leadership

What I Bring:

Free Resources

CDN Resilience Worksheet

Black Friday Operational Playbook

Incident Commander Game

Ready to Transform Your Operations?

Engineering operations.
Minus the chaos.