Fractional CTO · AI Architecture

Turning business vision into systems that scale.

I'm Dzmitry Harupa, Head of Architecture. 17 years of high-load systems, 10 of them in iGaming. I design architecture, take AI/ML to production and still write code myself. TOGAF, SAFe and AWS certified. 100% remote.

Open to new projectsWarsaw · 100% remote
Dzmitry Harupa, Head of Architecture
How we work

Four ways to start.

From a one-off consultation to a fractional CTO role — pick the depth you need now.

Advisory sessions

Architecture reviews and roadmap sessions. One call — specific recommendations.

  • Free 30-min intro call
  • Deep advisory session (90 min)
  • Expert consulting packs

POC & delivery

From concept to a working prototype. Architecture documentation is part of the result.

  • POC Starter: scope & architecture
  • POC Build: working prototype
  • Architecture package with ADRs

Ongoing partnership

Technology leadership part-time: from monthly advisory to a full CTO roadmap.

  • Advisor Lite: monthly guidance
  • Strategic CTO advisor
  • Fractional CTO: full roadmap

Workshops & training

Hands-on workshops for your team: architecture, AI/ML, working with ADRs.

  • Architecture workshops
  • AI/ML strategy sessions
  • Remote & on-site
What I deliver

Strategy through code — one architect, the whole stack.

Enterprise architecture, AI/ML and ten years of iGaming domain knowledge — hands-on, in code.

Technology strategy & roadmap

  • IT strategy aligned with business goals
  • Product roadmap
  • TCO optimization & FinOps

Cloud & enterprise architecture

  • AWS, GCP, Kubernetes
  • Microservices & event-driven design
  • TOGAF enterprise architecture

AI/ML strategy & implementation

  • AWS Bedrock, SageMaker
  • LLM, RAG & AI agents
  • AI process automation
LLM & GenAI services

High-performance development

  • Go, TypeScript, Python, C++
  • SAFe & Lean-Agile delivery
  • DevOps and CI/CD pipelines

iGaming expertise

  • Platform architecture: PAM, payments
  • Game math, RNG development
  • Regulatory compliance

Documentation & standards

  • Architecture Decision Records (ADR)
  • Technical specifications
  • Team standards & guidelines
Why fractional CTO

Senior expertise without a full-time hire.

Cost effective

Senior-level expertise without a full-time salary and overhead.

Flexible scale

The scope changes with your needs. You pay for hours, not a seat.

Fast start

Hiring a CTO takes months. Here the first call is this week.

Low risk

Start with a free consultation. A retainer comes after you see results.

Hands-on delivery

Working prototypes, executable architectures, code in production.

100% remote

Flexible timezone coverage and async communication.

Track record

Outcomes you can measure.

.2M
Annual savings from AI automation
200K
Messages per second across 1000+ systems
+23%
ARPU via CRM event pipeline
25%
Cloud cost reduction via FinOps

AI call-center automation

>

.2M saved per year on AWS Bedrock & SageMaker.

NATS event mesh at scale

200K+ msgs/sec across 1000+ systems, <100ms p99 latency.

Monolith → microservices

40% of transactional load migrated; release lead time down 57%, deploys under 15 minutes.

Decisions on data. Results in numbers.
Dzmitry Harupa
Dzmitry Harupa
Head of Architecture
Warsaw, Poland · 100% remote
17+ years · 10 in iGaming
English C1 · Polish · Russian
About

An architect who stays until it ships.

Ten years in iGaming: Casino, Sports, Poker, PAM, payments and CRM. Production AI/ML on AWS Bedrock, SageMaker and LangGraph.

I grew an architecture practice from 6 to 12 architects plus 20 system analysts, mentored three of them to Head-of-Technology roles and led 200+ engineers. I still write production code.

Technology stack
GoPythonTypeScriptC++AWSKubernetesKafkaNATSClickHouseTerraformBedrockSageMakerLangGraphGrafanaOpenTelemetry
Certifications
SAFe® 6 AgilistAWS Solutions ArchitectTOGAF® EA PractitionerArchiMate® 3CKADProduct Metrics & Analytics
Writing

Notes on architecture and AI.

What I work with: architecture, AI/ML, iGaming.

2026-07-04

My Architect, part 10: a repo memory for the agent — a code graph it's not allowed to trust

The recursive-context story continues: four releases in one day teach the agent a persistent code graph (Graphify) under strict distrust rules — freshness gate, facts only from live files. Plus a controlled A/B test of graph vs grep on NestJS: −71% tokens on subsystem understanding, parity elsewhere — and the hidden sub-agent costs that make naive benchmarks lie.

Read
2026-07-03

My Architect, part 9: Recursive context — teaching the agent to admit what it hasn't read

From MIT's Recursive Language Models paper to the recursive-context skill in one day: why a huge context window doesn't solve big data, how a fan-out of isolated sub-agents with honest coverage replaced a Python library, and what only live runs could catch — from a baseline that turned out too good to a 14-millisecond bug.

Read
2026-06-29

My Architect, part 8: Event Storming as sequence control, not another diagram

A picture-diagram checks nothing. Event Storming does: every event has a cause, every command an effect. How an agent runs a board through a deterministic sequence analyzer, closes mechanical gaps to zero, and leaves unresolved business questions visible. On a live Forklift project — 5 contexts, 17 gaps caught.

Read
Let's talk

Tell me what you're building.

Describe the problem in a couple of paragraphs — or skip ahead and book a free 30-minute call.

Free 30-min strategy call

calendly.com/d7561985 · no commitment

Get in touch