Fractional CTO · AI Architecture

Turning business vision into systems that scale.

I'm Dzmitry Harupa, Head of Architecture. 17 years of high-load systems, 10 of them in iGaming. I design architecture, take AI/ML to production and still write code myself. TOGAF, SAFe and AWS certified. 100% remote.

Open to new projects · Warsaw · 100% remote

How we work

Four ways to start.

From a one-off consultation to a fractional CTO role — pick the depth you need now.

Advisory sessions

Architecture reviews and roadmap sessions. One call — specific recommendations.

Free 30-min intro call
Deep advisory session (90 min)
Expert consulting packs

POC & delivery

From concept to a working prototype. Architecture documentation is part of the result.

POC Starter: scope & architecture
POC Build: working prototype
Architecture package with ADRs

Ongoing partnership

Part-time technology leadership: from monthly advisory to a full CTO roadmap.

Advisor Lite: monthly guidance
Strategic CTO advisor
Fractional CTO: full roadmap

Workshops & training

Hands-on workshops for your team: architecture, AI/ML, working with ADRs.

Architecture workshops
AI/ML strategy sessions
Remote & on-site

What I deliver

Strategy through code — one architect, the whole stack.

Enterprise architecture, AI/ML and ten years of iGaming domain knowledge — hands-on, in code.

Technology strategy & roadmap

IT strategy aligned with business goals
Product roadmap
TCO optimization & FinOps

Cloud & enterprise architecture

AWS, GCP, Kubernetes
Microservices & event-driven design
TOGAF enterprise architecture

AI/ML strategy & implementation

AWS Bedrock, SageMaker
LLM, RAG & AI agents
AI process automation

LLM & GenAI services

High-performance development

Go, TypeScript, Python, C++
SAFe & Lean-Agile delivery
DevOps and CI/CD pipelines

iGaming expertise

Platform architecture: PAM, payments
Game math, RNG development
Regulatory compliance

Documentation & standards

Architecture Decision Records (ADR)
Technical specifications
Team standards & guidelines

Why fractional CTO

Senior expertise without a full-time hire.

Cost-effective

Senior-level expertise without a full-time salary and overhead.

Flexible scale

The scope changes with your needs. You pay for hours, not a seat.

Fast start

Hiring a CTO takes months. Here, the first call happens this week.

Low risk

Start with a free consultation. A retainer comes after you see results.

Hands-on delivery

Working prototypes, executable architectures, code in production.

100% remote

Flexible timezone coverage and async communication.

Track record

Outcomes you can measure.

.2M

Annual savings from AI automation

200K

Messages per second across 1000+ systems

+23%

ARPU via CRM event pipeline

25%

Cloud cost reduction via FinOps

AI call-center automation

.2M saved per year on AWS Bedrock & SageMaker.

NATS event mesh at scale

200K+ msgs/sec across 1000+ systems, <100ms p99 latency.

Monolith → microservices

40% of transactional load migrated; release lead time down 57%, deploys under 15 minutes.

Decisions on data. Results in numbers.

— grew the architecture practice 6 → 12 architects + 20 analysts; led 200+ engineers

Dzmitry Harupa

Head of Architecture

Warsaw, Poland · 100% remote

17+ years · 10 in iGaming

English C1 · Polish · Russian

About

An architect who stays until it ships.

Ten years in iGaming: Casino, Sports, Poker, PAM, payments and CRM. Production AI/ML on AWS Bedrock, SageMaker and LangGraph.

I grew an architecture practice from 6 to 12 architects plus 20 system analysts, mentored three of them to Head-of-Technology roles and led 200+ engineers. I still write production code.

Technology stack

Go · Python · TypeScript · C++ · AWS · Kubernetes · Kafka · NATS · ClickHouse · Terraform · Bedrock · SageMaker · LangGraph · Grafana · OpenTelemetry

Certifications

SAFe® 6 Agilist · AWS Solutions Architect · TOGAF® EA Practitioner · ArchiMate® 3 · CKAD · Product Metrics & Analytics

Writing

Notes on architecture and AI.

What I work with: architecture, AI/ML, iGaming.

2026-07-04

My Architect, part 10: a repo memory for the agent — a code graph it's not allowed to trust

The recursive-context story continues: four releases in one day teach the agent a persistent code graph (Graphify) under strict distrust rules — freshness gate, facts only from live files. Plus a controlled A/B test of graph vs grep on NestJS: −71% tokens on subsystem understanding, parity elsewhere — and the hidden sub-agent costs that make naive benchmarks lie.

2026-07-03

My Architect, part 9: Recursive context — teaching the agent to admit what it hasn't read

From MIT's Recursive Language Models paper to the recursive-context skill in one day: why a huge context window doesn't solve big data, how a fan-out of isolated sub-agents with honest coverage replaced a Python library, and what only live runs could catch — from a baseline that turned out too good to a 14-millisecond bug.

2026-06-29

My Architect, part 8: Event Storming as sequence control, not another diagram

A picture-diagram checks nothing. Event Storming does: every event has a cause, every command an effect. How an agent runs a board through a deterministic sequence analyzer, closes mechanical gaps to zero, and leaves unresolved business questions visible. On a live Forklift project — 5 contexts, 17 gaps caught.

Let's talk

Tell me what you're building.

Describe the problem in a couple of paragraphs — or skip ahead and book a free 30-minute call.

Free 30-min strategy call

calendly.com/d7561985 · no commitment