The engineering behind quantitative finance: algorithms, systems & infrastructure

Designing Fault-Tolerant Async Trading Services in Python

Posted on Sat 07 March 2026 | Part 4 of Building Real Trading Systems | 20 min read

A production-ready async runtime architecture with explicit supervision and restart discipline, built to keep trading systems correct under failure, stress and load spikes.

Inside DeFi's Hidden Economy: MEV, Mempools, and the Battle for Blockspace

Posted on Sat 21 February 2026 | Part 4 of DeFi Engineering | 17 min read

Step inside DeFi's hidden economy: how mempools, MEV and Flashbots turn transaction ordering into a latency-driven execution game where speed and visibility decide outcomes long before settlement.

From Blocks to State: A Mental Model for Blockchain Systems

Posted on Sat 07 February 2026 | Part 3 of DeFi Engineering | 18 min read

Blockchains are often explained through protocol-specific concepts like blocks or slots. This article reframes them as distributed state-transition systems, where ambiguity and delayed agreement exist to varying degrees across chains.

The Hidden DAG Behind Every Modern Trading System: How Market Data Is Ingested at Scale

Posted on Sat 24 January 2026 | Part 5 of Distributed Systems in Finance | 17 min read

Modern trading systems rely on directed acyclic graphs (DAGs) that branch, merge, and transform real-time feeds into many parallel consumers: matching engines, risk checks, analytics, surveillance, and storage. These ingestion DAGs exist to isolate failure, control fan-out, and preserve latency and correctness under extreme market conditions.

Flow Control in Low-Latency Systems: Batching, Conflation, and Backpressure

Posted on Sat 10 January 2026 | Part 3 of Low-Latency Fundamentals | 15 min read

Low-latency systems fail when work becomes unbounded. Batching, conflation, and backpressure are mechanisms that keep systems stable under bursty, adversarial load. Without them, tail latency and cascading failures are inevitable.

Observability at Scale: Distributed Telemetry for Modern Trading Infrastructure

Posted on Sat 13 December 2025 | Part 4 of Distributed Systems in Finance | 22 min read

How do trading systems observe themselves in real time? This article breaks down the telemetry architecture that keeps distributed systems visible under extreme latency pressure.

How Exchanges Turn Order Books into Distributed Logs

Posted on Sat 06 December 2025 | 22 min read

Every modern exchange is a distributed database in disguise. This article reveals how trading engines transform chaotic streams of buy and sell orders into a perfectly ordered, replayable log, ensuring fairness, determinism, and market data reliability.

Latency Profiling in Python: From Code Bottlenecks to Observability

Posted on Sat 29 November 2025 | Part 2 of Low-Latency Fundamentals | 19 min read

Learn how to pinpoint where time disappears in Python systems, from CPU and I/O bottlenecks to Prometheus metrics and continuous profilers. Explore the tools that turn raw measurements into observability dashboards revealing real performance patterns.

Understanding Latency: From Wire to Code

Posted on Sat 15 November 2025 | Part 1 of Low-Latency Fundamentals | 12 min read

Every microsecond counts, but where do they actually go?

This article traces the hidden journey of a message from the network wire to the application's code, revealing how the NIC, interrupts, syscalls, and runtimes introduce delay at every hop.

Dissecting the Infra of DeFi Protocols

Posted on Sat 08 November 2025 | Part 2 of DeFi Engineering | 15 min read

Most DeFi discussions stop at smart contracts. This article goes deeper: through the data pipelines, executors, keepers, and coordination layers that make protocols run. If you're a software engineer, you'll see how DeFi architecture mirrors the systems you already know.

Older Posts