Software Architecture FAQ

Common questions about microservices, event-driven architecture, pub/sub, message queues, caching, load balancers, and CQRS. Each answer is short. Links go to the full explanation.

When should I use microservices vs a monolith?

Start with a monolith. It is simpler to develop, deploy, and debug. Split when you have concrete reasons:

  • Multiple teams stepping on each other in the same codebase.
  • Different deploy cadences — one team ships daily, another weekly.
  • Independent scaling — one component needs 20 instances, another needs one.
  • Failure isolation — a bug in reporting should not crash checkout.

Microservices solve organizational scaling problems but create technical ones — distributed transactions, network latency, and operational complexity. Most teams that start with microservices on day one end up fighting the architecture instead of building the product.

See How Microservices Work for the full tradeoff analysis.

What is the difference between pub/sub and a message queue?

Pub/sub — one message, many consumers. Every subscriber gets every message. Use it for event notifications, broadcasting, and fan-out (inventory, email, and analytics all react to the same OrderPlaced event).

Message queue — one message, one consumer. Messages are distributed among competing workers. Use it for task distribution and load balancing (10 workers processing video transcoding jobs from a shared queue).

Many systems combine both. AWS SNS (pub/sub) fans out to multiple SQS queues (point-to-point), and each queue has competing consumers.
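The two delivery models can be sketched in a few lines. This is a minimal in-memory illustration, not a real broker — the class and topic names are made up for the example:

```python
from collections import defaultdict, deque

class PubSub:
    """Fan-out: every subscriber receives every published message."""
    def __init__(self):
        self.subscribers = defaultdict(list)

    def subscribe(self, topic, handler):
        self.subscribers[topic].append(handler)

    def publish(self, topic, message):
        for handler in self.subscribers[topic]:
            handler(message)  # every subscriber sees the same message

class MessageQueue:
    """Point-to-point: each message is consumed by exactly one worker."""
    def __init__(self):
        self.queue = deque()

    def enqueue(self, message):
        self.queue.append(message)

    def dequeue(self):
        # Competing consumers each call dequeue; a message is delivered once.
        return self.queue.popleft() if self.queue else None
```

With PubSub, inventory and email both react to the same OrderPlaced event; with MessageQueue, two transcoding workers each pull distinct jobs.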

See How Pub/Sub Works and How Message Queues Work for the full comparison.

How does cache invalidation work?

Four strategies:

  • TTL — entries expire after a fixed time. Simple, predictable, stale for up to TTL seconds.
  • Event-based — when data changes, an event triggers cache deletion. Most accurate, requires event infrastructure.
  • Manual purge — explicitly clear the cache (during deployments, bug fixes).
  • Versioned keys — user:42:v3. Increment the version on changes. Old entries age out.

Most production systems use TTL as a safety net combined with event-based invalidation for fast updates. The hardest bugs come from caches that are never invalidated — the data looks correct until someone notices a stale value that has been wrong for hours.
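The TTL-plus-versioned-keys combination can be sketched as a small in-memory cache. This is an illustrative toy, not Redis — the class shape and 60-second TTL are assumptions for the example:

```python
import time

class Cache:
    """TTL cache with versioned keys.

    set/get use the current version of an entity's key. Invalidation bumps
    the version, making old entries unreachable; the TTL is the safety net
    that eventually evicts them.
    """
    def __init__(self, ttl_seconds):
        self.ttl = ttl_seconds
        self.store = {}     # versioned key -> (value, expires_at)
        self.versions = {}  # entity -> current version number

    def _key(self, entity):
        return f"{entity}:v{self.versions.get(entity, 1)}"

    def set(self, entity, value):
        self.store[self._key(entity)] = (value, time.monotonic() + self.ttl)

    def get(self, entity):
        entry = self.store.get(self._key(entity))
        if entry is None:
            return None
        value, expires_at = entry
        if time.monotonic() > expires_at:  # TTL safety net
            return None
        return value

    def invalidate(self, entity):
        # Event-based invalidation: bump the version on data change.
        self.versions[entity] = self.versions.get(entity, 1) + 1
```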

See How Caching Works for all cache patterns and invalidation strategies.

What is event sourcing?

Event sourcing stores every change as an immutable event instead of overwriting the current state. An order is not a row — it is a sequence of events: OrderCreated, PaymentReceived, OrderShipped. The current state is derived by replaying events.

Benefits: complete audit trail, temporal queries ("what was the state at 3 PM?"), ability to rebuild read models by replaying history. Often paired with CQRS — events are the write side, projections are the read side.

Tradeoffs: harder to query than a relational database, event schemas must evolve carefully, replaying millions of events is slow without snapshots.
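Deriving state by replaying events is just a fold over the history. A minimal sketch using the order events above (the event payloads and state fields are invented for illustration):

```python
def apply(state, event):
    """Fold one event into the current order state."""
    kind, payload = event
    if kind == "OrderCreated":
        return {"status": "created", "items": payload}
    if kind == "PaymentReceived":
        return {**state, "status": "paid", "amount": payload}
    if kind == "OrderShipped":
        return {**state, "status": "shipped"}
    return state  # unknown events are ignored

def replay(events):
    """Current state = replay of the full immutable event history."""
    state = {}
    for event in events:
        state = apply(state, event)
    return state
```

A temporal query ("what was the state at 3 PM?") is just a replay of the prefix of events up to that point.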

See How CQRS Works for how event sourcing and CQRS work together.

What is a circuit breaker in software?

A circuit breaker prevents cascading failures when a dependency is down. Three states:

  1. Closed — requests flow normally. Failure rate is monitored.
  2. Open — too many failures. Requests fail immediately (fast fail). No calls to the broken dependency.
  3. Half-open — after a timeout, test requests are allowed. Success closes the breaker. Failure reopens it.

Without a circuit breaker, a slow dependency causes threads to pile up waiting for timeouts, consuming resources and eventually bringing down the calling service. With a circuit breaker, the failure is contained — the calling service returns a degraded response or error immediately.
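The three-state machine can be sketched in a small class. Production libraries (Resilience4j, Polly) track failure rates over windows; this toy version counts consecutive failures, and the threshold and timeout values are arbitrary:

```python
import time

class CircuitBreaker:
    def __init__(self, failure_threshold=5, reset_timeout=30.0):
        self.failure_threshold = failure_threshold
        self.reset_timeout = reset_timeout
        self.failures = 0
        self.state = "closed"
        self.opened_at = 0.0

    def call(self, func):
        if self.state == "open":
            if time.monotonic() - self.opened_at >= self.reset_timeout:
                self.state = "half-open"  # allow a test request through
            else:
                raise RuntimeError("circuit open: failing fast")
        try:
            result = func()
        except Exception:
            self.failures += 1
            if self.state == "half-open" or self.failures >= self.failure_threshold:
                self.state = "open"  # trip the breaker
                self.opened_at = time.monotonic()
            raise
        self.failures = 0
        self.state = "closed"  # success closes the breaker
        return result
```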

See How Microservices Work for resilience patterns in microservice architectures.

How does a load balancer decide where to send traffic?

Common algorithms:

  • Round-robin — sequential rotation through servers. Default. Fair when servers are equal.
  • Least connections — server with fewest active connections. Adapts when request costs vary.
  • Weighted — servers with higher capacity get more traffic.
  • Consistent hashing — same key always maps to the same server. Good for cache locality.
  • Random (power of two choices) — pick two servers at random, choose the less loaded one. Near-optimal with minimal overhead.

Round-robin is the starting point. Switch to least connections when request processing times vary significantly. Use consistent hashing when cache locality matters.
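Two of these algorithms fit in a few lines each. A sketch, assuming the load balancer tracks active connection counts per server (the server names are placeholders):

```python
import random
from itertools import cycle

def round_robin(servers):
    """Sequential rotation: returns a picker yielding the next server."""
    it = cycle(servers)
    return lambda: next(it)

def power_of_two_choices(loads):
    """Pick two servers at random, route to the less loaded one.

    `loads` maps server name -> active connection count.
    """
    a, b = random.sample(list(loads), 2)
    return a if loads[a] <= loads[b] else b
```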

See How Load Balancing Works for Layer 4 vs Layer 7, health checks, and session affinity.

What is CQRS and when should I use it?

CQRS separates the read model from the write model. Commands go to a normalized write database. Queries go to a denormalized read database (or multiple — Elasticsearch for search, Redis for lookups, ClickHouse for analytics).

Use CQRS when:

  • Read-to-write ratio is 100:1 or higher.
  • Reads and writes need different scaling (add read replicas without touching the write path).
  • Multiple query patterns need different data shapes.
  • You need an audit trail (event sourcing).

Do not use CQRS for simple CRUD applications where one database with proper indexes serves both reads and writes efficiently. The overhead of maintaining two models, an event pipeline, and eventual consistency is not justified.
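The read/write split can be illustrated with one class. This toy projects synchronously; in production the projection runs asynchronously off an event stream, which is where eventual consistency comes from. The catalog domain and field names are invented for the example:

```python
class Catalog:
    """CQRS sketch: commands mutate a normalized write model; a projection
    maintains a denormalized read model tuned for lookups."""
    def __init__(self):
        self.write_db = {}  # normalized: product_id -> row
        self.read_db = {}   # denormalized: name -> display string

    def handle_command(self, product_id, name, price_cents):
        self.write_db[product_id] = {"name": name, "price_cents": price_cents}
        self._project(product_id)  # async in production (eventual consistency)

    def _project(self, product_id):
        row = self.write_db[product_id]
        self.read_db[row["name"]] = f'{row["name"]} (${row["price_cents"] / 100:.2f})'

    def query(self, name):
        # Reads never touch the write model.
        return self.read_db.get(name)
```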

See How CQRS Works for the full architecture.

What is a saga pattern?

A saga is a sequence of local transactions across multiple services, each with a compensating action that undoes it if a later step fails. It replaces distributed ACID transactions in microservice architectures.

Example: Create order -> Charge payment -> Reserve inventory. If inventory reservation fails, the saga runs compensating actions in reverse: refund payment, cancel order.

Two coordination patterns: choreography (services react to events autonomously) and orchestration (a central coordinator manages the flow). Orchestration is easier to understand and debug. Choreography is more decoupled.
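An orchestrated saga is essentially a loop with a compensation stack. A minimal sketch (the step functions here stand in for real service calls):

```python
def run_saga(steps):
    """Run each (action, compensate) pair in order. On failure, run the
    compensations for all completed steps in reverse order."""
    completed = []
    for action, compensate in steps:
        try:
            action()
            completed.append(compensate)
        except Exception:
            for comp in reversed(completed):
                comp()  # undo in reverse: refund payment, then cancel order
            return False
    return True
```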

See How Distributed Transactions Work for the full distributed transaction landscape.

What is the difference between event-driven and request-driven architecture?

Request-driven — Service A calls Service B and waits for a response. A knows about B. They are coupled in time (A waits), availability (B must be up), and knowledge (A imports B's API).

Event-driven — Service A emits an event and moves on. Service B processes the event asynchronously. A does not know B exists. They are decoupled in all three dimensions.

Most production systems use both. Synchronous calls (REST, gRPC) for operations that need an immediate response. Asynchronous events for operations that can tolerate delay — notifications, analytics, indexing, cross-service data propagation.

See How Event-Driven Architecture Works for the full pattern.

What is a service mesh?

A service mesh is infrastructure for managing service-to-service communication. Sidecar proxies (typically Envoy) are deployed alongside each service, intercepting all traffic. They handle:

  • Mutual TLS — encrypted, authenticated communication between services.
  • Retries and timeouts — automatic retry with backoff.
  • Circuit breaking — stop calling failing dependencies.
  • Observability — metrics, traces, and logs for every request.
  • Traffic management — canary deployments, A/B testing, rate limiting.

A control plane (Istio, Linkerd) configures all the proxies. The application code is unchanged.

The tradeoff: added latency (two proxy hops per call), resource overhead (a sidecar per pod), and operational complexity. Service meshes are most valuable in large deployments (50+ services) with strict security and observability requirements.

How do microservices communicate?

Two patterns:

Synchronous — REST or gRPC. One service calls another and waits for the response. Simple, easy to reason about. Creates runtime dependencies — if the called service is down, the caller fails.

Asynchronous — events and message queues. One service publishes an event or enqueues a message. Other services process it later. Decoupled in time and availability. Introduces eventual consistency.

gRPC is the standard for synchronous service-to-service calls (efficient binary protocol, strong typing, streaming). Kafka is the standard for asynchronous event streaming. REST is common for external APIs and simpler internal calls.

See How Microservices Work for communication patterns and tradeoffs.