Resilient Pipelines Guide
This guide demonstrates real-world patterns for building fault-tolerant data pipelines using module call options.
Overview
Resilient pipelines combine multiple options to handle failures gracefully:
- Retry for transient failures
- Timeout to prevent hanging
- Fallback for graceful degradation
- Cache for performance and availability
- Rate control to respect limits
When combining multiple options, be aware of their interactions. For example, timeout applies to each individual attempt, not the total call: a call with retry: 3, timeout: 5s makes one initial attempt plus up to three retries, so it could take up to 20 seconds (plus backoff delays) in the worst case.
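To make that worst case concrete, here is a sketch with the timeline spelled out in comments (SlowService is a hypothetical module, and the delay and backoff values are assumed for illustration):

# Worst case for retry: 3, delay: 1s, backoff: exponential, timeout: 5s:
#   attempt 1: up to 5s (times out), then wait 1s
#   retry 1:   up to 5s (times out), then wait 2s
#   retry 2:   up to 5s (times out), then wait 4s
#   retry 3:   up to 5s (times out)
# Total: 4 attempts x 5s = 20s, plus 7s of backoff = 27s
result = SlowService(request) with
  retry: 3,
  delay: 1s,
  backoff: exponential,
  timeout: 5s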
Pattern 1: External API Integration
When calling external APIs, handle network issues, rate limits, and service unavailability.
in request: ApiRequest
# Resilient API call with full protection
response = ExternalApi(request) with
  retry: 3,
  delay: 1s,
  backoff: exponential,
  timeout: 30s,
  throttle: 100/1min,
  fallback: { success: false, error: "Service unavailable" }
out response
What This Does
- Throttle: Limits to 100 calls/minute (respects API rate limits)
- Timeout: Each attempt limited to 30 seconds
- Retry: Up to 3 retries on failure
- Backoff: Wait 1s, 2s, 4s between retries
- Fallback: Return error object if all retries fail
Pattern 2: Cached Data Loading
For expensive data that doesn't change frequently, use caching with fallback.
in userId: String
# Load with cache and fallback to stale data
userData = LoadUserData(userId) with
  cache: 15min,
  retry: 2,
  timeout: 5s,
  fallback: GetStaleData(userId)
out userData
What This Does
- Cache: Return cached value if available (within 15 min)
- Timeout: Fresh load limited to 5 seconds
- Retry: 2 retry attempts if load fails
- Fallback: Use potentially stale data if all else fails
Pattern 3: Multi-Stage Pipeline
Break complex pipelines into stages with different resilience needs.
in rawData: Record
# Stage 1: Validation (fast, no retry)
validated = Validate(rawData) with
  timeout: 1s,
  on_error: wrap
# Stage 2: Enrichment (external call, needs retry)
enriched = when validated.isRight
  then Enrich(validated.right) with
    retry: 3,
    delay: 500ms,
    timeout: 10s
  else { data: rawData, enriched: false }
# Stage 3: Storage (critical, max retry)
stored = Store(enriched) with
  retry: 5,
  delay: 1s,
  backoff: exponential,
  timeout: 30s,
  priority: high
out stored
What This Does
- Validation: Fail fast; errors are wrapped so later stages can branch on them
- Enrichment: Moderate retry for external service
- Storage: Maximum effort for critical data persistence
Pattern 4: Parallel Processing with Limits
Process items in parallel with resource protection.
in items: List[DataItem]
# Process with concurrency and rate limiting
results = items.map(item =>
  ProcessItem(item) with
    concurrency: 10,
    throttle: 50/1s,
    retry: 2,
    timeout: 5s,
    on_error: skip
)
# Filter successful results
successful = results.filter(r => r.success)
out successful
What This Does
- Concurrency: Max 10 parallel executions
- Throttle: Max 50 items per second
- Retry: 2 retries per item
- On_error: Skip failed items, continue processing
Pattern 5: Priority-Based Processing
Handle different priorities of work appropriately.
in event: Event
# Route based on priority
result = when event.priority == "critical" then
  ProcessCritical(event) with
    priority: critical,
    retry: 5,
    timeout: 60s
else when event.priority == "high" then
  ProcessHigh(event) with
    priority: high,
    retry: 3,
    timeout: 30s
else
  ProcessNormal(event) with
    priority: normal,
    retry: 2,
    timeout: 15s,
    on_error: log
out result
What This Does
- Critical: Maximum retry, longer timeout, highest scheduling priority
- High: Moderate settings, elevated priority
- Normal: Standard settings, log failures
Pattern 6: Lazy Evaluation for Conditional Paths
Defer expensive computations until needed.
in request: Request
# Define but don't execute yet
fullAnalysis = DeepAnalysis(request) with lazy, cache: 1h
quickCheck = FastCheck(request) with cache: 5min
# Decide which path to take
output = when quickCheck.needsFullAnalysis
  then fullAnalysis # Only now is DeepAnalysis executed
  else quickCheck.result
out output
What This Does
- Lazy: DeepAnalysis is only executed if needed
- Cache: Both results are cached for reuse
- Efficiency: Expensive computation avoided when possible
Pattern 7: Circuit Breaker
Combine options to implement circuit breaker behavior.
in request: Request
# Track failures in cache
failureCount = GetFailureCount(request.service) with cache: 1min

# Last known good response to serve while the circuit is open
# (GetLastGoodResponse is a placeholder for your own cache lookup)
cachedData = GetLastGoodResponse(request.service) with cache: 10min

# Circuit breaker logic
result = when failureCount > 5 then
  # Circuit open - return fallback immediately
  { status: "circuit_open", data: cachedData }
else
  # Circuit closed - try the service
  CallService(request) with
    retry: 2,
    timeout: 5s,
    on_error: wrap
# Update failure tracking (skipped when the circuit was open, since no call was made)
finalResult = when failureCount > 5 then
  result
else when result.isLeft then
  IncrementFailures(request.service) >> { status: "failed", error: result.left }
else
  ResetFailures(request.service) >> result.right
out finalResult
Pattern 8: Graceful Degradation
Progressively fall back to simpler services.
in query: SearchQuery
# Try services in order of quality
result = PremiumSearch(query) with
  timeout: 2s,
  fallback: StandardSearch(query) with
    timeout: 3s,
    fallback: BasicSearch(query) with
      timeout: 5s,
      fallback: { results: [], source: "none" }
out result
What This Does
Tries the premium service first, falls back through the standard and basic tiers, and finally returns an empty result if all three fail.
Begin with basic resilience (retry: 2, timeout: 5s) and add options incrementally as you understand your failure modes. Over-engineering resilience can hide underlying issues.
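A minimal starting point might look like the sketch below (ProcessOrder and its input are hypothetical; substitute your own module):

in order: Order

result = ProcessOrder(order) with
  retry: 2,
  timeout: 5s

out result

Add backoff, throttling, caching, or fallbacks one at a time, once monitoring shows a concrete need for them.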
Best Practices Summary
| Pattern | Key Options | Use Case |
|---|---|---|
| API Integration | retry, backoff, throttle, timeout | External services |
| Cached Loading | cache, fallback, retry | Expensive reads |
| Multi-Stage | varying options per stage | Complex pipelines |
| Parallel Processing | concurrency, throttle, on_error | Batch operations |
| Priority-Based | priority, varying retry/timeout | Mixed workloads |
| Lazy Evaluation | lazy, cache | Conditional expensive ops |
| Circuit Breaker | cache, fallback, on_error | Unstable services |
| Graceful Degradation | nested fallback | Service tiers |
Monitor your cache hit rates regularly; the hit rate is hits divided by total lookups, so 450 hits across 1,000 lookups is 45%. A rate below 50% may indicate the cache TTL is too short or the data changes more frequently than expected. Adjust the TTL accordingly.
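For example, if Pattern 2's 15-minute cache shows a low hit rate and the underlying data tolerates some staleness, lengthening the TTL is a reasonable first adjustment (a sketch; the right value depends on how often your data actually changes):

userData = LoadUserData(userId) with
  cache: 60min,  # lengthened from 15min to raise the hit rate
  retry: 2,
  timeout: 5s,
  fallback: GetStaleData(userId)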
Monitoring
Monitor pipeline health with the /metrics endpoint:
# Check cache effectiveness
curl http://localhost:8080/metrics | jq '.cache.hitRate'

# Check retry rates
curl http://localhost:8080/metrics | jq '.execution.retryRate'

# Check throttle queue depth
curl http://localhost:8080/metrics | jq '.rateControl.queueDepth'
See Also
- Module Options Reference - Complete option documentation
- Individual Option Pages - Detailed option guides