SASE+ Pattern Matching Guide
Advanced guide to SASE+ pattern matching in Varpulis for detecting complex event sequences.
Overview
SASE+ (Sequence Algebra for Stream Events) is a pattern matching algorithm for Complex Event Processing. Varpulis implements SASE+ following the SIGMOD 2006 paper "High-Performance Complex Event Processing over Streams" by Wu, Diao, and Rizvi, together with the SASE+ extensions described in the SIGMOD 2008 follow-up.
Key Features
- NFA-based matching: Efficient finite automaton execution
- Kleene closures: Match one or more (+) or zero or more (*) events
- Negation: Detect absence of events within time windows
- Logical operators: AND (any order), OR (either)
- Partition-by optimization: Independent matching per partition key
- Temporal constraints: Patterns must complete within time bounds
Pattern Syntax
Varpulis uses arrow (->) syntax for both named patterns and inline stream expressions. Both compile to the same NFA engine.
Named Patterns
Use pattern Name = ... to declare a reusable named pattern:
pattern BruteForce = AuthEvent where status == "failed" as first
-> all AuthEvent where status == "failed" as fails
-> AuthEvent where status == "success" as success
within 30m partition by source_ip

Within an arrow item the order is: [all] [NOT] EventType [where filter] [as alias]. Use all for Kleene-plus accumulation.
Inline Stream Patterns
Use -> inside stream expressions for a chainable style:
stream FraudAlert = login as l
-> transfer as t .within(5m)
.where(l.user_id == t.user_id && t.amount > 5000)
.emit(alert: "Suspicious transfer", user: l.user_id)

For Kleene-plus inside stream expressions, use the all keyword:
stream BruteForce = LoginFailed as f
-> all LoginFailed where user_id == f.user_id as fails
-> LoginSuccess where user_id == f.user_id as success
.within(30m)
.partition_by(user_id)
.emit(failures: count(fails) + 1)

When to Use Which
| | Named pattern | Inline stream expression |
|---|---|---|
| Reuse across multiple streams | Yes | No |
| Chaining .emit(), .forecast() | Via separate stream = Pattern.emit(...) | Inline |
| Readability for complex logic | Good | Good for short pipelines |
Selection and Emission Modes
Pattern matching has two orthogonal axes that control how matches are produced. Most users never need to think about them — the defaults match what practitioners expect — but understanding them is essential for advanced patterns.
Quick Reference
stream X = ... pattern ...
.stam() // selection: how runs are spawned (default)
.each() // emission: how matches are produced (default)
.emit(...)

| Axis | Operators | Default |
|---|---|---|
| Selection strategy | .strict(), .stnm(), .stam() | .stam() |
| Emission mode | .each(), .longest(), .subsets() | .each() (or .longest() for monotonic) |
Selection Strategies
Selection controls how runs are spawned and which events extend them when multiple events of the same type arrive.
.strict() — Strict Contiguity
Events must be adjacent in the stream, with no irrelevant events between them. Use for regex-like matching where order and adjacency both matter.
# Match only when A is immediately followed by B (no events between)
pattern AdjacentAB = A -> B
stream Strict = AdjacentAB.strict().emit(...)

If events arrive A, X, B, the X breaks contiguity and the pattern fails. Useful for log line parsing, network protocol parsing, and DNA sequence matching.
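Strict contiguity amounts to requiring the pattern to appear as an adjacent slice of the stream. A minimal Python sketch of that check (illustrative only — not the Varpulis NFA):

```python
def strict_match(pattern, events):
    """Return True if `pattern` (a list of event-type names) occurs as an
    immediately adjacent subsequence of `events` — no skipping allowed."""
    n, m = len(events), len(pattern)
    # Try every anchor position; all pattern steps must be adjacent.
    return any(events[i:i + m] == pattern for i in range(n - m + 1))

print(strict_match(["A", "B"], ["A", "X", "B"]))  # False — X breaks contiguity
print(strict_match(["A", "B"], ["X", "A", "B"]))  # True — A immediately followed by B
```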
.stnm() — Skip-Till-Next-Match
Skip irrelevant events until the next matching event. Each event participates in at most one match (no overlapping runs). The most "intuitive" mode for first-match-wins detection.
# Skip irrelevant events; one maximal match per anchor
stream FirstMatch = Login -> Action -> Logout
.stnm()
.within(1h)
.emit(...)

.stam() — Skip-Till-Any-Match (default)
Skip irrelevant events with non-deterministic branching: any event that could start a new pattern instance does so, in addition to extending existing runs. This is the SASE+ paper's most permissive mode.
# Multiple overlapping runs, each anchored at a different Login
stream OverlappingFraud = Login as l -> Transaction as t
.stam()
.where(t.amount > 5000)
.emit(...)

With events Login1, Login2, Transaction1, you get two matches: (Login1, Transaction1) and (Login2, Transaction1).
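The overlapping-run behaviour can be sketched in a few lines of Python for the two-step Login -> Transaction case (an illustration of the semantics described above, not the engine itself):

```python
def stam_matches(events):
    """Skip-till-any-match for Login -> Transaction: every Login anchors its
    own run, and each Transaction completes every open run, so runs overlap."""
    open_logins, matches = [], []
    for etype, eid in events:
        if etype == "Login":
            open_logins.append(eid)       # each Login starts a new run
        elif etype == "Transaction":
            for login_id in open_logins:  # extend every open run
                matches.append((login_id, eid))
    return matches

events = [("Login", "Login1"), ("Login", "Login2"), ("Transaction", "Tx1")]
print(stam_matches(events))  # [('Login1', 'Tx1'), ('Login2', 'Tx1')]
```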
Default: .stam(). Most CEP use cases want overlapping matches.
Emission Modes
Emission controls how many MatchResults a completed run produces. This is the axis the SASE+ paper conflates with selection — separating them gives finer control.
.each() — Fire on Each Kleene Event (default)
Emit one match per Kleene event extension. Linear in the size of the Kleene closure. Most ergonomic for "for each B captured, do something" patterns.
# Each fail event triggers one alert
stream FailedAttempt = LoginFailed as fail
.each() # default — can be omitted
.emit(alert: "Login failed", user: fail.user_id)

For Start -> all B as b -> End with 5 Bs, .each() emits 5 matches as the Bs arrive (the End terminator confirms but doesn't add an extra emit).
.longest() — Emit Once at Completion
Emit one consolidated match when the pattern completes (terminator arrives or Kleene self-loop breaks). The match contains the longest captured sequence; the alias is bound to the last captured event of that sequence.
# Brute force: one alert when login finally succeeds, with the full failure history
pattern BruteForce = LoginFailed as first
-> all LoginFailed as fails
-> LoginSuccess as success
within 30m partition by user_id
stream Alert = BruteForce
.longest()
.emit(user: first.user_id, num_fails: count(fails) + 1)

For Start -> all B as b -> End with 5 Bs, .longest() emits 1 match at End with b bound to B5.
Auto-resolved for monotonic patterns: .increasing() and .decreasing() automatically use .longest() because users want one "rising sequence ended" alert, not one per data point. Override with .increasing(temp).each().
.subsets() — Paper-Correct STAM Verbose (expert mode)
Emit one match per non-empty subset of the Kleene capture. For N captured Kleene events, this produces 2^N − 1 matches. This is the textbook SASE+ STAM verbose output (SIGMOD 2008 §4.2).
# Academic mode: enumerate every subset of the captured Bs
stream PaperMode = A -> all B as b -> C
.subsets()
.emit(...)For SEQ(A, B+, C) with 3 Bs, .subsets() produces 2³ − 1 = 7 matches:
{B1}, {B2}, {B3}, {B1, B2}, {B1, B3}, {B2, B3}, {B1, B2, B3}
Cost: exponential in the number of Kleene events. Capped at MAX_ENUMERATION_RESULTS = 10_000 to prevent memory blowup. Use only when you specifically need formal SASE+ verbose semantics or to feed downstream consumers expecting subset enumeration.
Warning: .subsets() is for spec compliance and academic correctness. For practical use cases, prefer .each() (linear) or .longest() (constant).
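The subset blow-up and the enumeration cap are easy to see concretely — a Python sketch of the enumeration (the cap value mirrors MAX_ENUMERATION_RESULTS from the text; everything else is illustrative):

```python
from itertools import combinations

def subset_matches(kleene_events, cap=10_000):
    """Enumerate every non-empty subset of the captured Kleene events, as
    .subsets() does: 2^N - 1 results, truncated at `cap` to bound memory."""
    out = []
    for size in range(1, len(kleene_events) + 1):
        for combo in combinations(kleene_events, size):
            if len(out) == cap:
                return out            # enumeration cap reached
            out.append(combo)
    return out

print(len(subset_matches(["B1", "B2", "B3"])))  # 7 == 2**3 - 1
```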
Combining Modes
Modes are independent and can be combined:
# STNM selection + Each emission
stream X = A -> all B as b -> C
.stnm()
.each()
.emit(...)

Kleene-bound Variables as Arrays
Per the SASE+ paper, a Kleene-bound variable like b in all B as b is conceptually a sequence/array of captured events, not a single event. Varpulis supports this with the following syntax:
stream X = Start -> all Reading as b -> End
.longest()
.emit(
count: b.LEN, # number of captured Bs
first_id: b[0].id, # first captured B's id
last_id: b[b.LEN - 1].id, # equivalent to b.id (shortcut)
all_ids: collect(b.id), # array of all ids
all_vals: collect(b.val) # array of all values
)

| Expression | Meaning |
|---|---|
| b.id | Field of the last captured B (ergonomic shortcut) |
| b.LEN or count(b) | Number of captured Bs |
| b[i].field | Field of the i-th captured B (zero-indexed) |
| b[0].field / first(b).field | First captured B's field |
| b[b.LEN - 1].field / last(b).field | Last captured B's field |
| collect(b.field) | List of field values across all captured Bs (returns Value::Array) |
| sum(b.field), avg(b.field), min(b.field), max(b.field) | Numeric aggregates over the captured Bs |
| distinct_count(b.field) | Number of distinct values for field |
Note: b.field (without .LEN or indexing) returns the last captured event's field. This is a Varpulis-specific ergonomic shortcut. To get a specific element, use b[i].field. To get a list, use collect(b.field).
Paper reference: SASE+ (SIGMOD 2008) Query 3 uses a.LEN for sequence length and a[a.LEN] for the last element. Varpulis uses zero-indexing (b[0], b[b.LEN - 1]) following common programming conventions.
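If it helps to think in ordinary programming terms, the capture semantics map directly onto list operations — a Python analogy with hypothetical captured events (not the Varpulis API):

```python
# Captured Kleene events as a list of dicts (one per captured B).
b = [{"id": 1, "val": 10.0}, {"id": 2, "val": 12.5}, {"id": 3, "val": 11.0}]

length   = len(b)                       # b.LEN / count(b)
first_id = b[0]["id"]                   # b[0].id / first(b).id
last_id  = b[len(b) - 1]["id"]          # b[b.LEN - 1].id — the b.id shortcut
all_ids  = [e["id"] for e in b]         # collect(b.id)
total    = sum(e["val"] for e in b)     # sum(b.val)
distinct = len({e["id"] for e in b})    # distinct_count(b.id)

print(length, first_id, last_id, all_ids, total, distinct)
# 3 1 3 [1, 2, 3] 33.5 3
```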
Comparison Table
For pattern SEQ(A, B+, C) with events A, B1, B2, B3, C:
| Mode | # matches | Bindings |
|---|---|---|
| .each() (default) | 3 | b=B1, b=B2, b=B3 (one per B) |
| .longest() | 1 | b=B3 (last captured) |
| .subsets() | 7 | one per non-empty subset of {B1,B2,B3} |
For pattern Start -> all B as b -> End with 9 Bs and an End terminator:
| Mode | # matches |
|---|---|
| .each() | 9 |
| .longest() | 1 |
| .subsets() | 511 |
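The two tables above follow from three closed-form counts — a quick sanity check in Python:

```python
def match_counts(n):
    """MatchResults produced for Start -> all B -> End with n captured Bs,
    one entry per emission mode (per the comparison tables above)."""
    return {"each": n, "longest": 1, "subsets": 2 ** n - 1}

print(match_counts(3))  # {'each': 3, 'longest': 1, 'subsets': 7}
print(match_counts(9))  # {'each': 9, 'longest': 1, 'subsets': 511}
```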
Choosing the Right Mode
| Use case | Recommended | Why |
|---|---|---|
| "For each event, do X" | .each() (default) | One emit per Kleene step is most intuitive |
| "Alert when pattern completes, summarize" | .longest() | One consolidated emit with bound aggregates |
| "Detect rising/falling trends" | .increasing() / .decreasing() | Auto-resolves to .longest() |
| "Enumerate every match per SASE+ paper" | .subsets() | Spec-compliant verbose mode |
| "Strict log line parsing" | .strict() selection | No skipping allowed |
| "First match wins" | .stnm() selection | One non-overlapping match per anchor |
Pattern Types
Sequence
Events must occur in the specified order.
pattern ThreeStep = A -> B -> C within 5m
# Or equivalently in a stream expression:
stream ThreeStep = A -> B -> C .within(5m)

NFA Structure:
[Start] -> [Match A] -> [Match B] -> [Match C] -> [Accept]

Kleene Plus (+ / all)
One or more occurrences of an event type.
# Named pattern syntax
pattern BruteForce = all LoginFailed as fails
-> LoginSuccess as success
within 10m partition by user_id
# Stream expression syntax (same `all` keyword)
stream BruteForce = LoginFailed as first
-> all LoginFailed as fails
-> LoginSuccess as success
.within(10m)
.partition_by(user_id)

NFA Structure:
[Start] -> [Match Event] -> [Accept]
                ^     |
                +-----+  (self-loop)

Implementation Notes:
- Uses a stack to track Kleene state
- Each match pushes to the stack
- Non-greedy by default (first complete match)
Kleene Star (*)
Zero or more occurrences.
// Start, any events, then end
pattern Session = SessionStart -> Activity* -> SessionEnd
// Optional middleware
pattern Request = ClientRequest -> Middleware* -> ServerResponse

Key Difference from +:
- A* accepts immediately (zero matches allowed)
- A+ requires at least one match
Negation (NOT)
Detect the absence of an event within a time window.
// Order not confirmed within 1 hour
pattern UnconfirmedOrder =
OrderPlaced -> NOT(OrderConfirmed) within 1h
// Payment started but never completed
pattern AbandonedPayment =
PaymentStart -> NOT(PaymentComplete) within 5m

How Negation Works:
- The NFA enters a "negation state" after matching the preceding pattern
- A timeout is set based on the within clause
- If the negated event occurs before timeout, the pattern fails
- If timeout expires without the event, the pattern succeeds
Implementation: See NegationInfo and StateType::Negation in sase.rs:108
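The steps above can be sketched as a timeout check over a time-ordered event list — illustrative Python, assuming the whole window is available (a real engine emits the match when the timer fires, not at end-of-stream):

```python
from datetime import datetime, timedelta

def negation_match(events, within=timedelta(hours=1)):
    """Sketch of OrderPlaced -> NOT(OrderConfirmed) within 1h.
    `events` is a time-ordered list of (timestamp, type). Returns True when
    no OrderConfirmed arrives within `within` of the first OrderPlaced."""
    placed_at = None
    for ts, etype in events:
        if etype == "OrderPlaced" and placed_at is None:
            placed_at = ts                    # enter the negation state
        elif etype == "OrderConfirmed" and placed_at is not None:
            if ts - placed_at <= within:
                return False                  # negated event arrived in time
    return placed_at is not None              # timeout expired -> match

t0 = datetime(2024, 1, 1, 12, 0)
print(negation_match([(t0, "OrderPlaced")]))  # True — never confirmed
print(negation_match([(t0, "OrderPlaced"),
                      (t0 + timedelta(minutes=30), "OrderConfirmed")]))  # False
```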
AND (Any Order)
Both patterns must match, but order doesn't matter.
// Both documents required (any order)
pattern BothDocs =
AND(DocumentA, DocumentB) within 1h
// Application complete when both submitted
pattern ApplicationComplete =
AND(FormSubmitted, PaymentReceived) within 24h

Implementation:
- Creates an AND state that tracks which branches have matched
- Accepts when all branches are satisfied
- See AndConfig and StateType::And in sase.rs
OR (Either)
Either pattern matches.
// Accept either payment method
pattern PaymentReceived =
OR(CreditCard, BankTransfer)
// Multiple termination conditions
pattern SessionEnd =
OR(Logout, Timeout, ForceDisconnect)

Predicates
Field Comparisons
pattern HighValue =
Transaction[amount > 10000]
pattern SpecificUser =
Login[user_id == "admin" and ip != "10.0.0.1"]

Operators: ==, !=, <, <=, >, >=
Reference Comparisons
Reference fields from earlier events using aliases:
pattern SameUser =
Login as login
-> Activity[user_id == login.user_id] as activity
-> Logout[user_id == login.user_id]
within 1h

Compound Predicates
pattern Complex =
Event[(value > 100 and status == "active") or priority == "high"]

Temporal Constraints
Pattern-Level Constraints
Apply to the entire pattern:
pattern MustBeQuick =
Start -> Middle -> End
within 5m

Per-Transition Constraints
Different timeouts for different steps:
pattern VariableTiming =
FastEvent
-> SlowEvent within 1h
-> FinalEvent within 5m

Timeout Handling
When a pattern times out:
- Partial matches are discarded
- Resources are freed
- No match is emitted
Rising & Monotonic Patterns (v0.10.0)
Self-referencing Kleene predicates let you detect strictly monotonic trends — rising temperatures, escalating prices, increasing severity — where each event must compare against the previous captured event.
For convenience, Varpulis provides .increasing(field) and .decreasing(field) operators that generate the self-referencing predicate automatically.
Using .increasing() / .decreasing() (recommended)
# Strictly increasing temperature
stream RisingTemp = TempReading -> all TempReading.increasing(temperature) as rising
.partition_by(sensor_id)
.emit(sensor: rising.sensor_id, max: rising.temperature, count: count(rising))
# Strictly decreasing pressure
stream DropPressure = Sensor -> all Sensor.decreasing(pressure) as falling
.partition_by(sensor_id)
.emit(sensor: falling.sensor_id, min: falling.pressure)

Default emission: .increasing() / .decreasing() automatically use .longest() mode — one alert when the trend ends. Override with .each() for per-event emissions:
stream EachRise = TempReading -> all TempReading.increasing(temperature) as rising
.partition_by(sensor_id)
.each() # one emit per rising event instead of one at break
.emit(...)

Manual Self-Reference (advanced)
If you need finer control over the predicate, you can write the self-reference explicitly:
pattern StrictlyRising = SensorReading as first
-> all SensorReading where temperature > rising.temperature as rising
-> SensorReading where temperature < first.temperature as drop
within 5m partition by sensor_id

How it works: The predicate temperature > rising.temperature compares each new event against the last captured Kleene event (not the first). The alias rising refers to itself — the engine:
- First Kleene event: Compares against the previous step's event (the anchor first), or skips if the anchor doesn't carry the field
- Subsequent events: Evaluates temperature > rising.temperature against the previously captured event
- Terminator: The final SensorReading where temperature < first.temperature closes the pattern
Strictly Decreasing Values
pattern CoolingDown = SensorReading as first
-> all SensorReading where temperature < cooling.temperature as cooling
within 10m partition by sensor_id

Escalating Severity
pattern Escalation = Alert as first
-> all Alert where severity > escalating.severity as escalating
within 1h partition by host
stream EscalationAlert = Escalation
.where(count(escalating) >= 3)
.emit(
host: first.host,
initial_severity: first.severity,
final_severity: escalating.severity,
steps: count(escalating)
)

Aggregating Over Trends
When you need statistics (count, average) over rising trends rather than individual matches, use .trend_aggregate() with the Hamlet engine for O(n) performance:
stream TrendStats = StockTick as first
-> all StockTick where price > first.price as rising
.within(60s)
.partition_by(symbol)
.trend_aggregate(count: count_trends())
.emit(symbol: first.symbol, trends: count)

Note: Self-referencing predicates (rising.temperature) compare against the previous Kleene event. Cross-referencing predicates (first.temperature) compare against a fixed earlier event.
Partition Strategies
Partition-By Attribute
Process patterns independently per key:
pattern PerUser =
Login+ -> Logout
within 1h
partition by user_id

Benefits:
- Parallel processing across partitions
- Memory isolation (one partition doesn't affect others)
- Natural grouping for user/device/session patterns
Memory Impact:
- Each partition maintains its own NFA state
- N partitions = N × base memory
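The per-partition isolation can be pictured as one matcher state per key — a Python sketch with a deliberately tiny Login -> Logout "NFA" (hypothetical step function, not the real state machine):

```python
from collections import defaultdict

def run_partitioned(events, step):
    """Keep one independent matcher state per partition key, as
    `partition by user_id` does. `step(state, event) -> (state, match_or_None)`."""
    states = defaultdict(lambda: None)   # partition key -> matcher state
    matches = []
    for ev in events:
        key = ev["user_id"]
        states[key], m = step(states[key], ev)
        if m is not None:
            matches.append(m)
    return matches

def step(state, ev):
    """Toy Login -> Logout matcher (illustrative only)."""
    if ev["type"] == "Login":
        return "logged_in", None
    if ev["type"] == "Logout" and state == "logged_in":
        return None, ev["user_id"]       # pattern complete for this partition
    return state, None

events = [{"type": "Login", "user_id": "a"},
          {"type": "Login", "user_id": "b"},
          {"type": "Logout", "user_id": "a"}]
print(run_partitioned(events, step))  # ['a'] — b's partition is unaffected
```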
Without Partitioning
Global pattern matching across all events:
pattern GlobalPattern =
SystemAlert -> AdminResponse
within 30m

Event Selection Strategies
SASE+ supports different strategies for selecting events when multiple matches are possible.
Skip-Till-Any-Match (Default)
Match as many patterns as possible, potentially with overlapping events.
// Given events: A1, B1, A2, B2
// Pattern: A -> B
// Matches: (A1, B1), (A1, B2), (A2, B2)

Skip-Till-Next-Match
Each event participates in at most one match.
// Given events: A1, B1, A2, B2
// Pattern: A -> B
// Matches: (A1, B1), (A2, B2)Strict Contiguity
Events must be immediately adjacent (no skipping).
// Given events: A1, C1, B1
// Pattern: A -> B (strict)
// Matches: none (C1 breaks contiguity)

Debugging Patterns
Verbose Output
Use --verbose with simulation to see pattern state:
varpulis simulate -p rules.vpl -e events.evt --verbose

Pattern Tracing
Enable trace logging:
RUST_LOG=varpulis_runtime::sase=trace varpulis simulate ...

Common Issues
1. Pattern Never Matches
Possible causes:
- Event types don't match exactly (case-sensitive)
- Predicates are too restrictive
- Timeout is too short
Debug steps:
// Remove predicates to test basic matching
pattern Debug1 = A -> B within 1h
// Add predicates back one at a time
pattern Debug2 = A[field > 0] -> B within 1h

2. Too Many Matches
Possible causes:
- Missing predicates to constrain matches
- Skip-till-any-match creating overlapping matches
- Missing partition by causing cross-user matches
Solution:
// Add partition to isolate matches
pattern Isolated =
Login -> Action -> Logout
within 1h
partition by user_id

3. Memory Growth
Possible causes:
- Unbounded Kleene closure (+ or *)
- Too many partitions
- Timeout too long
Solutions:
// Limit Kleene matches
pattern Limited =
Event+ within 5m // Natural bound via timeout
// Reduce partition cardinality
partition by category // Use low-cardinality field

4. Negation Not Triggering
Possible causes:
- Event type mismatch in NOT clause
- Timeout too short (event arrives just after)
- Negated event arriving before pattern start
Debug:
// Ensure event types match exactly
pattern Debug =
Start -> NOT(Exactly_This_Type) within 10m

Performance Considerations
NFA Complexity
| Pattern | States | Transitions |
|---|---|---|
| A -> B | 3 | 2 |
| A -> B -> C | 4 | 3 |
| A+ | 2 | 2 (with self-loop) |
| AND(A, B) | 4 | 4 |
| A -> B+ -> C | 4 | 4 |
Memory Usage
Memory = O(partitions × active_runs × events_per_run)

Recommendations:
- Use short timeouts to limit active runs
- Partition by low-cardinality fields
- Avoid unbounded Kleene without timeout
Throughput
Typical throughput on modern hardware:
- Simple patterns: 500K+ events/sec
- Complex patterns with Kleene: 100K+ events/sec
- Patterns with ZDD optimization: 200K+ events/sec
ZDD Optimization
Varpulis uses Zero-suppressed Decision Diagrams (ZDD) to represent Kleene capture combinations during SASE+ pattern matching. When a Kleene pattern like A -> B+ -> C matches many B events, ZDD compactly encodes all possible subsets — e.g., 100 matching B events produce ~100 ZDD nodes instead of 2^100 explicit combinations.
Benefits
- Exponential compression of Kleene match combinations
- Efficient subset/superset operations for match enumeration
- Reduced memory for patterns with many Kleene captures
When ZDD Helps
- Kleene+ patterns with many matching events
- Patterns with overlapping match candidates
- High-throughput streams where match combination count would explode
Implementation
See varpulis-zdd crate for ZDD data structures and sase.rs for integration with the pattern matcher.
Note: ZDD is used for pattern matching (Kleene captures). For trend aggregation over patterns, Varpulis uses the Hamlet engine — see trend aggregation.
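The compression claim is easy to verify with a toy chain-shaped ZDD: when both branches of every node fall through to the next variable, n nodes encode all 2^n subsets. A self-contained sketch of that idea (not the varpulis-zdd implementation):

```python
TOP = "TOP"  # terminal representing the family containing the empty set
BOT = "BOT"  # terminal representing the empty family

def all_subsets_zdd(n):
    """Build the ZDD for the family of ALL subsets of {x1..xn}: a chain of
    n nodes (var, lo, hi) where both branches share the next node."""
    node = TOP
    for var in range(n, 0, -1):
        node = (var, node, node)   # lo = exclude var, hi = include var
    return node

def count_sets(node, memo=None):
    """Number of sets in the family — linear thanks to node sharing + memo."""
    memo = {} if memo is None else memo
    if node == TOP: return 1
    if node == BOT: return 0
    if id(node) not in memo:
        _, lo, hi = node
        memo[id(node)] = count_sets(lo, memo) + count_sets(hi, memo)
    return memo[id(node)]

def count_nodes(node, seen=None):
    """Number of distinct internal nodes in the diagram."""
    seen = set() if seen is None else seen
    if isinstance(node, tuple) and id(node) not in seen:
        seen.add(id(node))
        count_nodes(node[1], seen)
        count_nodes(node[2], seen)
    return len(seen)

z = all_subsets_zdd(100)
print(count_nodes(z))             # 100 nodes...
print(count_sets(z) == 2 ** 100)  # True — ...encoding 2^100 subsets
```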
Examples
Fraud Detection
// Multiple small transactions followed by large withdrawal
pattern SmurfingPattern = all Transaction where amount < 1000 as small
-> Transaction where amount > 9000 as large
within 1h partition by account_id
stream FraudAlerts = SmurfingPattern
.emit(
alert: "Smurfing",
account: large.account_id,
num_small: count(small)
)

SLA Monitoring
// Request without response within SLA
pattern SLABreach = Request as req
-> NOT Response where request_id == req.id
within 5s
stream SLAAlerts = SLABreach
.emit(alert: "SLA breach", request_id: req.id)

IoT Device Monitoring
// Device going offline (no heartbeat within 1m)
pattern DeviceOffline = Heartbeat as last_beat
-> NOT Heartbeat
within 1m partition by device_id
stream OfflineAlerts = DeviceOffline
.emit(alert: "Device offline", device: last_beat.device_id)

Trend Aggregation Mode
For patterns where you need statistics over trends (COUNT, SUM, AVG) rather than individual matches, use .trend_aggregate() instead of the default detection mode:
# Instead of detecting each rising price pattern individually...
# Count how many rising trends exist (without enumerating them)
stream TrendCount = StockTick as first
-> all StockTick where price > first.price as rising
.within(60s)
.partition_by(symbol)
.trend_aggregate(count: count_trends())
.emit(symbol: first.symbol, trends: count)

This uses the Hamlet engine (SIGMOD 2021) for O(n) aggregation instead of explicit trend construction, which can be exponential. Multiple queries sharing Kleene sub-patterns are automatically optimized via shared aggregation.
See Trend Aggregation Reference and Trend Aggregation Tutorial for details.
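For intuition about what a trend-count aggregate computes, here is a small O(n²) dynamic program that counts every strictly increasing subsequence without materialising any of them — illustrative only; the Hamlet engine uses a different, shared O(n) scheme:

```python
def count_rising_trends(prices):
    """Count all non-empty strictly increasing subsequences of `prices`
    via DP — the kind of aggregate count_trends() produces, computed
    without enumerating each trend explicitly."""
    f = []  # f[i] = number of increasing subsequences ending at index i
    for i, p in enumerate(prices):
        f.append(1 + sum(f[j] for j in range(i) if prices[j] < p))
    return sum(f)

print(count_rising_trends([1, 2, 3]))  # 7: {1},{2},{3},{1,2},{1,3},{2,3},{1,2,3}
print(count_rising_trends([3, 2, 1]))  # 3: only the three singletons
```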
Pattern Forecasting
Use .forecast() to predict whether a partially-matched pattern will complete, and when. This uses Prediction Suffix Trees (PST) combined with the SASE NFA to form a Pattern Markov Chain (PMC):
stream FraudForecast = Transaction as t1
-> Transaction as t2 where t2.amount > t1.amount * 5
-> Transaction as t3 where t3.location != t1.location
.within(5m)
.forecast(confidence: 0.7, horizon: 2m, warmup: 500)
.where(forecast_probability > 0.8)
.emit(
probability: forecast_probability,
expected_time: forecast_time
)

Parameters: confidence (min probability, default 0.5), horizon (forecast window), warmup (learning period, default 100), max_depth (PST depth, default 5)
Built-in variables (available after .forecast()): forecast_probability, forecast_time, forecast_state, forecast_context_depth
The PST learns online from the event stream — no pre-training required. See Forecasting Tutorial and Forecasting Architecture for details.
See Also
- Language Tutorial - VPL basics
- Windows & Aggregations - Windowed pattern matching
- Trend Aggregation - .trend_aggregate() reference
- Forecasting Tutorial - PST-based pattern forecasting
- Forecasting Architecture - PST/PMC design
- Troubleshooting Guide - Pattern debugging tips
- SIGMOD 2006 Paper - Original SASE+ research