Processes and criteria shape every decision, product, and interaction in modern organizations. When they are explicit, teams move faster, customers trust more, and regulators stay satisfied.
Yet most groups confuse a loose checklist with a true process, or they mistake a vague aspiration for a measurable criterion. This article dismantles that confusion and replaces it with a repeatable system you can deploy tomorrow.
Defining the Core Terms
A process is a time-bound sequence of inputs, actions, and outputs that converts uncertainty into value. A criterion is a measurable rule that signals “go” or “no-go” at any step in that sequence.
Without criteria, a process drifts into opinion. Without a process, criteria become arbitrary gatekeeping.
Process Anatomy
Every effective process contains five atomic parts: trigger, owner, task, artifact, and exit condition. Miss one and the chain breaks.
The trigger is the single detectable event that starts the clock. The owner is one named person, never a committee.
The task is a verb-noun pair that fits in a two-minute voice memo. The artifact is the tangible proof the task happened. The exit condition is a binary test that releases the work to the next node.
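The five atomic parts can be captured as a typed record. This is a minimal sketch; the field values (owner, trigger, artifact names) are hypothetical illustrations, not part of the article's example.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class ProcessStep:
    trigger: str        # the single detectable event that starts the clock
    owner: str          # one named person, never a committee
    task: str           # verb-noun pair that fits a two-minute voice memo
    artifact: str       # tangible proof the task happened
    exit_condition: Callable[[dict], bool]  # binary test releasing the work

# Hypothetical invoice-review step used purely for illustration.
step = ProcessStep(
    trigger="invoice_received",
    owner="a.chen",
    task="verify totals",
    artifact="signed_checklist.pdf",
    exit_condition=lambda ctx: ctx["totals_match"],
)
```

Making the exit condition a callable, rather than a prose description, is what later lets the gate run as code.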
Criteria Anatomy
A useful criterion is SMART-D: specific, measurable, assignable, realistic, time-bound, and dependency-aware. “High quality” fails; “zero memory leaks detected by Valgrind in staging within 30 min” passes.
Each criterion needs a unit, a sensor, and a threshold. Unit answers “in what currency?” Sensor answers “who or what measures?” Threshold answers “how much is enough?”
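The unit-sensor-threshold triple maps directly onto a small data structure. A minimal sketch, assuming a simple "lower is better" default; the Valgrind example from above becomes one instance.

```python
from dataclasses import dataclass

@dataclass
class Criterion:
    unit: str               # "in what currency?" e.g. leaks, ms, percent
    sensor: str             # who or what measures, e.g. "valgrind"
    threshold: float        # how much is enough
    higher_is_worse: bool = True

    def passes(self, reading: float) -> bool:
        """Binary go / no-go signal at any step in the sequence."""
        if self.higher_is_worse:
            return reading <= self.threshold
        return reading >= self.threshold

# "Zero memory leaks detected by Valgrind" as a concrete instance.
leaks = Criterion(unit="memory leaks", sensor="valgrind", threshold=0)
```

`leaks.passes(0)` is `True`; any positive reading fails the gate.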
Mapping Value Streams Before You Script Steps
Jumping into flowcharts without a value-stream map is like writing dialogue before you know the plot. Start with a sticky-note exercise that follows the product from raw signal to delighted user.
Color-code notes by wait time versus work time. Any span that is more than 25 % waiting is a candidate for collapse or parallelization.
Identifying Failure Load
Failure load is the hidden metric of rework, refunds, and reputation damage. Tag every support ticket with the process step that originated it.
Within six weeks a Pareto pattern emerges. One or two steps usually generate 70 % of the pain.
Rewrite those steps first; the rest can wait.
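The Pareto cut can be computed mechanically once tickets carry step tags. A sketch with hypothetical tag data; the 70 % figure is the threshold from above.

```python
from collections import Counter

# Hypothetical support-ticket tags, one per ticket, naming the
# process step that originated the problem.
tickets = ["checkout", "checkout", "kyc", "checkout", "shipping", "kyc",
           "checkout", "checkout", "kyc", "checkout"]

counts = Counter(tickets).most_common()
total = sum(n for _, n in counts)

# Walk the ranked steps until 70 % of the pain is covered.
covered, rewrite_first = 0, []
for step_name, n in counts:
    rewrite_first.append(step_name)
    covered += n
    if covered / total >= 0.7:
        break

print(rewrite_first)  # the one or two steps to rewrite first
```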
Capacity Caps
Each step has a theoretical max throughput. A human review step capped at 40 items per day becomes a brick wall once demand hits 41.
Model the cap with a simple Google Sheet: list daily output for the last quarter, add one standard deviation, and you have a practical ceiling.
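The same mean-plus-one-standard-deviation model works in a few lines of Python if the spreadsheet feels heavy. The daily figures below are invented for illustration.

```python
import statistics

# Hypothetical daily throughput of a human review step over the quarter.
daily_output = [38, 41, 35, 40, 39, 42, 37, 36, 40, 38, 41, 39]

mean = statistics.mean(daily_output)
stdev = statistics.stdev(daily_output)

# The "add one standard deviation" heuristic from above.
practical_ceiling = mean + stdev
print(round(practical_ceiling, 1))
```

Demand above the printed ceiling is the point at which the overflow bypass must already exist.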
Design overflow bypasses before you need them.
Designing Gate Criteria That Prevent Escalation
A gate is a moment where work can be rejected without apology. Good gates save expensive downstream work.
The secret is to place the gate just after the point of maximum reversibility and just before the point of maximum cost.
Binary vs. Gradient Gates
Binary gates are pass-fail; gradient gates return a score that feeds a risk budget. Use binary when failure is catastrophic, gradient when you can absorb partial flaws.
For example, medical device firmware uses binary gates for sterilization tests. E-commerce product photos use gradient gates for visual similarity scoring.
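The two gate styles differ only in their return shape. A sketch under the assumption that a gradient gate debits a running risk budget in proportion to the flaw it admits; the scoring scale is illustrative.

```python
def binary_gate(passed: bool) -> bool:
    """Pass-fail: use when failure is catastrophic."""
    return passed

def gradient_gate(score: float, risk_budget: float,
                  cost: float) -> tuple[bool, float]:
    """Score in [0, 1] feeds a risk budget: accept a partial flaw
    only if the remaining budget covers its expected cost."""
    shortfall = max(0.0, 1.0 - score) * cost
    if shortfall <= risk_budget:
        return True, risk_budget - shortfall
    return False, risk_budget

# A 0.9 visual-similarity score on a 20-unit item, against a 5-unit budget.
accepted, remaining = gradient_gate(score=0.9, risk_budget=5.0, cost=20.0)
```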
Gatekeepers’ Charter
Write a one-page charter that gives the gatekeeper explicit power to stop the line and zero power to redesign the product. Separation of veto and authorship prevents grade inflation.
Publish the charter on the intranet. Link it in the same Jira ticket template every time.
From Policy to Playbook: Writing the Living Document
Policies state intent; playbooks state finger movements. Keep policies at the division level; push playbooks to the repo that also holds the code.
When the pull request updates the code, the same diff must update the playbook.
Versioning Tactics
Use calendar versioning (2024.05.14) instead of semantic versioning for process docs. It forces readers to check freshness.
Archive snapshots in a frozen PDF every quarter for regulatory audits. Store the PDF in an S3 bucket with object lock enabled.
Change Advisory Threshold
Not every tweak needs a CAB meeting. Define a materiality threshold: any change that touches money, metrics, or customer-facing SLA triggers review.
Smaller edits can ship after a peer review recorded in Git.
Embedding Criteria in Code
Human-readable criteria drift; code-embedded criteria persist. Turn gates into unit tests that fail the build.
Example: a Python test that queries the staging database for unindexed foreign keys blocks deploy if the count exceeds zero.
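A sketch of that gate, using SQLite as a stand-in so the example is self-contained; against the real staging Postgres you would query `pg_catalog` instead, but the shape of the test is the same: zero offenders or the build fails.

```python
import sqlite3

def unindexed_foreign_keys(conn: sqlite3.Connection) -> list[tuple[str, str]]:
    """Return (table, column) pairs for FK columns with no covering index."""
    offenders = []
    tables = [r[0] for r in conn.execute(
        "SELECT name FROM sqlite_master WHERE type='table'")]
    for table in tables:
        # Column 3 of foreign_key_list is the referencing ("from") column.
        fk_cols = {r[3] for r in conn.execute(f"PRAGMA foreign_key_list({table})")}
        indexed = set()
        for idx in conn.execute(f"PRAGMA index_list({table})"):
            first = conn.execute(f"PRAGMA index_info({idx[1]})").fetchone()
            if first:
                indexed.add(first[2])  # leading column of the index
        offenders += [(table, col) for col in fk_cols if col not in indexed]
    return offenders

def test_no_unindexed_foreign_keys():
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE users(id INTEGER PRIMARY KEY);
        CREATE TABLE orders(id INTEGER PRIMARY KEY,
                            user_id INTEGER REFERENCES users(id));
        CREATE INDEX idx_orders_user ON orders(user_id);
    """)
    assert unindexed_foreign_keys(conn) == []  # gate passes; build proceeds
```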
Feature Flags as Process Branches
Deploy dark features behind flags that can only be toggled when criteria pass. The flag configuration file becomes the gate record.
Audit logs capture who flipped the flag and which test suite ran.
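A minimal sketch of the criteria-gated toggle: the flip is refused unless the suite passed, and every successful flip is appended to an audit record. Names and the audit-log shape are assumptions for illustration.

```python
import datetime

def toggle_flag(flags: dict, name: str, actor: str,
                suite_passed: bool, audit_log: list) -> dict:
    """Flip a dark feature on only when the gate criteria pass;
    record who flipped it and when."""
    if not suite_passed:
        raise PermissionError(f"gate criteria not met; {name!r} stays dark")
    audit_log.append({
        "flag": name,
        "actor": actor,
        "at": datetime.datetime.now(datetime.timezone.utc).isoformat(),
    })
    return {**flags, name: True}

audit_log: list = []
flags = toggle_flag({"new_checkout": False}, "new_checkout",
                    "a.chen", suite_passed=True, audit_log=audit_log)
```

In a real system the flag store would be your feature-flag service's configuration file, making that file itself the gate record.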
Automated Evidence Collection
Store screenshots, logs, and test metadata in an immutable bucket keyed by git SHA. Inspectors love time-stamped artifacts they did not have to request.
Set lifecycle rules to delete evidence after the statutory period to control cost.
Human Overrides Without Chaos
Overrides will happen; design the aperture, not the wall. Require a short free-text justification field that is automatically posted to a Slack channel monitored by compliance.
Override metrics become a leading indicator of criterion decay.
Override Budgets
Give each team a monthly budget of three overrides. Once spent, they must either change the criterion or escalate to the VP level.
Budgets turn abstract quality goals into a currency teams actually watch.
Post-Override Review
Within five business days, a different engineer must write a blameless retrospective that answers two questions: Was the criterion wrong or was the test wrong? What data supports the answer?
File a ticket to fix whichever was wrong.
Cross-Functional Calibration Workshops
Engineering, QA, legal, and support often interpret the same criterion differently. Run a 45-minute workshop every sprint where each department scores five anonymized examples.
Reveal the variance live; it is usually larger than anyone expects.
Consensus Dice
Use a silent voting tool that forces everyone to pick 1–5 without groupthink. Display a histogram instantly.
Discuss the outliers, not the median; that is where knowledge lives.
Calibration Records
Store the histogram and the revised rubric in the same Confluence page. Tag the page with the sprint number so future auditors can replay the evolution of standards.
Metrics That Signal Process Debt
Process debt is the invisible interest paid in slower cycle time and burnout. Track “criteria volatility”: the number of times a gate threshold changes per quarter.
Another signal is “override velocity”: overrides per deploy. A spike predicts incidents better than any lagging indicator.
Lead Time per Cognitive Load
Measure lead time weighted by the number of decision points. A process with ten gates feels slower even if each gate is fast.
Collapse adjacent gates when the sum of their cognitive load exceeds the risk they prevent.
Queue Length Alerting
Set a Prometheus alert on queue age, not queue size. A five-item queue that sits for two days is riskier than a fifty-item queue cleared in two hours.
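A sketch of such a rule, assuming a hypothetical gauge `queue_oldest_item_age_seconds` exported by your worker; the four-hour threshold is illustrative.

```yaml
groups:
  - name: queue-health
    rules:
      - alert: StaleQueueItem
        # Alert on age of the oldest item, not on queue length.
        expr: max(queue_oldest_item_age_seconds) > 4 * 3600
        for: 15m
        labels:
          severity: page
        annotations:
          summary: "Oldest queued item has waited more than four hours"
```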
Customer-Facing Transparency
Publish a condensed version of your criteria on the public website. Customers reward transparency with reduced support tickets.
Cloudflare publishes its edge SSL renewal checklist; it became a sales asset.
Status Page Integration
Link each status-page component to the exact process that keeps it healthy. When an incident occurs, users can read the playbook in real time.
This reduces “is it fixed yet?” pings by 30 %.
Open-Sourcing Gates
Open-source communities thrive on clear merge criteria. Kubernetes labels every pull request with “tide/merge-method-squash” only after CI, code review, and DCO criteria pass.
External contributors self-select, saving maintainer time.
Regulatory Alignment
Regulators do not care about your internal jargon; they care about traceability. Map every external requirement to a specific gate and artifact.
SOX, GDPR, and HIPAA all accept automated evidence if the chain of custody is unbroken.
Control Matrix
Build a spreadsheet with columns: regulation, clause, internal process, gate ID, artifact location, last audit date. Color rows green only when all cells are filled.
Share read-only access with auditors in advance; it halves the onsite duration.
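The green-only-when-complete rule is trivially checkable in code, which keeps the matrix honest. A sketch with invented rows; the column set matches the spreadsheet above.

```python
import csv
import io

# Hypothetical two-row matrix; the GDPR row is missing its artifact location.
MATRIX_CSV = """regulation,clause,internal_process,gate_id,artifact_location,last_audit_date
SOX,404,quarterly close,GATE-12,s3://evidence/sox/404,2024-03-31
GDPR,Art. 32,access review,GATE-07,,2024-02-15
"""

def row_status(row: dict) -> str:
    """Green only when every cell is filled."""
    return "green" if all(v.strip() for v in row.values()) else "red"

rows = list(csv.DictReader(io.StringIO(MATRIX_CSV)))
statuses = [row_status(r) for r in rows]
print(statuses)  # the incomplete GDPR row shows red
```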
Dynamic Control Testing
Instead of annual sampling, run dynamic tests that pull 1 % of records daily. Log the pass rate in a Grafana dashboard.
Auditors love living evidence over point-in-time screenshots.
Scaling Without Dilution
Adding headcount often dilutes criteria. Counteract with “process sharding”: duplicate the entire pipeline for each product line but keep gates identical.
Shared tooling guarantees consistency while autonomous teams optimize inside the guardrails.
Center of Excellence Charter
Create a lightweight CoE that owns only the gates and metrics, never the execution. Rotate membership every six months to prevent empire building.
The CoE publishes a quarterly delta report: what changed, why, and the impact on lead time.
Micro-process Repositories
Store each subprocess in its own repo with isolated tests. A change to onboarding then cannot break billing.
Use Git submodules to compose mega-playbooks from vetted Lego bricks.
Retirement Protocols
Processes calcify when no one retires them. Schedule a “sunset review” the moment a criterion hits 100 % pass rate for 90 consecutive days.
Ask whether the test is now redundant or the risk has vanished.
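The sunset trigger itself is a one-line check over the daily pass-rate history, sketched here with the 90-day window from above.

```python
def due_for_sunset_review(daily_pass_rates: list[float],
                          window: int = 90) -> bool:
    """True when the last `window` days all show a perfect pass rate."""
    recent = daily_pass_rates[-window:]
    return len(recent) == window and all(rate == 1.0 for rate in recent)

# 90 consecutive perfect days schedules the review; one miss resets it.
assert due_for_sunset_review([1.0] * 90)
```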
Artifact Archival
Before deletion, export a cryptographic hash of all artifacts to a cold-storage bucket. Ten years later you can still prove the control existed even if the tooling is obsolete.
Include a README that lists the obsolete dependencies so future archaeologists can recreate the environment if needed.
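The pre-deletion hashing step can be sketched in a few lines; the directory layout and file names are illustrative, and in production the JSON manifest would be uploaded to the cold-storage bucket before the artifacts themselves are deleted.

```python
import hashlib
import json
import pathlib
import tempfile

def manifest_for(artifact_dir: pathlib.Path) -> dict[str, str]:
    """SHA-256 every file under artifact_dir, keyed by relative path."""
    return {
        str(p.relative_to(artifact_dir)): hashlib.sha256(p.read_bytes()).hexdigest()
        for p in sorted(artifact_dir.rglob("*")) if p.is_file()
    }

# Demo with a throwaway directory standing in for the artifact store.
with tempfile.TemporaryDirectory() as d:
    root = pathlib.Path(d)
    (root / "build.log").write_text("deploy ok\n")
    manifest = manifest_for(root)
    cold_storage_payload = json.dumps(manifest, sort_keys=True)
```

Ten years later, rehashing a recovered artifact and comparing it against the stored manifest proves the control existed.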