# Image Moderation
Global disturbing-image pre-check, admin controls, and operational behavior.
Steve now supports a global image moderation pre-step that runs before the normal workflow pipeline.
Its purpose is to stop submissions that contain:
- explicit nudity or sexual imagery
- visible genitals or exposed breasts presented as nude content
- graphic violence, blood, open wounds, or severe injury
- self-harm, corpse imagery, assault aftermath, or other disturbing scenes
## Where it runs

The moderation pre-step runs near the top of `convex/engine/process.ts`, after the submission enters processing but before:
- enhancement
- AI extraction
- fraud checks
- Open Loyalty sync
If a submission is flagged as unsafe, the pipeline marks it failed and stops there.
## Configuration model
Image moderation is a global config, not a workflow-version stage.
It is stored in the `imageModerationConfig` table with two primary controls:

- `enabled`: on/off switch for the pre-step
- `prompt`: the editable moderation prompt template
This keeps the safety gate consistent across all workflows instead of making every workflow maintain its own moderation policy.
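As a rough sketch, the config record can be modeled as below. This is illustrative only: the real schema lives in the Convex project, and the assumption that a missing config row means "disabled" is mine, not stated in this doc.

```typescript
// Illustrative shape of the global moderation config; only `enabled` and
// `prompt` are documented fields, everything else here is an assumption.
interface ImageModerationConfig {
  enabled: boolean; // global on/off switch for the pre-step
  prompt: string;   // editable moderation prompt template
}

// Assumed behavior: if no config row exists yet, treat the pre-step as off.
function isModerationEnabled(config: ImageModerationConfig | null): boolean {
  return config?.enabled ?? false;
}
```

Because the config is global, every workflow reads the same `enabled`/`prompt` pair rather than carrying its own moderation policy.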
## Admin controls

Super admins can manage the feature from the settings page, which allows them to:
- enable or disable the moderation pre-step
- edit the moderation prompt
- reset the prompt back to the platform default
The prompt supports the `{{image_legend}}` placeholder, which is replaced with the uploaded file labels before the model call is made.
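A minimal sketch of that substitution, assuming the legend is a newline-separated list of file labels (the helper name and legend format are hypothetical, not the engine's real API):

```typescript
// Hypothetical helper: replaces every occurrence of {{image_legend}} in the
// prompt template with a newline-separated legend of uploaded file labels.
function renderModerationPrompt(template: string, fileLabels: string[]): string {
  return template.split("{{image_legend}}").join(fileLabels.join("\n"));
}

const prompt = renderModerationPrompt(
  "Review the images below for disturbing content.\n{{image_legend}}",
  ["Image 1: receipt.jpg", "Image 2: storefront.png"],
);
```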
## Model routing
Image moderation uses a dedicated AI pipeline ID:
Default behavior:

- provider: `openrouter`
- default model: `google/gemini-3.1-pro-preview`
This is separate from the workflow's normal OCR or analysis model selection.
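The routing defaults above can be sketched as a simple fallback merge. Only the default provider and model values come from this doc; the override shape and the `resolveModerationModel` helper are assumptions for illustration.

```typescript
// Default route for the dedicated moderation pipeline (values from this doc).
interface ModelRoute {
  provider: string;
  model: string;
}

const MODERATION_DEFAULTS: ModelRoute = {
  provider: "openrouter",
  model: "google/gemini-3.1-pro-preview",
};

// Hypothetical resolver: any explicit override wins, otherwise the defaults
// apply. Workflow OCR/analysis model selection is routed separately.
function resolveModerationModel(override?: Partial<ModelRoute>): ModelRoute {
  return { ...MODERATION_DEFAULTS, ...override };
}
```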
## Audit and visibility
When moderation runs, Steve records:
- token usage under `sourceType: image_moderation`
- review timeline events such as `content_moderation_complete` or `content_moderation_blocked`
- a failure reason when the submission is rejected by the pre-step
That means moderation usage appears in the admin usage dashboard as its own traffic category.
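The identifiers above can be pinned down in a small mapping. The constant and function here are hypothetical scaffolding; only the string values (`image_moderation`, `content_moderation_complete`, `content_moderation_blocked`) come from this doc.

```typescript
// Source type under which moderation token usage is recorded.
const MODERATION_SOURCE_TYPE = "image_moderation";

// Hypothetical helper mapping a moderation verdict to the review timeline
// event name recorded for the submission.
function moderationTimelineEvent(safe: boolean): string {
  return safe ? "content_moderation_complete" : "content_moderation_blocked";
}
```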
## Operational effect
When enabled:
- A submission enters `processing`.
- The moderation model evaluates the uploaded images.
- If safe, the normal pipeline continues.
- If unsafe, the submission is marked `failed` and downstream stages are skipped.
When disabled:
- The submission skips the moderation pre-step.
- The normal workflow pipeline starts immediately.