> For the complete documentation index, see [llms.txt](https://docs.decube.io/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.decube.io/data-quality/known-limitations.md).

# Monitor Behaviour and Design Considerations

The Data Quality module makes a number of deliberate design choices about how monitors train, alert, and respond to configuration changes. Understanding these upfront helps you set up monitors that work reliably from day one.

***

## Sensitivity adapts through use

Rather than asking you to set a sensitivity value at monitor creation — when you have no data to reason from — Decube starts every monitor at the default sensitivity (0) and lets you adjust it based on what you observe in practice.

Once a monitor has been running and producing incidents, you can tune its confidence interval through the incident feedback mechanism. Marking incidents as false positives shifts the model toward fewer alerts; marking missed anomalies shifts it toward more. This means sensitivity improves as your monitors accumulate real operational history.

{% content-ref url="/pages/oHQMBTOmZQevImaIjmyw" %}
[Incident model feedback](/data-quality/data-quality/incident-model-feedback.md)
{% endcontent-ref %}

***

## The ML model waits for sufficient signal before alerting

To avoid producing unreliable alerts on insufficient data, the ML model requires a minimum of **5 valid data points in the last 30 observations** before it begins raising incidents. On a newly created monitor or a low-volume table, scans may run without producing incidents until this threshold is reached — this is by design.

Once the signal threshold is met, the model activates and begins producing confidence-interval-based alerts.

{% hint style="success" %}
If you need immediate coverage on a low-volume or sparsely-updated table, use On-Demand mode with a manual threshold. Manual thresholds evaluate on every scan without requiring prior observations.
{% endhint %}

{% content-ref url="/pages/12tZSWlWRX4XEXBTD4L6" %}
[How Anomaly Detection Works](/data-quality/anomaly-detection-explained.md)
{% endcontent-ref %}

***

## Custom SQL gives you full control over thresholds

Custom SQL monitors evaluate threshold conditions you define explicitly, rather than using the ML model. This makes them well-suited to complex business rules and cross-table assertions where the right threshold is deterministic — for example, "zero rows should have a negative price" or "these two tables must always agree."

{% content-ref url="/pages/PWGhO7SkX4GtJu3jfRBX" %}
[Set Up Custom SQL Monitors](/data-quality/how-to-set-up-monitors/custom-sql-monitors.md)
{% endcontent-ref %}

***

## Retraining gives you a clean, accurate baseline

When you make a significant change to a monitor's configuration — such as changing the scan frequency, row filtering mode, or timestamp column — Decube starts the training process fresh. This ensures the model is trained on data that reflects the monitor's new configuration, rather than producing a baseline that mixes two different measurement approaches.

Because the previous history no longer reflects the new configuration, it is cleared as part of the retrain. Before making any change that triggers a retrain, note or resolve any open incidents you want to preserve.

{% content-ref url="/pages/KgXixMx9V2joZYvvAjL9" %}
[Retraining Monitors](/data-quality/monitor-configuration-settings/retraining-monitors.md)
{% endcontent-ref %}


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.decube.io/data-quality/known-limitations.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
