dbt has built-in testing for data quality.
Schema tests (in schema.yml):
models:
- name: orders
columns:
- name: order_id
tests:
- unique
- not_null
- name: status
tests:
- accepted_values:
values: ['pending', 'shipped', 'delivered']
Custom data tests: SQL that should return rows.
Run tests: dbt test runs all tests. Integrate into CI/CD.
Interview favorite: "How do you ensure data quality in your pipelines?"