If your validation examples appear in training data, your metrics are meaningless. This is data contamination.
Watch for near-duplicates, not just exact matches. Paraphrased versions of the same content count as leakage.
Deduplicate your data before splitting. Tools like MinHash can find near-duplicates efficiently. Clean data produces trustworthy evaluations.