The AI Act compliance deadline is August 2026. Your SME doesn’t need Atlan to make it.
I keep getting asked about AI Act compliance by founders and CTOs at scaleups. The conversation usually starts with a vendor quote attached, somewhere between 40K and 200K per year for a lineage tool. Then the question: “do we need this?”
Probably not. Here’s what the high-risk AI requirements actually demand for data traceability:
- Where did the training data come from? (source URLs, dataset names, vendor agreements)
- What’s in it? (schema, size, time range)
- How was it filtered or preprocessed? (deduplication rules, PII scrubbing, sampling)
- Who has access? (IAM groups, log who pulled it for training)
- When was it last updated? (timestamp, change log)
- Was it validated? (quality checks, manual review notes)
Six fields. Per dataset. That’s a markdown file or a small Notion page per training data source.
For a scaleup with three to ten high-risk training datasets, this fits on one page each. Total documentation effort: a week of work spread across two engineers. Total ongoing cost: 10 minutes per dataset update.
The enterprise lineage tools earn their money when you have hundreds of datasets, regulated industries with audit trails, or AI workloads where lineage is a product feature. Below that, you’re paying for capability you’ll never use.
Start with the six fields. Add a column to your data catalog. Move on. The AI Act doesn’t reward over-engineering.
How many high-risk AI datasets does your org train on today?
