Blog

AI and Data Privacy: What Businesses Must Do

The data privacy requirements businesses must meet when using AI: what data you can use, how to protect it, and how to maintain compliance across AI systems.

Phos Team ·
AI Strategy

Data privacy is the most immediate compliance challenge for businesses using AI. Most AI systems process personal data, and most organizations have not fully mapped their AI data processing against applicable privacy requirements.

Why data privacy is the top AI compliance issue

AI is unusually data-hungry. Training AI systems requires large datasets. Running AI systems often requires processing personal data in real time. Improving AI systems requires retaining logs of AI interactions. At every stage, personal data is involved, and at every stage, data privacy requirements apply.

The regulatory environment makes this especially important in 2026. GDPR enforcement involving AI is increasing in frequency and severity. The EU AI Act creates additional data governance requirements for high-risk AI systems. US state privacy laws are extending to cover AI-specific data uses. The compliance surface area is growing faster than most organizations’ data privacy programs have expanded to cover it.

Data you can and cannot use for AI

Not all data is equally available for AI use. Privacy law constrains what data you can collect, how you can use it, and how long you can keep it.

Data you can generally use:

  • Transactional data about your own business operations (with appropriate retention controls)
  • Customer data for purposes customers reasonably expected when they provided it
  • Employee data for legitimate HR purposes, with appropriate disclosure
  • Publicly available data, subject to the lawful basis requirements of applicable privacy law
  • Data provided with specific, informed consent for AI training or processing

Data you should not use without careful assessment:

  • Special category data (health, biometric, racial or ethnic origin, religious beliefs, etc.)
  • Data collected for one purpose repurposed for AI training without reassessment
  • Data from jurisdictions with strict localization requirements processed outside those jurisdictions
  • Children’s data, which receives heightened protection under most privacy frameworks
  • Data obtained through third-party data brokers without verified compliance

The purpose limitation principle. A core principle of GDPR and most major privacy frameworks is that data collected for one purpose should not be used for a materially different purpose without new lawful basis or consent. Using customer service interaction data to train a new AI product, for example, may not be permitted under the original data collection basis.

Protecting customer data in AI systems

When AI processes customer data, the protections that apply to that data do not change. The fact that processing is automated does not reduce the privacy rights of the individuals involved.

Disclosure. If AI is used to make or significantly influence decisions about customers, customers are generally entitled to know. GDPR’s Article 22 creates specific rights around automated decision-making. Many privacy frameworks require transparency about AI use in customer interactions.

Data minimization. Design customer-facing AI systems to use the minimum customer data necessary for the function. Do not collect additional customer data simply because it might improve AI performance in some future use case.

Retention. Apply the same data retention policies to AI processing logs and interaction records as to other customer data. AI interaction logs containing personal data are personal data. They must be retained only as long as necessary and deleted per your retention schedule.

Security. Customer data processed by AI systems must be protected with controls appropriate to its sensitivity. This includes security for data in transit to AI services, at rest in AI system logs, and within AI models trained on customer data.

Employee data and AI

Employee data in AI systems is a particularly sensitive area. AI used in hiring, performance management, scheduling, or monitoring creates significant privacy obligations.

Lawful basis. Processing employee data for AI-driven HR functions requires a lawful basis. In the EU, employment-related AI is likely to be high-risk under the EU AI Act, adding requirements beyond GDPR baseline.

Transparency. Employees are entitled to know what data about them is processed, how it is used, and what decisions it influences. AI-driven performance scores, scheduling algorithms, or monitoring tools require clear employee communication.

Article 22 implications. AI that makes automated decisions about hiring, promotion, termination, or pay is potentially subject to GDPR Article 22’s requirements for meaningful human oversight and the right to challenge the decision.

Works council and union considerations. In many European jurisdictions, introducing AI that monitors or manages employees requires consultation with employee representatives. This is a data privacy issue with significant employment law dimensions.

Vendor and third-party AI data sharing

When your organization uses third-party AI services, data sharing obligations arise. The data processing agreement (DPA) between your organization and the AI vendor is both a contractual and a legal requirement.

GDPR data processing agreements. Under GDPR, any vendor that processes personal data on your behalf is a data processor and must sign a DPA. AI service providers processing your customers’ or employees’ personal data are data processors. Many organizations have not completed DPAs with all their AI vendors.

Data transfer mechanisms. If AI vendor data processing occurs outside the EU or UK, data transfer mechanisms (Standard Contractual Clauses, adequacy decisions, or other approved mechanisms) are required for the transfer to be lawful.

Vendor security requirements. Your DPA should specify security requirements that the vendor must meet. Generic security commitments are not sufficient. Specify encryption standards, access controls, breach notification timelines, and audit rights.

Sub-processor transparency. AI vendors often use sub-processors (other vendors who process data on their behalf). Your DPA should require disclosure of sub-processors and your consent before new sub-processors are engaged.

For high-sensitivity workloads, a private AI workspace eliminates third-party vendor data sharing by keeping all AI processing within your own infrastructure.

Building a data privacy compliance program for AI

A complete AI data privacy program requires more than reviewing individual AI systems. It requires integrating AI into your broader data governance structure.

Map AI data flows. For every AI system, document what personal data it receives, where that data comes from, what the AI does with it, where outputs go, and how long data is retained. This data flow map is the foundation for privacy assessments.

Conduct privacy impact assessments. For AI systems involving high-risk processing (profiling, automated decision-making, special category data, large-scale processing), conduct a DPIA before deployment.

Update privacy notices. If AI processing represents a new or changed use of customer or employee data, update privacy notices to reflect that use.

Integrate AI into data subject request handling. Your data subject request processes need to handle AI-specific requests: access to AI-related processing, erasure from AI training data, and objections to automated decision-making.

Audit third-party AI vendors. Conduct an annual review of all AI vendors processing personal data. Verify DPAs are in place, data transfer mechanisms are current, and security practices meet your requirements.

For a detailed treatment of GDPR’s specific requirements for AI, see GDPR and AI.

Frequently asked questions

Does our AI provider’s GDPR compliance cover us?

No. Your AI provider’s compliance with GDPR covers their data processing obligations. Your organization is the data controller responsible for the lawfulness of processing personal data in AI systems, choosing vendors who meet GDPR requirements, and ensuring data subjects can exercise their rights. Vendor compliance does not transfer controller responsibility.

What are the biggest data privacy risks when using AI?

The biggest risks are: using personal data for AI purposes without adequate lawful basis, failing to provide transparency about automated decision-making, not responding adequately to data subject requests in AI contexts, missing DPAs with AI vendors, and transferring data to AI vendors outside the EU without adequate transfer mechanisms.

How does AI data privacy differ from general data privacy?

AI data privacy involves all general data privacy principles plus AI-specific challenges: training data lawful basis, the technical difficulty of data erasure from trained models, automated decision-making rights, and the opacity of AI processing that makes transparency more challenging. Standard data privacy programs need to be extended to address these AI-specific issues.

Is your AI program meeting its data privacy obligations?

Most organizations have data privacy programs that were not designed for AI. The gaps are significant and the regulatory risk is real.

Path one: map your AI data flows. An AI audit maps the personal data flowing through your AI systems, identifies privacy compliance gaps, and produces a remediation plan.

Path two: work with Phos AI Labs. If you want expert help building a privacy-compliant AI program, including private AI workspace options that minimize vendor data sharing risk, Phos AI Labs is a CCA-F certified Claude implementation partner. Thirty minutes, no deck. Start here.

Related articles

The fastest way to know whether we're the right fit, is a conversation.

STEP 1/2 · ABOUT YOU