How to Handle PII in Enterprise Integrations

Every enterprise integration moves data. Some of that data is innocuous: product SKUs, invoice numbers, inventory counts. But a significant share of what flows through your integration middleware is something far more sensitive: personally identifiable information (PII) and protected health information (PHI). Employee records syncing from Workday to a benefits platform. Patient identifiers moving between an EHR and a billing clearinghouse. Customer data traveling through an iPaaS pipeline to a CRM.

When that data crosses system boundaries, it does not leave your compliance obligations behind.

Under the EU General Data Protection Regulation (Regulation EU 2016/679) (GDPR) and the Health Insurance Portability and Accountability Act of 1996 (HIPAA), the integration layer is not a safe passage exempt from regulatory scrutiny. It is a processing activity in its own right, subject to the same data protection rules that govern your source and destination systems. This post lays out the compliance architecture, technical patterns, and implementation decisions that teams building enterprise integrations need to understand.

Why the Integration Layer Is a Compliance Blind Spot

Most compliance programs start with the systems of record: the HRIS, the EHR, the CRM. Security teams harden these endpoints and assume that anything shuttling data between them is infrastructure rather than processing. That assumption is wrong, and regulators have been clear about it.

Under GDPR, Article 4(2) defines processing as “any operation or set of operations which is performed on personal data or on sets of personal data, whether or not by automated means, such as collection, recording, organisation, structuring, storage, adaptation or alteration, retrieval, consultation, use, disclosure by transmission, dissemination or otherwise making available, alignment or combination, restriction, erasure or destruction.” Data transformation, field mapping, routing logic, and even the act of retrieving data to pass it along are all processing activities under this definition.

The implication is significant. When your integration engine reads an employee record from Workday, transforms it, and writes it to a payroll processor, each of those steps is processing. The integration platform or middleware you use to do this is a data processor. Your organization, as the entity determining the purpose and means, is the controller. Both roles carry distinct obligations under GDPR.

Under HIPAA, the equivalent construct is the business associate relationship. Any organization that creates, receives, maintains, or transmits electronic protected health information (ePHI) on behalf of a covered entity is a business associate, and per HHS’s Summary of the HIPAA Security Rule, “regulated entities must implement reasonable and appropriate administrative, physical, and technical safeguards for protecting ePHI.” An integration platform that handles ePHI on your behalf sits squarely in this category.

The practical problem is that integration architectures are complex, distributed, and change frequently. Field mappings get updated. New connectors get added. Pipelines branch into staging environments where PII data leaks into non-production systems. Logs capture full payload content for debugging and then never get cleaned up. These are not hypothetical scenarios — they are the patterns that generate audit findings and breach notifications.

What Counts as PII Under GDPR and PHI Under HIPAA

Before you can protect it, you need to know what you are protecting. The two frameworks take different approaches to defining sensitive information, and those differences shape how you architect your integration controls.

GDPR: Broad, Identity-Focused Definition

According to GDPR Article 4(1), personal data is “any information relating to an identified or identifiable natural person.” Identifiability is the key concept. A name paired with an employee ID is personal data. An IP address is personal data because it can be linked back to an individual. A device fingerprint embedded in an event payload is personal data. Pseudonymized data — where a real identifier has been replaced with a token — remains personal data unless the pseudonym cannot be reversed without additional information held separately.

This breadth matters for integrations because PII often travels in fields that integration developers do not flag as sensitive. User-agent strings in webhook payloads, audit trail data that includes username and timestamp, geolocation fields embedded in a contact record — all of these are in scope under GDPR.

GDPR also establishes a higher protection tier for special categories of data under Article 9: health data, biometric data, genetic data, data revealing racial or ethnic origin, political opinions, religious or philosophical beliefs, trade union membership, and data concerning sex life or sexual orientation. Processing these categories requires one of a limited number of explicit legal bases and significantly stronger technical controls. If your integration handles anything in this category — and HR integrations frequently do, through medical leave records, disability accommodations, or diversity data — you need an additional layer of governance.

HIPAA: Structured Around 18 Identifiers

HIPAA takes a more enumerated approach. Per HHS’s HIPAA Privacy Rule summary, PHI is “individually identifiable health information” that relates to an individual’s past, present, or future physical or mental health condition, the provision of health care, or payment for health care, and that identifies the individual or for which there is a reasonable basis to believe it can be used to identify the individual.

The HIPAA Security Rule specifically governs ePHI: PHI that is maintained in or transmitted by electronic media. Because virtually all enterprise integrations are electronic, ePHI rules apply to any integration touching health data.

The 18 identifiers under HIPAA’s safe harbor de-identification standard (45 CFR section 164.514) include names, geographic data below state level, dates (other than year) directly related to an individual, phone numbers, fax numbers, email addresses, Social Security Numbers, medical record numbers, health plan beneficiary numbers, account numbers, certificate and license numbers, vehicle identifiers and serial numbers, device identifiers and serial numbers, web URLs, IP addresses, biometric identifiers, full-face photographs and comparable images, and any other unique identifying number or code. Removing all 18 produces de-identified data that is no longer subject to HIPAA’s use and disclosure restrictions.

For integration architects, this enumerated list is operationally useful because it maps directly to field-level controls. You can inventory your integration payloads against this list and determine which fields require encryption, masking, or tokenization at each step in the pipeline.

Moving PII and PHI Across Enterprise Systems Without a Documented GDPR or HIPAA Compliance Pattern?

Every integration that touches employee records, patient identifiers, or personal data carries regulatory exposure. Sama Integrations builds and audits PII handling patterns across Workday and Infor environments to keep your data flows compliant. Let's review your integration architecture.

contact@samaintegrations.com

Schedule A Call

Where PII and PHI Surface in Enterprise Integration Architectures

Understanding the regulatory definitions is the starting point. The harder problem is mapping those definitions to the actual data flows in your integration architecture.

Payload Fields

The most obvious place PII appears is in the request and response payloads your integrations exchange. An HR integration might include fields like employeeId, firstName, lastName, dateOfBirth, salary, homeAddress, nationalId, and medicalLeaveStatus in a single record. A healthcare integration might carry patientName, dateOfService, diagnosisCode, providerNPI, and insuranceMemberId in a single transaction message.

Every transformation step in your pipeline that reads these fields is processing PII. Logging these payloads for troubleshooting is processing PII. Caching them for retry logic is processing PII. This has implications for data retention, access control, and purpose limitation that most integration teams have not fully thought through.

Integration Logs

This is the most commonly overlooked PII vector in enterprise integrations. When a pipeline fails, the instinct is to log everything: full request bodies, response payloads, header values, error messages. Debug logs in an iPaaS platform, a middleware server, or an API gateway can contain complete PII records that persist for months or years in log aggregation systems that were never designed for PII handling.

GDPR’s data minimization principle under Article 5(1)(c) requires that personal data be “adequate, relevant and limited to what is necessary in relation to the purposes for which they are processed.” Verbose logging of PII in integration middleware almost never meets this standard. The purpose of a debug log is to diagnose integration failures, not to process personal data, and logging full PII payloads goes beyond what is necessary for that purpose.

Intermediate Staging and Error Queues

Enterprise integration platforms routinely use intermediate staging tables, dead-letter queues, and retry buffers. Records that fail processing land in error queues and wait for manual remediation or automated retry. If these error queues contain PII, and in most HR and healthcare integrations they will, they are subject to all of the same data protection obligations as the primary data flow. Organizations frequently have excellent controls on production data paths and virtually no controls on error handling paths.

Test Environments

Production data copied into test environments for integration testing is one of the most common causes of compliance findings. GDPR requires a legal basis for every processing activity, and “we needed real data to test our integration” does not constitute a legal basis for processing personal data in a non-production environment. HIPAA takes a similarly strict view of ePHI in test environments. Building integrations using properly anonymized or synthetically generated test data is not optional for organizations operating under either framework.

Core GDPR Compliance Patterns for Enterprise Integrations

Pattern 1: Data Processing Agreements with Integration Vendors

Before data flows, contracts must be in place. GDPR Article 28 requires that data controllers only use data processors who provide “sufficient guarantees to implement appropriate technical and organisational measures” and that the relationship is governed by a written contract, the data processing agreement (DPA).

Every integration platform, iPaaS vendor, middleware provider, or cloud service that processes personal data on your behalf requires a DPA. This includes the Workday tenant if it is acting as a processor for any of your data, the middleware platform connecting Workday to downstream systems, and any API gateway or logging service in the data path. Absent a valid DPA, every data transfer through that service is a potential GDPR violation.

For teams building integrations on platforms like Workday, the Workday Data Processing Exhibit governs how Workday handles personal data. You should verify that your specific integration use case falls within the scope of activities covered by your existing DPA, particularly when adding new integration targets or new categories of personal data to existing pipelines.

Pattern 2: Data Minimization at the Integration Layer

The principle of data minimization under GDPR Article 5(1)(c) requires collecting and processing only the data that is strictly necessary for the defined purpose. In an integration context, this means your field mappings should not pass through fields that the downstream system does not need.

The implementation pattern is a field allowlist approach at the transformation layer. Rather than passing the full source record and letting the destination system ignore fields it does not use, the integration layer explicitly selects only the fields that serve the documented purpose. This requires collaboration between integration architects and data owners to define which fields are necessary for each integration, and it requires that field selection be reviewed when the integration’s purpose changes.

For HR integrations built on the Workday EIB (Enterprise Interface Builder), this means designing your report-as-a-web-service data sources to extract only the fields your target system requires. A downstream identity provider receiving employee data for account provisioning does not need the employee’s compensation details, medical leave history, or performance ratings. If those fields are in the EIB output because they were added for a different use case, they should be removed or separated into a distinct integration with its own legal basis and DPA.

Pattern 3: Purpose Limitation and Integration Documentation

GDPR’s purpose limitation principle under Article 5(1)(b) requires that personal data be “collected for specified, explicit and legitimate purposes and not further processed in a manner that is incompatible with those purposes.” In integration terms, this means you cannot build a single pipeline that sends employee data to HR analytics, payroll processing, and marketing personalization, even if the underlying data is the same, unless you have a documented legal basis for each purpose.

Organizations with mature GDPR programs document their integrations in a Record of Processing Activities (ROPA) under Article 30. Each integration should be represented as a distinct processing activity with a defined purpose, legal basis, data categories processed, recipient categories, and retention period. When you add a new connector to an existing pipeline, you are adding a new processing activity that requires its own entry.

For teams building complex integration landscapes, this documentation requirement has architectural implications. Single-purpose integrations with narrow field scopes are easier to document, audit, and change than multi-purpose pipelines carrying broad datasets. Designing for GDPR compliance from the start means building integrations that serve one documented purpose rather than consolidating multiple unrelated data flows into a single efficient pipeline.

Pattern 4: Cross-Border Data Transfer Controls

One of the most operationally complex aspects of GDPR compliance in enterprise integrations is managing cross-border data transfers. GDPR Chapter V restricts transfers of personal data to countries outside the EU and EEA unless adequate protection is ensured through one of the permitted mechanisms: an EU adequacy decision for the destination country, Standard Contractual Clauses (SCCs), Binding Corporate Rules (BCRs), or approved codes of conduct and certification mechanisms.

For enterprise integrations, this becomes relevant when your integration platform is hosted in a non-EEA country, when your iPaaS vendor’s infrastructure routes data through non-EEA regions, or when your integration sends data to a downstream system operated in a non-EEA country. An HR integration sending EU employee data from a European Workday tenant to an ADP payroll system in the United States, for example, requires SCCs or another transfer mechanism to be in place.

The practical controls here work at two levels. At the contractual level, ensure your DPAs with integration vendors include the current SCCs adopted by the European Commission (the 2021 modular SCCs replaced the older versions). At the technical level, understand where your data is being processed geographically. Cloud iPaaS platforms often operate across multiple regions, and default routing may not respect the geographic constraints your compliance program requires.

Pattern 5: Supporting Data Subject Rights Through Integration Architecture

GDPR gives data subjects rights including access, rectification, erasure, restriction, and data portability. When personal data is replicated across multiple systems through enterprise integrations, fulfilling these rights requires knowing where the data has traveled. The right to erasure under Article 17 — the right to be forgotten — is particularly challenging in an integration context because a single source record may have been copied to dozens of downstream systems.

The architectural pattern that supports rights fulfillment is a data lineage map: a documented record of which source systems feed which downstream systems through which integrations, and what data categories are replicated along each path. For complex integration landscapes, this is not a manual exercise. Data catalog tools and iPaaS platforms with built-in lineage tracking can automate the discovery and maintenance of these relationships.

When a data subject submits an erasure request, your process needs to identify every downstream system that received the subject’s data from your integration layer and trigger deletion in each of those systems, not just the source of record. The integration pipelines themselves may also hold copies of the data in retry queues, logs, and caches that require purging.

Moving PII and PHI Across Enterprise Systems Without a Documented GDPR or HIPAA Compliance Pattern?

contact@samaintegrations.com

Schedule A Call

Core HIPAA Compliance Patterns for Enterprise Integrations

Pattern 1: Business Associate Agreements for Every ePHI Processor

Before any ePHI flows through an integration, a Business Associate Agreement (BAA) must be executed with every vendor or service in the data path. Per HHS, a business associate is any person or organization, other than a member of your covered entity’s workforce, who creates, receives, maintains, or transmits ePHI on your behalf.

This requirement extends to integration platforms, cloud services used for message routing, logging platforms that capture ePHI in debug output, and any managed service provider involved in operating your integration infrastructure. The BAA obligates the business associate to implement appropriate safeguards, report breaches, and restrict subcontractors through their own BAAs — creating a chain of agreements that must cover the entire data path.

When using cloud-native integration services, verify that BAA coverage extends to all services in the data path, not just the primary integration platform. Object storage used for staging files, message queue services used for reliable delivery, and log management services used for operational monitoring all process ePHI if they are in the data path, and all require BAAs.

Pattern 2: Minimum Necessary Standard in Integration Payloads

HIPAA’s minimum necessary standard requires that covered entities and business associates limit the ePHI used, disclosed, or requested to the minimum necessary to accomplish the intended purpose. For integration architects, this is the HIPAA equivalent of GDPR’s data minimization principle, and the implementation pattern is the same: build integrations that extract and transmit only the fields required by the downstream system for the specific function it performs.

A claims adjudication integration sending patient data from an EHR to a clearinghouse does not need to include the full clinical notes, lab results, or imaging references if the clearinghouse only needs demographic identifiers and diagnosis codes to adjudicate the claim. A scheduling integration connecting a provider system to a patient portal does not need to carry medication lists or surgical history. The minimum necessary standard requires actively scoping each integration to the minimum PHI that accomplishes the legitimate purpose.

The operational challenge is that the minimum necessary determination must be made by personnel with authority to make that judgment, typically the covered entity’s privacy officer or a designated data owner, not the integration developer. Integration architects should build the technical capability to restrict fields, but the decision of which fields to restrict should come from a documented business process with appropriate sign-off.

Pattern 3: Encryption for ePHI in Transit and at Rest

The HIPAA Security Rule requires covered entities to “implement a mechanism to encrypt and decrypt electronic protected health information” as an addressable implementation specification. Addressable does not mean optional. It means you must either implement the safeguard or document why it is not reasonable and appropriate for your environment and implement an equivalent alternative. For virtually all enterprise integration scenarios, encryption is both reasonable and appropriate.

The practical requirements for integration architects:

For ePHI in transit, all connections carrying ePHI must use TLS 1.2 or higher. Older versions of TLS and all versions of SSL are deprecated and should not be accepted. API endpoints that receive ePHI must be configured to refuse connections on deprecated TLS versions. This applies to connections between your integration layer and source systems, between your integration layer and destination systems, and to any webhook callbacks or event subscriptions that carry ePHI.

For ePHI at rest, any staging tables, message queues, file drop zones, or caching layers that hold ePHI between processing steps must use encryption at rest with keys that are managed and rotated according to your security policy. This includes the integration platform’s internal storage: many iPaaS platforms persist in-flight messages to disk for reliability, and if those messages contain ePHI, the storage must be encrypted.

Pattern 4: Audit Controls and Access Logging

HIPAA’s Security Rule requires covered entities to implement hardware, software, and procedural mechanisms that record and examine activity in information systems that contain or use ePHI. For integration systems, this means every access to ePHI in the integration layer must be logged with sufficient detail to support an investigation of unauthorized access or disclosure.

The minimum content for an ePHI access log entry includes who initiated the access (user or system identity), what data was accessed (record identifiers, not full content), when the access occurred (UTC timestamp), from where the access originated (IP address or service endpoint), and the outcome (success, failure, error type).

This creates a tension with the principle of keeping PII out of logs. The resolution is to log metadata about ePHI access — identifiers, timestamps, access types — without logging the ePHI content itself. Record identifiers allow the audit trail to be correlated with the actual data record in the source system if an investigation requires it, without embedding ePHI content in the logging infrastructure.

For teams building integrations on Workday, the Workday REST API provides access to audit trail data through the Security Activity log and User Activity log APIs. Building an automated feed of these audit events into your SIEM for correlation with integration access patterns is a recommended practice for covered entities using Workday as a source of ePHI.

Pattern 5: Breach Notification Preparedness

HIPAA’s Breach Notification Rule requires covered entities to notify affected individuals, HHS, and in some cases the media, within 60 days of discovering a breach of unsecured ePHI. The rule also requires business associates to notify covered entities within 60 days of discovering a breach.

For integration architects, breach preparedness requires knowing the data lineage of every ePHI flow. When a potential breach is identified — a misconfigured API endpoint, an unauthorized access event in the audit logs, an error that caused ePHI to be written to an unencrypted log — you need to be able to determine within hours what ePHI was potentially exposed, which individuals were affected, and which downstream systems and business associates received copies of the exposed data.

This requires that your integration documentation, data lineage maps, and access logs be current, accurate, and rapidly searchable. An integration landscape that was documented two years ago and has since been modified extensively without updating the documentation will fail precisely at the moment the breach notification clock starts running.

Technical Implementation Patterns

Field-Level Encryption in Integration Payloads

Field-level encryption encrypts specific sensitive fields within a payload while leaving non-sensitive fields readable. Unlike transport encryption (TLS), which only protects data in transit and decrypts entirely at the destination, field-level encryption allows you to transmit data through intermediary systems — message brokers, routing layers, logging infrastructure — without exposing the sensitive content to those systems.

The pattern is implemented by encrypting specific fields before they enter the integration pipeline and only decrypting them at the authorized destination system. The integration middleware can route, transform non-sensitive fields, and log messages without ever having access to the plaintext sensitive data.

For example, in an HR integration sending employee tax identifiers from an HRIS to a payroll processor, the national identification number can be encrypted with a key shared only between the HRIS and the payroll processor. The integration middleware sees an opaque ciphertext in that field, routes the message correctly, and logs the message structure without capturing the plaintext identifier.

Implementation considerations include key management (who holds keys, how are they rotated, what happens when a key is compromised), searchability (encrypted fields cannot be searched or filtered in transit), and schema design (ciphertext may be longer than the original plaintext, requiring schema adjustments in staging tables).

Tokenization for PII References

Tokenization replaces a sensitive value with a non-sensitive substitute (the token) and stores the mapping in a secure token vault. Unlike encryption, tokenization does not have a reversible mathematical relationship between the original value and the token, making it more resistant to certain categories of attack.

In an integration context, tokenization is useful when the same individual needs to be tracked across multiple systems without sharing their real identifier. An employee’s national ID is tokenized at ingestion into the integration layer. All downstream systems receive and store the token. When a specific downstream system needs to resolve the token back to the real identifier for a legitimate purpose (tax filing, for example), it presents the token to the vault and receives the real identifier under audit-logged conditions.

The token vault is a high-trust, high-security system that itself requires strong access controls, encryption at rest, comprehensive audit logging, and BAA or DPA coverage from its vendor. Centralizing the mapping in the vault makes it a single point of control for data subject rights operations: revoking a token effectively severs the link between downstream copies and the real identifier.

Pseudonymization and Its Limits

GDPR specifically mentions pseudonymization in Article 4(5) as a technical measure that can reduce risk and in some contexts allows for secondary processing under Article 89. Pseudonymization replaces direct identifiers with pseudonyms while keeping the pseudonym-to-identity mapping separately and securely.

The critical point for integration architects is that pseudonymized data remains personal data under GDPR. The regulation is explicit: pseudonymization “reduces, but does not eliminate” the risks to data subjects. If your integration transmits employee data with real names replaced by employee codes, that data is still personal data because the mapping from code to name exists and could be obtained.

Pseudonymization does provide real risk reduction: a data breach of pseudonymized data has less immediate impact than a breach of plaintext-identified data, and GDPR recital 78 recognizes pseudonymization as a risk mitigation measure that can be relevant to demonstrating compliance with Article 5’s data protection by design and by default requirements. But it does not relieve you of the obligations that apply to personal data.

True anonymization, producing data from which individuals cannot be identified even with additional information reasonably available to an attacker, falls outside GDPR’s scope entirely. The practical test for anonymization is stringent, and most enterprise datasets that organizations describe as “anonymized” do not actually meet it.

Data Masking in Non-Production Environments

The specific problem of PII in test and development environments requires data masking: a process that replaces real PII values with realistic but fictitious substitutes that preserve the data’s structural characteristics (format, length, character set) without revealing real personal data.

For integration testing, masked data needs to be realistic enough to exercise the integration’s validation logic. A masked email address should still be a valid email format. A masked date of birth should still be a plausible date. A masked social security number should still be a 9-digit format that passes checksum validation if the downstream system validates checksums.

Masking should be applied as close to the production data source as possible, before data enters any non-production system. The masking process itself has access to real PII and requires strict controls: it should run as an automated, audited process in a secure environment, not as an ad-hoc developer script.

Moving PII and PHI Across Enterprise Systems Without a Documented GDPR or HIPAA Compliance Pattern?

contact@samaintegrations.com

Schedule A Call

Integration-Layer Access Control

Role-Based Access Control at the API and Platform Level

For complex integration landscapes, access control is a multi-layer problem. At the API layer, the integration platform’s service account credentials should be scoped to the minimum permissions required for each integration. A read-only integration pulling employee data from Workday for reporting purposes should use credentials that have read access to the relevant report but no write access to any Workday object. An integration that provisions accounts in a downstream system should have write access scoped specifically to the object type being provisioned, not broad administrative access.

Workday’s security model uses domain security policies and security groups to control what data integration users can access. Configuring integration system users (ISUs) with purpose-scoped security groups, rather than granting broad report access or using shared administrator credentials, is both a GDPR data minimization control and a HIPAA minimum necessary control. An ISU that can only access the specific domains required for its integration cannot be exploited to extract a broader dataset even if its credentials are compromised.

At the integration platform level, access to the pipeline configuration, monitoring dashboards, and log data should be role-based and audited. Development teams building new integrations do not need access to production credentials or production log data containing PII. Operations teams monitoring integration health need access to status and error counts but not to payload content. Only a limited set of administrators should have access to the systems that process PII in production.

OAuth 2.0 Scopes for Fine-Grained API Authorization

For integrations using OAuth 2.0 to authenticate with source and destination systems, scope configuration is a primary access control mechanism. OAuth scopes define what resources and operations the integration is authorized to access on behalf of the authorizing entity.

When requesting OAuth tokens for integrations, request only the scopes that the integration actively uses. An integration that reads employee job profiles does not need scopes that permit writing to payroll or accessing time-off requests. Minimal scope requests reduce the blast radius of a compromised token: an attacker with a stolen token can only access what the token’s scopes permit.

For systems that support token-based access, implement token rotation: OAuth access tokens should have short expiry times (minutes to hours for high-sensitivity integrations), and the integration should use refresh tokens to obtain new access tokens rather than relying on long-lived static credentials. This limits the window during which a compromised token can be used.

Audit Logging, Monitoring, and Breach Detection

Compliance with both GDPR and HIPAA requires not just implementing controls but demonstrating that controls are operating effectively. This requires audit logging that captures what happened, monitoring that detects anomalies in real time, and documented incident response procedures that activate when anomalies are detected.

For integration audit logging:

Log every data access at the integration layer with subject identifiers (not content), timestamps, and the identity of the service or user initiating the access. For HIPAA ePHI, this is a specific regulatory requirement. For GDPR, it supports the accountability principle under Article 5(2) which requires controllers to be able to demonstrate compliance.

Log all configuration changes to integration pipelines. Adding a new field mapping, connecting a new downstream system, or changing the routing logic of an existing pipeline are all changes that affect personal data processing and should be captured in an immutable audit trail with the identity of the person who made the change.

Implement anomaly detection on integration data volumes and access patterns. A sudden spike in records transferred by a particular integration, access to sensitive fields by a service account that normally does not touch them, or data being routed to an unexpected destination are all indicators that merit investigation.

Retain logs for the periods required by your applicable regulations. HIPAA requires documentation of Security Rule policies and procedures for six years from creation or last effective date. GDPR does not specify fixed log retention periods but requires that retention be justified by purpose and limited to what is necessary. Coordinate log retention requirements with your records management and legal teams.

Testing Integrations for Compliance

Testing integrations for compliance is distinct from functional testing and requires specific test cases aimed at verifying that compliance controls work correctly.

For data minimization controls, verify that the integration does not transmit fields that are not in the defined allowlist. Send source records with additional fields added and confirm that the integration strips them before transmission to the destination. This regression tests against configuration drift where a new field is added to the source schema and inadvertently picked up by an integration that should not receive it.

For encryption controls, verify that ePHI fields are encrypted in transit by capturing network traffic and confirming that sensitive values do not appear in plaintext. Verify encryption at rest by examining staging storage directly and confirming that sensitive values are not stored in cleartext.

For access control, test that service accounts cannot access data beyond their defined scope. Attempt to access fields or records outside the integration’s authorized domain using the integration’s service credentials and verify that access is denied. This is particularly important after credential rotations or security group changes, which can inadvertently expand or contract access.

For data lineage and rights support, simulate a data subject erasure request and verify that the process correctly identifies and purges all copies of that individual’s data held by the integration layer, including retry queues, logs, and staging tables.

For teams using Workday EIB or Workday’s REST API for integration, the Workday Community provides testing guidance specific to integration security configuration, including how to test that integration system users are correctly scoped and that report access policies are enforced as expected.

Managing Compliance Across a Growing Integration Landscape

As integration landscapes grow, the compliance burden scales with them. Each new connector introduces new data flows, new processor relationships, and new access control requirements. Managing this at scale requires treating integration compliance as a continuous operational discipline, not a one-time implementation project.

The operational patterns that sustain compliance at scale include:

Integration cataloging: maintaining a current inventory of all production integrations, the data categories they process, the systems they connect, and the legal basis or BAA that authorizes each data flow. This is the foundation for impact assessments when source systems change, when new regulations come into effect, or when integration vendors modify their data handling practices.

Change management gates: requiring that new integrations and significant modifications to existing integrations pass a privacy and security review before deployment to production. This review should verify that a DPA or BAA is in place with any new vendor in the data path, that field scopes are properly minimized, that encryption is configured, and that audit logging is enabled. For teams building custom integration development at scale, embedding this review into the deployment pipeline prevents compliance gaps from reaching production.

Vendor management for integration platforms: reviewing the compliance posture of your iPaaS and middleware vendors on a defined cycle. Changes in vendor subprocessors, data center locations, or security certifications affect your own compliance position and need to be tracked.

Periodic data lineage audits: verifying that the documented data flows match the actual data flows by comparing integration configuration to the lineage documentation. Configuration drift — fields added to a pipeline without documentation, routing rules changed informally — is common in rapidly evolving integration landscapes and creates undocumented processing activities.

Moving PII and PHI Across Enterprise Systems Without a Documented GDPR or HIPAA Compliance Pattern?

contact@samaintegrations.com

Schedule A Call

GDPR and HIPAA Compliance: Where the Frameworks Diverge

Organizations building integrations that must satisfy both GDPR and HIPAA, typically those in US healthcare organizations serving EU patients or EU-based organizations with US health plan obligations, need to manage both frameworks simultaneously.

The frameworks converge on several principles: data minimization, access control, encryption, audit logging, and breach notification. An integration built to satisfy HIPAA’s technical safeguards will satisfy many of GDPR’s security of processing requirements under Article 32. An integration that implements GDPR’s data processing agreement framework will cover most of the contractual requirements for HIPAA’s business associate agreements.

The divergences are primarily in consent, purpose, and individual rights. HIPAA permits certain uses and disclosures of PHI without patient authorization for treatment, payment, and healthcare operations, while GDPR may require explicit consent or another specific legal basis for the same processing. GDPR’s data subject rights, particularly the right to erasure and data portability, have no direct equivalent in HIPAA and require technical capabilities that pure HIPAA compliance programs do not address.

For cross-framework integrations, the practical approach is to build to the more demanding requirement on each dimension. Use GDPR’s data minimization and purpose limitation rigor for field scoping decisions. Use HIPAA’s structured audit requirements for logging design. Apply GDPR’s data subject rights architecture to ensure erasure and portability capabilities exist. The resulting integration will satisfy both frameworks without needing two separate compliance architectures.

Conclusion

Handling PII in enterprise integrations is not a checkbox exercise. It requires deliberate architectural decisions at every layer: the fields you include in integration payloads, the encryption controls you implement in transit and at rest, the access scopes you grant to service accounts, the logs you keep and the logs you do not, the contracts you execute with every vendor in the data path, and the documentation you maintain to demonstrate that your controls are working.

GDPR and HIPAA share a common architectural implication: the integration layer is a full participant in your data protection program, not an exempt conduit between protected endpoints. Every pipeline that processes personal data or ePHI is a processing activity under GDPR, a business associate relationship under HIPAA, or both.

Organizations that build these controls into their integration architecture from the start will find compliance easier to maintain as their integration landscape grows. Organizations that treat compliance as a post-hoc overlay on a poorly governed integration environment will spend significant resources responding to audit findings, remediation projects, and potentially regulatory enforcement.

For organizations building and operating integrations on platforms like Workday and Infor, the technical capabilities for compliant integration design exist. The challenge is applying them consistently, documenting them rigorously, and revisiting them when the integration landscape changes. That discipline is what separates integration programs that can demonstrate compliance from those that can only assert it.

How to Handle PII in Enterprise Integrations: GDPR and HIPAA Compliance Patterns

Jason Walisser

Why the Integration Layer Is a Compliance Blind Spot

What Counts as PII Under GDPR and PHI Under HIPAA

GDPR: Broad, Identity-Focused Definition

HIPAA: Structured Around 18 Identifiers

Moving PII and PHI Across Enterprise Systems Without a Documented GDPR or HIPAA Compliance Pattern?

Where PII and PHI Surface in Enterprise Integration Architectures

Payload Fields

Integration Logs

Intermediate Staging and Error Queues

Test Environments

Core GDPR Compliance Patterns for Enterprise Integrations

Pattern 1: Data Processing Agreements with Integration Vendors

Pattern 2: Data Minimization at the Integration Layer

Pattern 3: Purpose Limitation and Integration Documentation

Pattern 4: Cross-Border Data Transfer Controls

Pattern 5: Supporting Data Subject Rights Through Integration Architecture

Moving PII and PHI Across Enterprise Systems Without a Documented GDPR or HIPAA Compliance Pattern?

Core HIPAA Compliance Patterns for Enterprise Integrations

Pattern 1: Business Associate Agreements for Every ePHI Processor

Pattern 2: Minimum Necessary Standard in Integration Payloads

Pattern 3: Encryption for ePHI in Transit and at Rest

Pattern 4: Audit Controls and Access Logging

Pattern 5: Breach Notification Preparedness

Technical Implementation Patterns

Field-Level Encryption in Integration Payloads

Tokenization for PII References

Pseudonymization and Its Limits

Data Masking in Non-Production Environments

Moving PII and PHI Across Enterprise Systems Without a Documented GDPR or HIPAA Compliance Pattern?

Integration-Layer Access Control

Role-Based Access Control at the API and Platform Level

OAuth 2.0 Scopes for Fine-Grained API Authorization

Audit Logging, Monitoring, and Breach Detection

Testing Integrations for Compliance

Managing Compliance Across a Growing Integration Landscape

Moving PII and PHI Across Enterprise Systems Without a Documented GDPR or HIPAA Compliance Pattern?

GDPR and HIPAA Compliance: Where the Frameworks Diverge

Conclusion

Recent Insights

Quick Links

Contact Info