Caleb Dinsmore

My personal programming blog about security, Python, .NET, AngularJS, React, and whatever other tech stacks I find myself in.

De-identifying (Anonymizing) Data for Development (using Python)

When working with Personally Identifying Information (PII) or Protected Health Information (PHI), it’s often necessary to “de-identify” that information, particularly if you are displaying it in any way (for reporting or displaying research results). You want to ensure that the data you’re showing can’t be used to identify any given individual. In my case, I wanted to also de-identify the data being used within my organization, even though that data wouldn’t be displayed publicly.

Continue reading