Open-sourcing Datanymizer: in-flight template-driven data anonymization

Datanymizer currently supports PostgreSQL databases; support for MySQL (and therefore MariaDB) is planned. Contributions are very welcome!

SAN FRANCISCO - Californer -- Fakers, anonymizers, and obfuscators: various free and open-source data anonymization tools have been around for a long time and work pretty well. So why did we create a new one, with support for globals, uniqueness constraints, inline rules, and other features?

We had some particular requirements we wanted our tool to meet. We didn't want the anonymizer to take a "raw" dump and mutate it. Instead, it needed to produce a dump that is already anonymized, so that nobody working with the dump has access to the real data. The configuration that determines how the real system data is anonymized also had to be kept separate from that data.

We also wanted a tool that was flexible about how the anonymization itself takes place, ideally allowing the use of templates to populate field contents.
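
To make the idea concrete, here is a minimal Python sketch of in-flight, template-driven anonymization. It is an illustration of the general approach only, not Datanymizer's actual configuration format or implementation: the `RULES` mapping, the column names, and the helpers `token_for`, `anonymize_row`, and `anonymize_stream` are all hypothetical. Rows are rewritten one at a time as they stream past, so no raw copy is ever written out, and the rules live apart from the data they describe.

```python
import hashlib
from string import Template

# Hypothetical rule set, kept separate from the data it describes.
# Each column maps to a template that builds the replacement value.
RULES = {
    "email": Template("user_${token}@example.com"),
    "name": Template("Person ${token}"),
}


def token_for(value: str, length: int = 8) -> str:
    """Derive a stable token from the original value.

    Assumption for illustration: an unsalted, truncated SHA-256 digest.
    A real tool would need something stronger (e.g. a secret salt),
    since low-entropy inputs can be guessed from an unsalted hash.
    """
    return hashlib.sha256(value.encode("utf-8")).hexdigest()[:length]


def anonymize_row(row: dict, rules: dict = RULES) -> dict:
    """Return a copy of `row` with every ruled column replaced."""
    out = {}
    for column, value in row.items():
        template = rules.get(column)
        if template is None or value is None:
            # Columns without a rule (and NULLs) pass through unchanged.
            out[column] = value
        else:
            out[column] = template.substitute(token=token_for(str(value)))
    return out


def anonymize_stream(rows, rules: dict = RULES):
    """Anonymize rows lazily, one at a time, as they are read."""
    for row in rows:
        yield anonymize_row(row, rules)
```

In use, the generator would sit between the database reader and the dump writer, for example `for row in anonymize_stream(source_rows): write(row)`, where `source_rows` and `write` stand in for whatever reads from the database and writes the dump.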

Future development

We plan to implement the following additional features soon:
  • Pre-filtering: dumping only the rows that match specific criteria rather than the whole table (e.g., 100 users named Alexander who are 27 or older), with support for arbitrary SQL queries as filters.
  • Data generation: generating synthetic data according to a set of rules, for cases where there is no existing data to anonymize.

Read more about the approach Datanymizer takes.

Media Contact
44 Tehama St, San Francisco, CA 94105

