Replace anonymized data in string - Mailing list pgsql-general

From Patrick FICHE
Subject Replace anonymized data in string
Date
Msg-id VI1PR05MB5917D508CAA017AB9FF70890EF959@VI1PR05MB5917.eurprd05.prod.outlook.com
Responses Re: Replace anonymized data in string  (Daniel Gustafsson <daniel@yesql.se>)
Re: Replace anonymized data in string  (Rob Sargent <robjsargent@gmail.com>)
List pgsql-general

Hi Team,

I have some data that has been anonymized, and I would like to generate test data from it. Essentially, I would like to de-anonymize this data by filling it in with random values.

For example, phone numbers have been anonymized by replacing the 5 rightmost digits with the digit 8 (preserving length).

Applying this, the number 390694802756 was changed to 390694888888.

I would like to get random digits at the end of the phone number, bearing in mind that the anonymized part can be of variable length.

So, I would like to replace every sequence of at least two 8s with a random value of the same length (I don’t mind if a phone number contains 88 in the middle and that sequence is also changed to random data).

I tried to do this with the replace / regexp_replace functions but could not achieve what I wanted.

I don’t want these digits to be replaced with a single repeated digit (88888 becoming 11111 or 99999) but with something like 42384.

Ideally, the replacement would be different for each sequence of 8s when several appear in a string, and would differ from one record to the next when applied to a full table.
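
To make the requirement concrete, here is a minimal sketch of the intended behaviour, written in Python rather than in PostgreSQL. The function name randomize_eights and the optional rng parameter are hypothetical names chosen for illustration only; this is a description of the desired result, not a database-side solution.

```python
import random
import re
from typing import Optional


def randomize_eights(value: str, rng: Optional[random.Random] = None) -> str:
    """Replace every run of two or more '8' characters with random digits.

    Hypothetical helper for illustration: the name and the `rng` parameter
    are assumptions, not part of any PostgreSQL or library API.
    """
    rng = rng or random.Random()

    def random_digits(match: "re.Match[str]") -> str:
        # Draw one digit per matched character, so the length is preserved
        # and each run in the string gets its own independent value.
        return "".join(rng.choice("0123456789") for _ in match.group(0))

    # 8{2,} matches any sequence of at least two consecutive 8s.
    return re.sub(r"8{2,}", random_digits, value)


if __name__ == "__main__":
    # Made-up sample value; the output differs from one call to the next.
    print(randomize_eights("0612388888"))
```

Because the digits are drawn independently, a run could still occasionally come out as a single repeated digit or contain an 8 again; for test data that should be acceptable. A lone 8 is left untouched, and an original 8 directly in front of the anonymized block becomes part of the run and is randomized as well.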

 

Is there any way to do this?

Thanks,

Patrick
