Re: [cross-project-issues-dev] Anonymisation of public data


From: Boris Baldassari <boris@xxxxxxxxxxxxxx>
Date: Fri, 27 Apr 2018 12:44:03 +0200

Hiho good people,

There have been some off-list discussions going on, and I'd like to follow up on them.


As Mike nailed it, keys will not be stored.

And since there was no counter-reaction, we'll go with that for now. Any input or feedback is still appreciated, of course, and I'll let you know when things move forward.


Thanks, have a lovely end of week! :-)


--
boris




On 26/04/2018 10:15, Mike Milinkovich wrote:

The Eclipse Foundation would prefer to *not* be responsible for securely retaining such keys. If an interesting pattern was ever uncovered, I think that we could analyze the original data to discover the relevant authors using the original available data. And I cannot really imagine providing researchers direct access to interesting contributors discovered by analyzing anonymized data, as I am certain that would violate our privacy policies.
In short, IMO the privacy risk of maintaining those keys outweighs any potential advantages of retaining them. My 2c :)
On 2018-04-26 3:02 AM, Mickael Istria wrote:
Hi Boris,

    The basic idea is to simply replace all identifiers with
    asymmetrically encrypted strings, so all IDs have the same
    ciphered result. RSA is used for the encryption, and the private
    key is thrown away once the encoding is done, making it impossible
    (according to common encryption standards) to retrieve the
    original string.
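
A minimal sketch of the scheme described in the quoted paragraph above, under stated assumptions and not the actual implementation discussed in this thread. It assumes unpadded ("textbook") RSA, because equal IDs can only map to equal ciphertexts if the encryption is deterministic, and standard randomised padding such as OAEP would break that property. The names make_anonymiser and anonymise, the 512-bit prime size, and the hex output format are all hypothetical choices made for illustration. The private exponent is never computed, and the primes are dropped as soon as the modulus exists.

```python
import secrets

E = 65537  # commonly used RSA public exponent (itself a prime)


def is_probable_prime(n, rounds=40):
    """Miller-Rabin probabilistic primality test."""
    if n < 2:
        return False
    for small in (2, 3, 5, 7, 11, 13, 17, 19, 23, 29):
        if n % small == 0:
            return n == small
    # Write n - 1 as d * 2**r with d odd.
    d, r = n - 1, 0
    while d % 2 == 0:
        d //= 2
        r += 1
    for _ in range(rounds):
        a = secrets.randbelow(n - 3) + 2  # random base in [2, n - 2]
        x = pow(a, d, n)
        if x in (1, n - 1):
            continue
        for _ in range(r - 1):
            x = pow(x, 2, n)
            if x == n - 1:
                break
        else:
            return False  # a is a witness that n is composite
    return True


def random_prime(bits):
    """Return a random probable prime p of the given size with E not dividing p - 1."""
    while True:
        # Force the top bit (full size) and the bottom bit (odd).
        candidate = secrets.randbits(bits) | (1 << (bits - 1)) | 1
        if candidate % E != 1 and is_probable_prime(candidate):
            return candidate


def make_anonymiser(prime_bits=512):
    """Build a one-way ID mapper (hypothetical helper, for illustration only)."""
    p = random_prime(prime_bits)
    q = random_prime(prime_bits)
    while q == p:
        q = random_prime(prime_bits)
    n = p * q
    # "Throwing the private key away": the private exponent is never derived,
    # and the names p and q are dropped here. Note that `del` only removes the
    # references; it does not securely wipe memory.
    del p, q
    max_len = (n.bit_length() - 1) // 8  # keeps the encoded ID below n

    def anonymise(identifier):
        data = identifier.encode("utf-8")
        if len(data) > max_len:
            raise ValueError("identifier too long for this modulus")
        m = int.from_bytes(data, "big")
        # Deterministic: the same identifier always yields the same token.
        return format(pow(m, E, n), "x")

    return anonymise
```

A dataset would be processed with a single anonymiser instance, for example `anonymise = make_anonymiser()` followed by `anonymise(author_id)` for every record, so that each contributor keeps one consistent token across the whole dataset. A new instance uses a new modulus and therefore produces unrelated tokens.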
Is it a requirement, at this point, to make it impossible for anyone to retrieve the original string?
I understand that providing an anonymous dataset is interesting, as you explained, but why couldn't you or the Eclipse Foundation keep the private RSA key safely to decode the IDs if you find some unexpected patterns? If you make the IDs anonymous and find a set of IDs which have a strange correlation that you'd like to explain, wouldn't it be helpful to decode the IDs and find out who the individuals behind them are, to better understand the cause of the correlation, or even set up chats with selected contributors to better understand their practices?
I have the impression there could be value in keeping the ability to decode strings, while I don't think fully discarding the key is much safer than keeping it in a safe place (like an EF server with strong restrictions on who can access the key).
My 2c (or maybe even less ;)
--
Mickael Istria
Eclipse IDE <https://www.eclipse.org/downloads/eclipse-packages/> developer, for Red Hat Developers <https://developers.redhat.com/>
_______________________________________________
cross-project-issues-dev mailing list
cross-project-issues-dev@xxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.eclipse.org/mailman/listinfo/cross-project-issues-dev
--
Mike Milinkovich
mike.milinkovich@xxxxxxxxxxxxxxxxxxxxxx
(m) +1.613.220.3223



