Skip to main content

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]
Re: [science-iwg] Science Top Level Project draft

that one is also a good example of a "leaky abstraction"

so to be avoided ;-)


Op 16/03/2016 om 12:24 schreef Matt.Gerring@xxxxxxxxxxxxx:

The second one is what my salary buys in Oxfordshire.

 

From: science-iwg-bounces@xxxxxxxxxxx [mailto:science-iwg-bounces@xxxxxxxxxxx] On Behalf Of Philip Wenig
Sent: 16 March 2016 10:55
To: Science Industry Working Group
Subject: Re: [science-iwg] Science Top Level Project draft

 

Matt,

you're right :-). It was a bad comparison ... I should have better taken a "booth" instead of a "shack". Then it looks rather like this:



than like this:





Cheers,
Philip

Am 16.03.2016 um 09:51 schrieb Matt.Gerring@xxxxxxxxxxxxx:

Bazaar rather than cathedrals I’m into that, yea cool… but shacks?

 

From: science-iwg-bounces@xxxxxxxxxxx [mailto:science-iwg-bounces@xxxxxxxxxxx] On Behalf Of Philip Wenig
Sent: 16 March 2016 06:43
To: science-iwg@xxxxxxxxxxx
Subject: Re: [science-iwg] Science Top Level Project draft

 

Yep, that's right. It's not about storing the data but to enable interoperability between different scientific techniques. Furthermore, I don't think it will be successful if we try to create THE ONE ALLMIGHTY interchange data format specification. Many tried this before and still try it, especially in my area:

https://en.wikipedia.org/wiki/Mass_spectrometry_data_format (JCAMP-DX, ANDI/AIA, mzData, mzXML, ...)
http://www.allotrope.org

IMHO ... they didn't reached/will not reach their goals as they are either too specific or too generic.

The better way is how we do it - it's the bazaar strategy ... the Eclipse bazaar. There are several shacks at the Science marketplace ... Matt and the DAWNSci shack, Jay and the ICE shack, Philip and the ChemClipse shack ... and there is e.g. Tobias who likes to combine the results of different analytical techniques (shacks) to perform some statistic evaluations. He is getting the data from each shack in a generic format bundled with the recipe how to read it. That's why I really like the HDF5 approach:

https://www.hdfgroup.org/HDF5

Now imagine how to combine data from shacks of different marketplaces like Science, LocationTech and IoT. We can do the same as before.


Best,
Philip



Am 16.03.2016 um 02:05 schrieb Jay Jay Billings:

So we are not talking about data storage per se, but the tools and libraries that make it possible? Something along the lines of

"Data storage tools, libraries and associated technologies to promote interoperability."

 

My concern is the size of scientific data sets so I want to be explicit that we don't plan to store anything. I don't want to put this line in the document and give them the feeling that they could ask us to store a Petabyte of data.

Jay

 

On Tue, Mar 15, 2016 at 10:04 AM, <Matt.Gerring@xxxxxxxxxxxxx> wrote:

Hello,

 

I do agree with Erwin. Synchrotron and neutron sources do have existing forums for meeting each other and sharing code, however we would be an easy win since we are doing similar things, at least computing wise.

 

All the best,

 

Matt

 

 

From: science-iwg-bounces@xxxxxxxxxxx [mailto:science-iwg-bounces@xxxxxxxxxxx] On Behalf Of Philip Wenig
Sent: 15 March 2016 13:39
To: science-iwg@xxxxxxxxxxx


Subject: Re: [science-iwg] Science Top Level Project draft

 

Thanks Erwin :-)
I skip the draft I was just about to send. This is exactly what I meant.


Best,
Philip

Am 15.03.2016 um 14:34 schrieb Erwin de Ley:

Jay, Philip, all,

Permit me to also add some 2-cents, based on what I've seen at the (mainly synchrotron) institutions that we've been working with. I'm sure Matt and others have a much deeper knowledge on this, and they can correct me, add more info etc. I just want to trigger some thoughts here ;-)

At the synchrotrons I've seen ideas and efforts that not only want to allow sharing datasets between different users, but also between different tools (interactive and automated).

Another topic is that datasets are becoming ever bigger, and the volume of them as well. Insofar that engineering has to provide storage and access services for the (visiting) scientists and that the moment is getting near that the raw data is no longer easily transportable, e.g. to a scientists home institution.

Some aspects of this :
1. common / standard storage formats, i.e. the technical format / filetype
2. within those technical "containers", trying to standardize nomenclature & functional structures
and provide abstraction tools that can work with different storage variations in a common way
3. large storage infrastructures and corresponding tools/APIs to allow storing huge volumes of datasets and make them accessible in a controlled manner
4....

E.g. :
1. is about HDF5, Nexus...
2. DAWN's loaders, Soleil's CDMA, I remember that Philip also described ChemClips's related features to work with many different file formats..
3. e.g. ICAT, http://pan-data.eu/about, ...

The Science IWG would be a great place to collect cross-domain requirements and to extract knowhow, APIs, libraries, ... to integrate some of these aspects!

cheers
erwin

Op 15/03/2016 om 13:14 schreef Jay Jay Billings:

Philip,

That's an interesting idea. Will you write a little more about what you have in mind?

Jay

On Mar 15, 2016 4:58 AM, "Erwin de Ley" <erwin.de.ley@xxxxxxxxxx> wrote:

All,

Great to see a top-level project being planned!

And a definite +1 for Tobias' and Philip's suggestions.

cheers
erwin

 

 

 

Op 15/03/2016 om 08:38 schreef Philip Wenig:

Jay, everyone,

it would be great if we could include under section "Scope" also:

* Storing scientific data and enable an easy exchange between researchers.


Best,
Philip

Am 15.03.2016 um 08:33 schrieb Tobias Verbeke:

Hi Jay,

 

Thanks for the revisions.

 


From: "Jay Jay Billings" <jayjaybillings@xxxxxxxxx>
To: "Science Industry Working Group" <science-iwg@xxxxxxxxxxx>
Sent: Tuesday, March 15, 2016 4:27:02 AM
Subject: Re: [science-iwg] Science Top Level Project draft

Tobias, everyone,

Here is an updated version with most of your changes worked in one way or the other. Please let me know what you think. Two things:

*BSD is already mentioned next to EDL.

*Exporting cryptographic algorithms is a violation of US export control laws. So we need to explicitly say that we will not work on cryptography or its applications.

OK... Adding a sentence that provides the background (along 'In order to comply with ..., ') may be useful.

 

Best,

Tobias

Doing compression or anonymization are separate subjects, regardless of the origins of the techniques.

Jay

 

On Mon, Mar 14, 2016 at 7:26 PM, Jay Jay Billings <jayjaybillings@xxxxxxxxx> wrote:

Tobias,

Good suggestions. I'll work them in.

Jay

On Mar 14, 2016 17:24, "Tobias Verbeke" <tobias.verbeke@xxxxxxxxxxxxxxxx> wrote:

L.S.

 

Some comments on the Science TLP draft:

 

- modeling and simulation is only one way to collect (in this case generate) and analyze scientific data (be it in the physical or social sciences); this could be broadened to collection and analysis of sample survey data and experimental data

- maybe add economics to the social sciences ?

- tools and libraries for statistics, machine learning, artificial intelligence, data mining, text mining

- data structures should not be limited to 3D; we want to live in more dimensions

- visualization: a detail, but 4D is not uncommon (including e.g. time dimension)

- I don't understand why we should explicitly exclude (the mathematics of) cryptography; certain anonymization or privacy-protecting procedures as applied to scientific data can use cryptographic techniques

- I would use Revised BSD or 3-clause BSD next to (or rather than) EDL; I understand the inclination to use EDL, but the name is not widely known (e.g. Wikipedia knows EPL but not EDL) and, if it does not make any practical difference, the other names are much more recognizable and therefore reassuring to people.

 

Just my two eurocents.

 

Best,

Tobias

 


From: "Jay Jay Billings" <jayjaybillings@xxxxxxxxx>
To: "Science Industry Working Group" <science-iwg@xxxxxxxxxxx>
Sent: Monday, March 14, 2016 7:09:44 PM
Subject: Re: [science-iwg] Science Top Level Project draft

Everyone,

This draft is under review by the Steering Committee too and we are going to review it one final time on Wednesday. We will share our thoughts with you then.

 

Ideally we would have as much community feedback addressed in the document as possible before we submit it to the Foundation. So please speak up if you have ideas!

 

Jay

 

On Mon, Mar 14, 2016 at 12:17 PM, Andrea Ross <andrea.ross@xxxxxxxxxxx> wrote:

Dear Everyone,

A draft of the Science Top Level Project can be viewed and commented on here. If you're interested in edit access, just ask and I'll grant it to you.

A few folks asked what a Top Level Project (TLP) is in our annual meeting. It's a code-less project that provides vetting for important practices of the projects hosted beneath it. The members of the TLP are called the Project Management Committee (PMC). Here's a page listing the kinds of things the PMC does: https://wiki.eclipse.org/PMC

A couple of key ones are:

  • reviewing, discussing, and approving/rejecting CQ requests before they go to the intellectual property (IP) team.
  • some checks & balances related to committer elections

Kind regards,

Andrea


_______________________________________________
science-iwg mailing list
science-iwg@xxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.eclipse.org/mailman/listinfo/science-iwg




--

Jay Jay Billings

Oak Ridge National Laboratory

Twitter Handle: @jayjaybillings


_______________________________________________
science-iwg mailing list
science-iwg@xxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.eclipse.org/mailman/listinfo/science-iwg


_______________________________________________
science-iwg mailing list
science-iwg@xxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.eclipse.org/mailman/listinfo/science-iwg




--

Jay Jay Billings

Oak Ridge National Laboratory

Twitter Handle: @jayjaybillings


_______________________________________________
science-iwg mailing list
science-iwg@xxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.eclipse.org/mailman/listinfo/science-iwg

 

_______________________________________________
science-iwg mailing list
science-iwg@xxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.eclipse.org/mailman/listinfo/science-iwg

 

-- 
~~~~~~~~~~~~~~~~~~~~~~~~
OpenChrom - the open source alternative for chromatography / mass spectrometry
Dr. Philip Wenig » Founder » philip.wenig@xxxxxxxxxxxxx » http://www.openchrom.net
~~~~~~~~~~~~~~~~~~~~~~~~

 

_______________________________________________
science-iwg mailing list
science-iwg@xxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.eclipse.org/mailman/listinfo/science-iwg

 


_______________________________________________
science-iwg mailing list
science-iwg@xxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.eclipse.org/mailman/listinfo/science-iwg




_______________________________________________
science-iwg mailing list
science-iwg@xxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.eclipse.org/mailman/listinfo/science-iwg





_______________________________________________
science-iwg mailing list
science-iwg@xxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.eclipse.org/mailman/listinfo/science-iwg

 

-- 
~~~~~~~~~~~~~~~~~~~~~~~~
OpenChrom - the open source alternative for chromatography / mass spectrometry
Dr. Philip Wenig » Founder » philip.wenig@xxxxxxxxxxxxx » http://www.openchrom.net
~~~~~~~~~~~~~~~~~~~~~~~~

 

-- 

This e-mail and any attachments may contain confidential, copyright and or privileged material, and are for the use of the intended addressee only. If you are not the intended addressee or an authorised recipient of the addressee please notify us of receipt by returning the e-mail and do not use, copy, retain, distribute or disclose the information in or attached to the e-mail.
Any opinions expressed within this e-mail are those of the individual and not necessarily of Diamond Light Source Ltd.
Diamond Light Source Ltd. cannot guarantee that this e-mail or any attachments are free from viruses and we cannot accept liability for any damage which you may sustain as a result of software viruses which may be transmitted in or with the message.
Diamond Light Source Limited (company no. 4375679). Registered in England and Wales with its registered office at Diamond House, Harwell Science and Innovation Campus, Didcot, Oxfordshire, OX11 0DE, United Kingdom
 


_______________________________________________
science-iwg mailing list
science-iwg@xxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.eclipse.org/mailman/listinfo/science-iwg




--

Jay Jay Billings

Oak Ridge National Laboratory

Twitter Handle: @jayjaybillings





_______________________________________________
science-iwg mailing list
science-iwg@xxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.eclipse.org/mailman/listinfo/science-iwg




-- 
~~~~~~~~~~~~~~~~~~~~~~~~
OpenChrom - the open source alternative for chromatography / mass spectrometry
Dr. Philip Wenig » Founder » philip.wenig@xxxxxxxxxxxxx » http://www.openchrom.net
~~~~~~~~~~~~~~~~~~~~~~~~



-- 
~~~~~~~~~~~~~~~~~~~~~~~~
OpenChrom - the open source alternative for chromatography / mass spectrometry
Dr. Philip Wenig » Founder » philip.wenig@xxxxxxxxxxxxx » http://www.openchrom.net
~~~~~~~~~~~~~~~~~~~~~~~~

 

-- 

This e-mail and any attachments may contain confidential, copyright and or privileged material, and are for the use of the intended addressee only. If you are not the intended addressee or an authorised recipient of the addressee please notify us of receipt by returning the e-mail and do not use, copy, retain, distribute or disclose the information in or attached to the e-mail.
Any opinions expressed within this e-mail are those of the individual and not necessarily of Diamond Light Source Ltd.
Diamond Light Source Ltd. cannot guarantee that this e-mail or any attachments are free from viruses and we cannot accept liability for any damage which you may sustain as a result of software viruses which may be transmitted in or with the message.
Diamond Light Source Limited (company no. 4375679). Registered in England and Wales with its registered office at Diamond House, Harwell Science and Innovation Campus, Didcot, Oxfordshire, OX11 0DE, United Kingdom
 



_______________________________________________
science-iwg mailing list
science-iwg@xxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
https://dev.eclipse.org/mailman/listinfo/science-iwg


Back to the top