
Re: [eclipse-incubator-e4-dev] [resources] EFS, ECF and asynchronous

Hi Martin,

Oberhuber, Martin wrote:
Hi Scott,

good points, indeed! thanks for taking the time to write
such an elaborate reply.

When blocking the calling thread (e.g. any synchronous reads/writes) results in system-wide 'problems' (e.g. UI is stopped, other server ...)

Hm... IMHO this is not a use case that requires async because it couldn't be implemented with synchronous calls. This just shows that somebody's using a synchronous API in a way that's inappropriate for slow/unreliable back-ends.

Yes...I guess the point is that any network is a relatively slow/unreliable backend compared to any disk.

This does point out an important truth, though:

synchronous APIs may *encourage* usage of background Jobs
for slow operations, but cannot enforce this. Asynchronous
APIs, on the other hand, *force* the client to take actions
which are appropriate for use with slow/unreliable back-ends.

True...because the default assumption for the network is that it is relatively slow and unreliable.
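
To make the "encourage vs. force" distinction concrete, here is a minimal sketch with entirely hypothetical interfaces (not actual EFS or ECF API): the synchronous shape lets a caller block the UI thread by accident, while the asynchronous shape makes the caller decide at the call site how completion and failure are handled.

import java.io.IOException;
import java.io.InputStream;

// Hypothetical synchronous shape: nothing stops a caller from invoking this
// on the UI thread and blocking it for as long as a slow back-end takes.
interface SyncFileAccess {
    InputStream openInputStream(String path) throws IOException;
}

// Hypothetical asynchronous shape: the caller must supply a callback, so the
// "what happens while this is slow or failing" question is answered up front.
interface AsyncFileAccess {
    void openInputStream(String path, ReadCallback callback);
}

interface ReadCallback {
    void opened(InputStream stream);
    void failed(IOException cause);
}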

From that point of view, it might actually make sense to have the "true E4 resources kernel" only support async file system access, and have the backward compatibility wrappers provide a bridge to synchronous access... that way we could force "true E4" clients to take appropriate measures. Given that ECF filetransfer is in Equinox already, I could imagine getting rid of EFS and replacing it with ECF filetransfer (probably extended) in the "core E4 Resources".

This seems too extreme to me. That is, EFS is an established, very nice synchronous file system API. No reason to 'get rid' of it for technical purity IMHO (i.e. that everything must be asynchronous over the network). Rather, it seems to me that having the ability to go between synchronous and asynchronous is the way to go...while also allowing for mixed strategies (like Hadoop-based EFS impls, which asynchronously replicate files/file blocks).


Futures as return values might be a concept that allows using asynchronous APIs with minimal extra effort when results are available "very fast and reliably".

I agree that futures (we have the class name 'AsynchResult'...the 'h' is embarrassing for me) can be a very useful concept for bridging asynchronous calls with synchronous needs (BTW, we use AsynchResult to get JRE 1.4 compatibility...the 1.5+ concurrent API also has futures, of course). But they are (still) a relatively foreign API concept...that is, not too familiar to many programmers. Still, I think they are useful.
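
As a sketch of the bridging idea with the 1.5+ concurrent API (the openRemoteStream method below is a made-up stand-in for a slow EFS/ECF read, not a real API): the asynchronous call returns a Future immediately, and a caller with synchronous needs simply blocks on get().

import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class FutureBridgeSketch {
    public static void main(String[] args) throws Exception {
        ExecutorService executor = Executors.newSingleThreadExecutor();

        // submit() returns immediately; the slow work runs on a background thread.
        Future<InputStream> pending = executor.submit(new Callable<InputStream>() {
            public InputStream call() throws Exception {
                return openRemoteStream("some://remote/file"); // hypothetical slow read
            }
        });

        // A synchronous caller blocks here; an asynchronous caller would instead
        // poll isDone() or carry on with other work until the result arrives.
        InputStream stream = pending.get();
        stream.close();
        executor.shutdown();
    }

    // Placeholder for a slow remote read; stands in for an EFS or ECF call.
    private static InputStream openRemoteStream(String uri) {
        return new ByteArrayInputStream(new byte[0]);
    }
}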


Writing an EFS wrapper to ECF filetransfer for backward
compatibility should be an easy thing to do (and probably
you have done it already). In terms of the resource layer,
EFS is pretty separated from it already (only connected
by URI on the API). Having the Resources layer directly
make asynchronous calls (instead of using the EFS wrapper)
should be a very interesting experiment.

Well, no, we haven't done this already, although we have done the reverse (implemented async ECF filetransfer on top of EFS+Jobs). It might be a useful exercise, but it seems to me that reusing more complete replication approaches (i.e. Hadoop, etc.) for implementing EFS on top of asynchronous access would be quicker and easier.
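
For what it's worth, the EFS+Jobs direction looks roughly like the following sketch. The AsyncReadListener interface here is made up for illustration; ECF's actual filetransfer listener API is event-based and considerably richer.

import java.io.InputStream;

import org.eclipse.core.filesystem.EFS;
import org.eclipse.core.filesystem.IFileStore;
import org.eclipse.core.runtime.CoreException;
import org.eclipse.core.runtime.IProgressMonitor;
import org.eclipse.core.runtime.IStatus;
import org.eclipse.core.runtime.Status;
import org.eclipse.core.runtime.jobs.Job;

// Hypothetical callback; the real ECF listener delivers events instead.
interface AsyncReadListener {
    void done(InputStream stream);
    void failed(CoreException cause);
}

public class AsyncOverEfsSketch {
    public static void openAsync(final IFileStore store, final AsyncReadListener listener) {
        Job job = new Job("Reading " + store.getName()) {
            protected IStatus run(IProgressMonitor monitor) {
                try {
                    // The synchronous (possibly slow) EFS call runs on a background thread.
                    listener.done(store.openInputStream(EFS.NONE, monitor));
                } catch (CoreException e) {
                    listener.failed(e);
                }
                return Status.OK_STATUS;
            }
        };
        job.schedule(); // returns immediately, so the caller is never blocked
    }
}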

Well, if such an adapter is not available then they could do it synchronously rather than asynchronously.

But that's exactly my point: we don't want clients having
to write code for both synchronous and asynchronous variants.
That's code duplication, resulting in bloat. I'd like to shoot for ONE core e4 api for each concept (with additional
compatibility layers for backward compatibility where needed).

Although I share your desire to reduce bloat, I'm not sure that offering either synchronous xor asynchronous access to resources (whether remote or local) is the natural way to keep bloat to a minimum when it comes to filesystem/resource access.

By "adding async to the EFS API" I didn't think about any
technical measure such as blowing up the IFileStore interface.
What I meant was, that clients should be able to expect any
contributed file system to be accessible with all the API that E4 resources FS exposes -- be it synchronous or asynchronous, via 1 or multiple interfaces, obtained via adapter pattern or otherwise.

It seems to me this is more a requirement on the file system implementer...i.e. that they implement the entire resources API (i.e. both sync and async)...right?

Although I think this is a good general principle (implementers should implement the entire relevant API), in practice I'm not sure how to require it given a provider architecture (for EFS and for ECF). That is, I'm sure that there will be incomplete EFS implementations, incomplete ECF file transfer implementations, etc. Encouraging completeness will be easy...requiring it will be hard, I expect.
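
As an illustration of the adapter-pattern variant from the client's side (the IAsyncFileStore and IFileInfoListener interfaces below are hypothetical, not existing EFS API), and of why completeness can only be encouraged rather than guaranteed: the adapter may simply not be there.

import org.eclipse.core.filesystem.EFS;
import org.eclipse.core.filesystem.IFileInfo;
import org.eclipse.core.filesystem.IFileStore;
import org.eclipse.core.runtime.CoreException;
import org.eclipse.core.runtime.IProgressMonitor;

public class AdapterBridgeSketch {

    // Hypothetical async interface that a file system provider *may* contribute.
    public interface IAsyncFileStore {
        void fetchInfoAsync(IFileInfoListener listener);
    }

    // Hypothetical listener for delivering the result off the calling thread.
    public interface IFileInfoListener {
        void infoReceived(IFileInfo info);
        void failed(CoreException cause);
    }

    public static void fetchInfo(IFileStore store, IFileInfoListener listener,
            IProgressMonitor monitor) throws CoreException {
        IAsyncFileStore async = (IAsyncFileStore) store.getAdapter(IAsyncFileStore.class);
        if (async != null) {
            // The provider chose to implement the asynchronous part of the API.
            async.fetchInfoAsync(listener);
        } else {
            // Incomplete provider: fall back to the existing synchronous EFS call.
            listener.infoReceived(store.fetchInfo(EFS.NONE, monitor));
        }
    }
}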


<stuff deleted>
I disagree. I think the problem is with trying to make local and remote access look exactly the same (network transparency).

Hm... on the other hand, a client that is prepared to deal with remote files should easily be able to handle the local
case as well, no? I'd like to investigate technical measures
of how we can make it simple to program the remote case.

Yes, I agree that it should be easy to handle both the local and remote cases...but that's the hard part...since the local and remote cases are different...in performance, reliability, partial failure, etc. and as the Note on Distributed Computing points out...these are differences that are very hard to create a uniform API for...because the differences in network and local behavior frequently 'bubble up' to the API.

But I do think that there is a lot of room for innovation...particularly around replication/caching/synchronization for file systems (e.g. Hadoop).

If the core framework is remote-aware we can add layers for simplified access if we want. We cannot do it the other way round.

True.

Can anybody argue against using the asynchronous ECF filetransfer APIs as the core E4 resources file system
layer?

Yes, I can (surprise :). I think introducing ECF/asynchronous access for the local file system would be a waste of time. Even though it would be easy to do (ECF's file transfer API already has asynch access to the local file system), I don't think it would be worth doing.

Although I'm not sure what the best way to 'bridge' EFS and the ECF file transfer APIs is (i.e. adapters, etc), I don't think it's really necessary or desirable to strictly layer them. An example of this is p2's usage of ECF...it only uses the file retrieval part of the ECF filetransfer API (it has no use for upload, or directory navigation). It's actually simpler and a better fit to just use that part (retrieval) of ECF filetransfer...and not have to deal with other dependencies that would be implied by including, say, all of EFS (with or without ECF underneath).
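
As a sketch of why that narrow dependency is attractive (hypothetical interfaces below, not the actual ECF or p2 API): a retrieve-only client needs something on the order of the following, and nothing more.

import java.io.OutputStream;
import java.net.URI;

// Hypothetical retrieval-only contract: a client like p2 would depend on this
// single capability, not on upload, directory browsing, or all of EFS.
interface IRetrieveOnly {
    void retrieve(URI remote, OutputStream target, RetrieveListener listener);
}

// Hypothetical listener; progress and completion arrive asynchronously.
interface RetrieveListener {
    void progress(long bytesReceived);
    void done(Exception failureOrNull); // null indicates success
}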

I understand (and fully appreciate) the desire to reduce API bloat (i.e. client code duplication, multiple APIs, etc), but I'm not sure of the best way to do that when it comes to synchronous/asynchronous (or local/network rather) access to filesystems.
Scott



