Re: [ecf-dev] E-intro [Was Efficient downloads]

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]

Re: [ecf-dev] E-intro [Was Efficient downloads]

From: Filip Hrbek <filip.hrbek@xxxxxxxxxxxxxx>
Date: Wed, 30 May 2007 19:03:21 +0200
Delivered-to: ecf-dev@xxxxxxxxxxx
List-archive: <https://dev.eclipse.org/mailman/listinfo/ecf-dev>
List-help: <mailto:ecf-dev-request@eclipse.org?subject=help>
List-subscribe: <https://dev.eclipse.org/mailman/listinfo/ecf-dev>, <mailto:ecf-dev-request@eclipse.org?subject=subscribe>
List-unsubscribe: <https://dev.eclipse.org/mailman/listinfo/ecf-dev>, <mailto:ecf-dev-request@eclipse.org?subject=unsubscribe>
Organization: Cloudsmith Inc.
User-agent: Thunderbird 2.0.0.0 (Windows/20070326)

Hi Scott, comments inside.

- resume from a different location (e.g. different mirror)
Hmm. Don't know how you are going to accomplish that withoutsomething quite different from normal http, but sounds interesting.

Not sure for what protocols we are able to implement. To do this, wemust be able to start downloading at a particular offset and finallycheck the file consistency, e.g. using a digest file if available. Wealso have to have a list of mirrors containing the same artifact (let'sassume we've obtained it somewhere). This should be possible with http.There could be API supporting this feature. Protocols which wouldn'tsupport this would either make a workaround, or throw an exception.

- retrieving information from special headers (like Content-Disposition)
- detecting URL redirections to final mirrors
I'm not sure what you are going to use to implement this, but would becurious to find out.

If you download a file from an URL, you have to discover the filename ifuser doesn't specify it explicitly. The most precise solution is parsingthe Content-Disposition header if it's available (browsers use it fordetermining the name of the file to save). Unlike other http headers,Content-Disposion has a very complex syntax. We should be able to parseit properly.

Detecting URL redirections would help us in statistics collection. Itwould be wrong to assign statistics belonging to different mirrors toone URL covering all the mirrors. This is why we should detect thatreading from the covering URL points to different mirrors on differentretrieval attempts. Finally we could automatically deprecate using someof the black-listed mirrors to avoid speed or timeout problems.

I think you would need to describe what statistics are desired here.We can easily add adapter interfaces for collecting statisticsassociated with a given file retrieval/all to ecf or individualproviders, but would need to know what stats are of interest.


The most interesting statistics:

- average download speed (related to concrete mirrors, geographicalprovider/consumer location, day time etc.)- amount of bytes downloaded from particular location / duringparticular time period

- frequency of timeouts including timeout values
- etc.

We could share the statistics among users in an application by storingthem on a server (the downloader would send the statistics to the serverautomatically). This would prevent users from attempts to accesscorrupted/slow repositories.




Regards
 Filip Hrbek

Follow-Ups:
- Re: [ecf-dev] E-intro [Was Efficient downloads]
  - From: Scott Lewis

References:
- [ecf-dev] Efficient downloads
  - From: Thomas Hallgren
- Re: [ecf-dev] Efficient downloads
  - From: Scott Lewis
- Re: [ecf-dev] Efficient downloads
  - From: Thomas Hallgren
- Re: [ecf-dev] Efficient downloads
  - From: Scott Lewis
- [ecf-dev] E-intro [Was Efficient downloads]
  - From: Thomas Hallgren
- Re: [ecf-dev] E-intro [Was Efficient downloads]
  - From: Scott Lewis
- Re: [ecf-dev] E-intro [Was Efficient downloads]
  - From: Filip Hrbek
- Re: [ecf-dev] E-intro [Was Efficient downloads]
  - From: Scott Lewis

Prev by Date: Re: [ecf-dev] E-intro [Was Efficient downloads]
Next by Date: Re: [ecf-dev] E-intro [Was Efficient downloads]
Previous by thread: Re: [ecf-dev] E-intro [Was Efficient downloads]
Next by thread: Re: [ecf-dev] E-intro [Was Efficient downloads]
Index(es):
- Date
- Thread

Breadcrumbs