[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[news.eclipse.employment] Internship offer at XRCE, France

We are looking to take one student for approximately 3 to 6 months at Xerox Research Centre Europe, located in France. The student should be a good quality Masters in software engineering. Experience in Eclipse and Java technology is important. Students should be fluent in English or French.


Title: Does Eclipse fit for document conversion?

Proposal: The Document Structure research group at Xerox Research Centre Europe is interested in mining large collections of unstructured documents and in recovering a meaningful structure for all these documents. As an example, we work with large legacy collections of PDF business documents that should be converted in XML with a customer-specific schema. Such conversion tasks involve using multiple tools and techniques ranging from developing components from standard technologies (e.g. Relax schema conception, XSLT transformation) up to highly specific pieces, many of them being researched and developed in our lab, together with a particular methodology. However, the setting up of a conversion platform today remains difficult and deserves some more support. Therefore we are interested in this project in exploring the use of the Eclipse open source environment for tying together existing and envisioned conversion components.

The objective of the project is to identify the desired functions the platform should provide and explore how the Eclipse environment can satisfy them.

This internship will deliver a proof of concept prototype and can fit for one or two students at their last year of engineering school or masters students, preferably experienced with Java and Eclipse.

XRCE provides an informal and relaxed working environment situated in the Parc de Maupertuis in Meylan, France. The successful students will be given the freedom and flexibility to find their own solutions and to work in a way that suits them but will have the guidance and support of experienced full-time Xerox researchers and thereby gain an introduction to the field of commercial research in a world-class research laboratory. The project is part of the Document Structure research area.