[pde-ui-dev] PDE API tools and ASM bytecode manipulation framework

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]

[pde-ui-dev] PDE API tools and ASM bytecode manipulation framework

From: Eugene Kuleshov <eu@xxxxxxxx>
Date: Tue, 08 Jan 2008 00:59:09 -0500
Delivered-to: pde-ui-dev@xxxxxxxxxxx
List-archive: <https://dev.eclipse.org/mailman/listinfo/pde-ui-dev>
List-help: <mailto:pde-ui-dev-request@eclipse.org?subject=help>
List-subscribe: <https://dev.eclipse.org/mailman/listinfo/pde-ui-dev>, <mailto:pde-ui-dev-request@eclipse.org?subject=subscribe>
List-unsubscribe: <https://dev.eclipse.org/mailman/listinfo/pde-ui-dev>, <mailto:pde-ui-dev-request@eclipse.org?subject=unsubscribe>
User-agent: Thunderbird 2.0.0.9 (Windows/20071031)

Hi everyone,

Chris Aniszczyk mentioned another day that PDE API tools are using ASMframework. Since I am one of the developers of ASM, I thought it wouldbe useful to share my thoughts and comments about PDE usage of ASM aswell as few questions that could help me to get better understanding ofyour implementation and use cases. My observations are more or lessrandom and mostly based on a quick scanning trough ASM API calls anddon't assume deep analysis of the PDE implementation. Please don't takethose notes as criticism, those simply suggestions for better usage ofthe ASM API that could help to improve performance and memory usage foryour application. I think it is really great application for ASM libraryand I am very excited about it.

First of all I see that PDE API tools is using "asm" and "asm-tree"jars. However "asm-tree" jar is usually used for complex in-memory classtransformations and method analysis, which doesn't seem the case for PDEAPI tools. I found two uses of the "tree" classes and they both can beremoved: org.eclipse.pde.api.tools.internal.comparator.TypeDescriptorand org.eclipse.pde.api.tools.internal.search.ClassFileVisitorBasically those classes are making inefficient use of ClassNode, whereit can be replaced either with org.objectweb.asm.commons.EmptyVisitorfrom "asm-commons" jar or your own dummy implementation of all ASM'svisitor interfaces, like theorg.eclipse.pde.api.tools.internal.util.ClassVisitorAdapter you have.That would improve processing performance and eliminate unnecessarymemory allocation.

If I understood TypeDescriptor.initialize() method correctly, it isnot interested in the method code, so you could useclassReader.accept(visitor, ClassReader.SKIP_CODE); to completely skipall methods code from visiting. Same applies to implementation ofSearchEngine.getExtraction(..) and TagScanner.Visitor.getMethods(..)methods, where you also can add ClassReader.SKIP_CODE to avoid visitingmethod code.

There is a feature in ASM that allows to skip unneeded methods andother class artifacts. So you can return a null from visitMethod() call,then method code will be also skipped (that also happens when you visitan abstract or native method). For example, inConverter.MyClassFileAdapter.visitMethod() you could rewrite thefollowing code:

MethodVisitor visitor = super.visitMethod(accessFlags,methodName, desc, signature, exceptions);

           if (visitor != null) {
               if (reportRefs) {
                   visitor = new MyMethodAdapter(visitor);
               } else {
                   visitor = new ClearCodeAttributeMethodAdapter(visitor);
               }
           }
           return visitor;

 like this :

MethodVisitor visitor = super.visitMethod(accessFlags,methodName, desc, signature, exceptions);

           if (visitor != null) {
               if (reportRefs) {
                   visitor = new MyMethodAdapter(visitor);
               } else {
                   visitor.visitEnd();  // for safety
                   visitor = null;
               }
           }
           return visitor;

The above essentially eliminating ClearCodeAttributeMethodAdapter,which should probably have been implementing MethodVisitor directlyinstead of extending MethodAdapter, ororg.objectweb.asm.commons.EmptyVisitor should have been used instead of it.

If none of your visitors actually cares about StackMap information,you may as well add ClassReader.SKIP_FRAMES flag toClassReader.accept(..) call when reading or transforming classes.

It is better to not extend ClassAdapter when visitor doesn't do anytransformations and it is used only to collect data, likeClassFileDescriptorBuilder does. I would suggest to use something likeEmptyVisitor for that. Though it is worth to mention thatorg.objectweb.asm.commons.EmptyVisitor is flattening the visitingevents, i.e. it don't make difference between method and fieldsannotations, but you can return new instance of the EmptyVisitorsubclass if you need to keep hierarchy.Also, code in ClassFileDescriptorBuilder.visitMethod() that extractsvalue for default annotation look fairly similar to the code inorg.objectweb.asm.util.TraceAnnotationVisitor class from ASM's"asm-util" jar, but your version doesn't seem add separator betweenarray entries.


 In ClassFileVisitor.visitMethod(..) method. The following code:
   switch(type.getSort()) {
     case Type.LONG :
     case Type.DOUBLE :
       argumentcount += 2;
    default:
       argumentcount++;
   }

 can be replaced with: argumentcount += type.getSize();

The state machine used in Converter.MyMethodAdapter (passingstringLiteral between visitLdcInsn() and visitMethodInsn() methods) isprobably going to work for classes compiled by javac or JDT, but therecould be legal bytecode that won't fit into that pattern, but I am notsure what is the implication of that for PDE API tools.

I think I understand what MyMethodAdapter does about reference types(which seem very neat idea), but I'd be interested to learn why you havespecial processing for ASTORE, IRETURN, RETURN, ATHROW and POP opcodesand not all INVOKE* opcodes?

In Converter.MyClassFileAdapter.visit(..) it is better to useOpcodes.V1_5 instead of magic number "49". Though I wonder why you needto use that version instead of version from the original class?In the same method, it might be better to use annotation instead ofcustom attribute. Annotations are usually easier to access (in thebytecode as well as trough reflectio API) and you can still store theminto classes with version <49. Though that is obviously matter ofpersonal preferences.I also curious what Converter.MyMethodWriter is used for? I don't seeany use of collected lastOffset value.

Hope you would find these notes somehow useful. Please let me know ifI could be at any help in regards to ASM framework and thanks again forusing it.


 Eugene

Follow-Ups:
- Re: [pde-ui-dev] PDE API tools and ASM bytecode manipulation framework
  - From: Darin Wright

Prev by Date: Re: [pde-ui-dev] OSGi Frameworks Extension point
Next by Date: Re: [pde-ui-dev] PDE API tools and ASM bytecode manipulation framework
Previous by thread: [pde-ui-dev] Action for bundle.update() in Plugin Registry Browser
Next by thread: Re: [pde-ui-dev] PDE API tools and ASM bytecode manipulation framework
Index(es):
- Date
- Thread

Breadcrumbs