[pdt-dev] Manipulating the PHPTokenizers grammar


From: Robert Gründler <r.gruendler@xxxxxxxxx>
Date: Tue, 14 Jun 2011 11:35:14 +0200

Hi,

After receiving the correct JFlex.jar, I managed to modify the PHPTokenizer.jflex
grammar and compile it into the PHPTokenizer Java class.

Tokenizing the template language's structures works by and large. The plugin
implementing it can be found here:

https://github.com/pulse00/Twig-Eclipse-Plugin

However, there are some minor bugs related to the way the JFlex grammar has been
extended, and hopefully someone on the list can help me out.

The tokens of the templating language reside inside the XML content, like this:


<div>
  {{ template code }}
</div>

My problem is that the grammar's rule for generic XML content always overrides my rule for opening template tags:


// initial rule to go to the template content state
// will always be overridden by the rule below, as that rule always matches
// more characters

<YYINITIAL>  "{{"{Whitespace}* {
 // switch to template content state
}


// initial rule to go to the XML content state

<YYINITIAL>  [^<&%]*|[&%]{S}+{Name}[^&%<]*|[&%]{Name}([^;&%<]*|{S}+;*) {
  // switch to xml content state
}


If the input to the tokenizer is only "{{", the first rule matches (both rules match the same two
characters, and the rule listed first wins the tie). But as soon as any other characters follow
("{{ foo"), the second rule matches because its match is longer, as described in the JFlex documentation.
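
To make the behaviour concrete, here is a minimal Java sketch that imitates the longest-match selection with java.util.regex. It is only an illustration, not the generated scanner: the class name LongestMatchDemo, the rule names and both patterns are simplified stand-ins I made up for this example (the real XML content rule has more alternatives).

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class LongestMatchDemo {
    // Rules in specification order. Both patterns are simplified,
    // hypothetical stand-ins for the rules in the grammar.
    static final Map<String, Pattern> RULES = new LinkedHashMap<>();
    static {
        RULES.put("TEMPLATE_OPEN", Pattern.compile("\\{\\{\\s*"));
        RULES.put("XML_CONTENT", Pattern.compile("[^<&%]*"));
    }

    // Returns the name of the rule that would win at the start of the input:
    // the longest match wins, and on equal length the earlier rule wins.
    // Returns null if no rule matches at least one character.
    static String winner(String input) {
        String best = null;
        int bestLen = 0;
        for (Map.Entry<String, Pattern> rule : RULES.entrySet()) {
            Matcher m = rule.getValue().matcher(input);
            // lookingAt() anchors the match at the start of the input.
            // The strict comparison keeps the earlier rule on ties.
            if (m.lookingAt() && m.end() > bestLen) {
                best = rule.getKey();
                bestLen = m.end();
            }
        }
        return best;
    }

    public static void main(String[] args) {
        System.out.println(winner("{{"));     // TEMPLATE_OPEN (tie at 2 chars)
        System.out.println(winner("{{ foo")); // XML_CONTENT (6 chars vs. 3)
    }
}
```

With "{{ foo" the template rule only covers "{{ " (3 characters) while the content rule covers all 6, which is exactly the situation I run into.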

It seems the authors of the original Smarty plugin had the same problem, which is probably why they chose to
detect the opening template tags via a custom function (findTwigDelimiter). Here's the code for it:

https://github.com/pulse00/Twig-Eclipse-Plugin/blob/master/org.eclipse.twig.core/Resources/parserTools/TwigTokenizer.jflex#L1951

Does anyone have a hint on how I can extend the PHPTokenizer so that the opening template tags always match
and are not swallowed by the generic XML content rule?

Any hints would be greatly appreciated!

thanks,


-robert
