Disambiguate between context declarations and context references #58

skaupper · 2023-09-20T13:11:27Z

This pull request fixes #16, among other things.

The context keyword is used for context declarations (which were already implemented) and context references (which were still missing). To disambiguate between these two (and to avoid having to deal with this issue at a higher level again), I implemented a lookahead mechanism.

Context.StartBlock contains a set of states which only check whether the token stream describes a context declaration, reference or neither. These states do not alter the tokens nor do they generate new blocks by themselves.
As soon as it is decideable, Context.StartBlock hands control over to either Context.ReferenceStartBlock or Context.DeclarationStartBlock respectively.

Since the TokenToBlockParser only supported looking at a token a single time, I created a second parser instance, which iterates over everything from TokenMarker (which holds the ContextKeyword token) up to the current token. The method TokenToBlockParser.ReparseFromTokenMarker can be used to do exactly that.

I also implemented a parser method TokenToBlockParser.HandleNonCodeTokens which can create non-code blocks (i.e. all kinds of whitespaces and comments) since the LRM allows them basically everywhere. While the LRM does not technically list delimited comments (/* ... */) as separators (§15.3), they effectively act as separators insofar as they separate adjacent lexical elements.

I would love to know your thoughts about these changes and if/how you want to integrate them!

…tions Add functions to the TokenToBlockParser which can be generally useful: HandleNonCodeTokens creates NonCode (i.e. whitespace and comment) blocks. The LRM allows these blocks basically everywhere. Note: Delimited comments are strictly speaking not separators according to the LRM (§15.3)! ReparseFromTokenMarker allows to iterate over a set of tokens a second time. This is useful if you cannot decide the block type based on a single token. `Context.StartBlock` searches ahead (without modifying tokens or adding blocks) until it can decide whether the `context` keyword is used in a context declaration or a context reference. Additionally, some changes are made to satisfy the static type checker.

skaupper · 2023-09-20T13:12:34Z

pyVHDLParser/Blocks/InterfaceObject.py

-			# 	parserState.PushState = ExpressionBlockEndedByLoopORToORDownto.stateExpression
-			# 	return
-			# elif token == ';':
+				parserState.NewToken = BoundaryToken(fromExistingToken=token)


Do you know why you commented out this block in the first place?

skaupper · 2023-09-20T13:15:21Z