vscode-R
vscode-R copied to clipboard
syntax: leading and trailing periods / dots not tokenized as part of identifier
Given an R document with the code:
.hello.world.
The leading + trailing .
s are tokenized as separate entities.
data:image/s3,"s3://crabby-images/eaf04/eaf047dec7de3b9ad501d559e061d939b5bec01b" alt="Screen Shot 2022-08-25 at 4 37 25 PM"
I believe the use of \b
here forces VSCode to consider .
as a word boundary, and so it fails to tokenize the whole identifier as a single token:
https://github.com/REditorSupport/vscode-R/blob/da579cc17a9485f7a11bf5e75bf6b9c347eca116/syntax/r.json#L165-L168
If so, I think those \b
s can be safely removed?
Yes, I think you are right. We should remove those \b
. Do you find unnecessary \b
in other matches?
I guess another example 42.
, which R parses as a single number but appears to be tokenized as 42
and .
:
data:image/s3,"s3://crabby-images/d4773/d4773371d1ef546b4468e50dec4b595e17001104" alt="Screen Shot 2022-08-25 at 5 33 20 PM"
https://github.com/REditorSupport/vscode-R/blob/da579cc17a9485f7a11bf5e75bf6b9c347eca116/syntax/r.json#L123-L126
And also for references to packages containing a .
in their name:
data:image/s3,"s3://crabby-images/17982/17982e95984bf49ffcd184a26c35e3e6e8b1a438" alt="Screen Shot 2022-08-25 at 5 39 10 PM"
I'll poke around a bit more for other edge cases...
Could one potential edge case be on S3 methods ? i.e.
as.data.frame.list()
though tbh I'm not exactly sure what you would want to tokenise here :s
This issue is stale because it has been open for 365 days with no activity.
unstale