Convert byte offset to char, to fix wrong highlight in non ascii text #95

artur-shaik · 2020-11-30T06:50:40Z

This PR contains code for fixing wrong highlight due to that vim works with bytes not chars. Should fix #61.

Must say that byteidx method in vim sometimes return not exact result, so I used some hacks to shift highlight in right position.

I did't test it with multyline highlight, because couldn't emulate such grammatic error.

This feature disabled by default, to enable it you need to set g:grammarous#convert_char_to_byte to 1.

autoload/grammarous.vim

rhysd · 2020-11-30T13:08:59Z

autoload/grammarous.vim

+        let e.errorlength = len(strcharpart(line_from, e.fromx, e.errorlength))
+        let e.fromx = byteidx(line_from,str2nr(e.fromx))+1
+        let e.tox = byteidx(line_to,str2nr(e.tox))
+        if ch_from =~ '\(\s\|[`<>!@#$%^&*(){}\[\].,:;\"''\\/]\)'


This check looks very ad-hoc.

What is a purpose of this condition? I could not read the intention

Please use =~# since =~ behavior depends on user's configuration

As I said in first message byteidx sometimes return not enough bytes, I couldn't determine why. That's why I used such hacky solution to shift highlight if first character is a symbol or whitespace.

I'm still not understanding the problem well. Would you help me to understand it by showing some example which requies this hack?

Here is example result without this conditional shift:

Second match is missed one character. Sometimes byteidx result is not enough for matchaddpos for one byte. That's why I used this hack to find if we are on word's start position.

And here result with shifting:

…into byte-to-char

artur-shaik · 2021-03-03T10:50:02Z

Ok, I finally could make deep dive in this feature. As a result I could remove this hacky condition.

artur-shaik · 2021-03-29T12:42:34Z

Also, added --line-by-line argument to LanguageTool. This fix LT response, when fromx position is in wrong place.

artur-shaik added 2 commits November 30, 2020 12:40

convert char x shift to bytes

54c68ec

update readme

87d4c07

rhysd requested changes Nov 30, 2020

View reviewed changes

artur-shaik added 3 commits December 1, 2020 14:20

add idents

a40fc16

enable char to byte convertion by default

450b30c

use match case in regexp

126163d

artur-shaik requested a review from rhysd December 1, 2020 08:34

artur-shaik added 3 commits December 2, 2020 10:06

use match case in regexp

3e35c1e

Merge branch 'byte-to-char' of github.com:artur-shaik/vim-grammarous …

057485f

…into byte-to-char

get rid of hacky code

aa2caff

artur-shaik added 2 commits March 26, 2021 22:29

cleanup

9bb3945

add line-by-line argument

deade37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Convert byte offset to char, to fix wrong highlight in non ascii text #95

Convert byte offset to char, to fix wrong highlight in non ascii text #95

artur-shaik commented Nov 30, 2020

rhysd Nov 30, 2020

artur-shaik Dec 1, 2020

rhysd Dec 11, 2020 •

edited

Loading

artur-shaik Dec 14, 2020

artur-shaik commented Mar 3, 2021 •

edited

Loading

artur-shaik commented Mar 29, 2021 •

edited

Loading

Convert byte offset to char, to fix wrong highlight in non ascii text #95

Are you sure you want to change the base?

Convert byte offset to char, to fix wrong highlight in non ascii text #95

Conversation

artur-shaik commented Nov 30, 2020

rhysd Nov 30, 2020

Choose a reason for hiding this comment

artur-shaik Dec 1, 2020

Choose a reason for hiding this comment

rhysd Dec 11, 2020 • edited Loading

Choose a reason for hiding this comment

artur-shaik Dec 14, 2020

Choose a reason for hiding this comment

artur-shaik commented Mar 3, 2021 • edited Loading

artur-shaik commented Mar 29, 2021 • edited Loading

rhysd Dec 11, 2020 •

edited

Loading

artur-shaik commented Mar 3, 2021 •

edited

Loading

artur-shaik commented Mar 29, 2021 •

edited

Loading