A problem for editing Non-ASCII characters #282

tkfm-yamaguchi · 2019-02-07T08:15:53Z

Hi,

When I edit a file which includes Non-ASCII characters as the comments, string literals, ..etc., vim-lsp goes wrong.

For example:

export enum E {
  S = '𐐀',
}

When I remove 𐐀 (assigning a empty string to E.S) in the code above,

Unterminated literal string error is shown on that line (the line S is defined).
The completion for E does not work.

After some investigation, what I've got is:

The number of character in the parameter to the server should be counted on a UTF-16 string representation (LSP SPEC).
strlen() is used in vim-lsp for the calculation.

=> vim-lsp sends the wrong character parameter to the server when the non-ascii characters (whose code point is more than 0x10000) are edited.

A solution for this problem is, I think, replacing strlen() with the function like below:

function! s:count_utf16_code_units(str) abort
  let l:len = strchars(a:str)
  let l:i = 0
  let l:cnt = 0

  while l:i < l:len
    let l:chr = strcharpart(a:str, l:i, 1)
    if char2nr(l:chr) > 0x10000
      let l:cnt = l:cnt + 2
    else
      let l:cnt = l:cnt + 1
    endif

    let l:i = l:i + 1
  endwhile

  return l:cnt
endfunction

I'm not good at Vim script, so I hope these information helps fixing the problem.

The text was updated successfully, but these errors were encountered:

Fixes #282

mattn · 2019-02-07T10:11:40Z

Thanks for your report about this. Could you please try #284 ?

tkfm-yamaguchi · 2019-02-07T13:09:59Z

@mattn
Thank you for the quick response.

Could you please try #284 ?

I've tried it in my TS project and tiny Python scripts (which occurred the similar errors), and both work well in my env.

I found the benchmarks in PR. Do you think this introduce the undue overhead ?

mattn · 2019-02-07T14:04:10Z

Thanks your confirm. I have to make another benchmark case. Current code use small string. It should use long string too.

tkfm-yamaguchi · 2019-02-07T15:58:03Z

It should use long string too.

I see, that should be done. If there are anything I can do, let me know.

I wish if this proposal submitted to official LSP was accepted ...

tkfm-yamaguchi · 2019-02-08T02:16:17Z

Could you please try #284 ?

Sorry, my inspection was insufficient, and it seems to need some more process for the 2 code units characters.

Adding such characters is now OK on count-utf16 branch, but removing is still not.

I compare the ts-server logs, for the '𐐀' removal on the example of this issue, which comes from vim(-lsp) and VisualStudioCode, and found that the position calculation for removing the 2 code units characters should also be aware the size of code units.

I mean, even though text is "", it should be treated as 2 size of characters are changed when the 2 code unit character is removed.

Here is the logs of ts-server:

vim(-lsp):

Info 104  [10:39:18.394] request:
    {"command":"change","seq":8,"type":"request","arguments":{"file":"[masked]","line":2,"offset":9,"endLine":2,"endOffset":10,"insertString":""}}

VSC:

[Trace  - 10:41:09] Sending request: change (218). Response expected: no. Current queue length: 0
Arguments: {
    "insertString": "",
    "file": "[masked]",
    "line": 2,
    "offset": 9,
    "endLine": 2,
    "endOffset": 11
}

(the values of "endOffset" are different)

I'm not sure, but it seems hard to detect the length of the removed character's code units (from the both point of implementation and efficiency).

mattn · 2019-07-26T07:46:49Z

Fixed by #447

tkfm-yamaguchi changed the title ~~A problem while editing Non-ASCII characters~~ A problem for editing Non-ASCII characters Feb 7, 2019

mattn added a commit that referenced this issue Feb 7, 2019

Counter utf-8 characters

4b8a36f

Fixes #282

mattn mentioned this issue Feb 7, 2019

Counter utf-16 characters #284

Closed

thomasfaingnaert mentioned this issue Jul 4, 2019

Completion for symbols including colons (":") #420

Closed

clason mentioned this issue Jul 4, 2019

References to vim positions can't handle non-ASCII chars #425

Closed

mattn closed this as completed Jul 26, 2019

aktau mentioned this issue Dec 1, 2020

lsp: references is very slow when there are references in many files neovim/neovim#13359

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

A problem for editing Non-ASCII characters #282

A problem for editing Non-ASCII characters #282

tkfm-yamaguchi commented Feb 7, 2019

mattn commented Feb 7, 2019

Uh oh!

tkfm-yamaguchi commented Feb 7, 2019 •

edited

Loading

Uh oh!

mattn commented Feb 7, 2019

Uh oh!

tkfm-yamaguchi commented Feb 7, 2019

Uh oh!

tkfm-yamaguchi commented Feb 8, 2019 •

edited

Loading

Uh oh!

mattn commented Jul 26, 2019

Uh oh!

A problem for editing Non-ASCII characters #282

A problem for editing Non-ASCII characters #282

Comments

tkfm-yamaguchi commented Feb 7, 2019

mattn commented Feb 7, 2019

Uh oh!

tkfm-yamaguchi commented Feb 7, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattn commented Feb 7, 2019

Uh oh!

tkfm-yamaguchi commented Feb 7, 2019

Uh oh!

tkfm-yamaguchi commented Feb 8, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattn commented Jul 26, 2019

Uh oh!

tkfm-yamaguchi commented Feb 7, 2019 •

edited

Loading

tkfm-yamaguchi commented Feb 8, 2019 •

edited

Loading