-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Chinese character displaying wrong in 'less' command #2592
Comments
This is not a sufficient issue description - we need at least the following:
As a bonus give us some hints about used environment (OS version + how the data ends up in xterm.js) as there are often hidden encoding issue at these steps. |
OS version: centos 7.2
|
@d270624 Please check/make sure, that both systems use an UTF8 locale. Furthermore please copy the characters in question here either as unicode chars or their hex values, so we can check to which unicode plane they belong (There are newer CJK codepoints that might be handled wrong by our current wcwidth). print(repr(u'<your char>'.encode('utf-8'))) and copy the output into a comment. |
@d270624 Does it output the correct chars with correct alignment if you do it locally (without ssh into centOS)? Can you run the same (locally + with ssh into foreign machine) in the xterm.js demo and check if the issue remains? If so, plz switch Edit: Just saw that this happens within less. Does the shell prompt input work correctly (locally and remote)? |
@jerch It is normal when executed locally, |
@d270624 Then this is most likely an encoding issue of the data on the way to xterm.js (either less itself, the ssh tunnel, docker?). Since you dont show the debug input/output I cannot help you to track this down. For general encoding issues plz have a look at https://xtermjs.org/docs/guides/encoding/. |
@d270624 Found a way to repro it, stay tuned... |
Ok here we go. Inputting '好' at the search line of less the following is sent to the terminal:
Meaning:
So the only difference here is the cursor back moving - one cell remotely, two cells locally. This pretty much looks like the systems dont agree on the wcwidth of that char and indeed - if I run less locally on that remote machine, any emulator shows the same issue with messed up input at the search line in less. The bottom line here is - those are incompatibilities between different Unicode/wcwidth versions used by systems. While one system thinks this is a half width char covering one cell, the other sees it as wide char covering 2 cells. Since the terminal interface has no way to level out this currently, it cannot be fixed. Not sure why less does this weird print+erase+print in the first place, but the different cell widths is the reason why we see one vs two erase commands. Still there is one bug in xterm.js linked to this issue - we do not erase full width chars properly as described here #1779. Thats the reason why the char ends up doubled partly overdrawn. |
The text was updated successfully, but these errors were encountered: