Outputting russian characters into the console - javascript

I am trying to output a russian string into the console like this
сonsole.log("Привет");
But the console ouputs this ПривеÑ. How to solve this problem?

You need to declare your site's (or scripts for that matter) encoding.
You can use <meta charset="UTF-8"> in the HEAD of your site to tell the whole page to be UTF-8 encoded. --
OR
If you just need your script to be encoded .. You can encode JUST the script -- IE <script type="text/javascript" charset="utf-8" src="blah.js"/>
Either way you should always tell your site/script which character set you are using.

Related

Convert unicode to Chinese characters

Supposing I have a string of code like so:
\u00e5\u00b1\u00b1\u00e4\u00b8\u008a\u00e7\u009a\u0084\u00e4\u00ba\u00ba
How would I convert these back into Chinese characters using Javascript:
山上的人
This is so that I can actually display Chinese on my web page. Right now it comes out as å±±ä¸ç人.
This website manages to accomplish this, however this is with PHP they don't expose.
I am not familiar with how character encoding works well at all, so I don't even know the terminology to search for a proper solution.
The string appears to be in UTF-8.
https://github.com/mathiasbynens/utf8.js is a helpful Javascript library that saves you the headache of learning the UTF-8 standard, and will decode the UTF-8 into text.
Here's a demo: https://mothereff.in/utf-8
Paste in \u00e5\u00b1\u00b1\u00e4\u00b8\u008a\u00e7\u009a\u0084\u00e4\u00ba\u00ba into the "UTF-8-encoded" textarea to decode it.
Add <meta charset="UTF-8"> inside the <head></head> tag of your HTML file so that it will display Chinese properly. Just put the Chinese characters directly in your HTML file

Why does this ​ sign pop up?

Whilst running a code on notepad ++ that makes traffic lights run automatically I notices that this sign pop ups next to one of the buttons, ​. I did some research and found out that to make it go away I need to specify the charset to equal utf-8. I did this and the sign went away however I am confused because the default character encoding in HTML5 is utf-8 and it is even shown in notepad that it is using utf-8.
I was wondering if someone could tell me why the sign pops up considering the fact that it was already encoded in utf-8.
There are a number of things that all need to be set to UTF8.
The original file, of course, needs to be UTF8.
However, there is also an HTML header that specifies the encoding of the file. If this header is set incorrectly, the browser may try another encoding.
So, using a specific over-ride in the HTML file can "work around" this issue.
There a bit of discussion here: <meta charset="utf-8"> vs <meta http-equiv="Content-Type">

Accents in inline javascript

I know that for javascript alerts accented characters should be written in unicode hex (for example š is \u0161).
But what if the javascript function which contains the alert is inlined inside a html document, that has a UTF-8 encoding? Should I still write the accented characters in unicode hex?
Example:
<!DOCTYPE html>
<html lang="ro">
<head>
<meta charset="UTF-8">
</head><body>
<script type="text/javascript">
window.onload=error_alert();function error_alert(){alert('A apărut o eroare!')}
</script>
</body></html>
Is it safe to write the character "ă" just like that, or should I use \u0103 ? Remember that meta charset is already defined as UTF-8.
Yes, you could still use escape sequences in JavaScript strings, even if they’re inline in the HTML.
Is it safe to write the character "ă" just like that, or should I use \u0103 ? Remember that meta charset is already defined as UTF-8.
It depends. If you’re sure that the HTML document includes <meta charset=utf-8> and will always do so, or if the server sends the Content-Type: text/html;charset=utf-8 HTTP header and you’re sure it will always do so, then by all means, use the raw symbols without escaping them.
There’s a problem, though — these things are not always under your control. Visitors might be behind a proxy that strips or mangles HTTP headers or modifies the encoding of the HTML response.
In any case, the safest thing to do is to use escape sequences for any non-ASCII characters, as that way the entire source code consists of ASCII characters only. (You could use HTML character references for the HTML parts of the document, and CSS escape sequences for CSS.)

Characters outside of ASCII are not displayed properly

I am trying to display characters outside ASCII but it doesn't work. I only get scrambled characters. The JavaScript file should also be encoded in UTF-8, at least IntelliJ says so. What is missing or causing the error?
I have this in the index.html (which also has its charset set to UTF-8).
<script src="javascript/app.js" charset="utf-8"></script>
Just trying to output
console.log("Å");
I have this in the index.html file. It is an AngularJs application.
<meta charset="utf-8"/>
Specify UTF-8 encoding in your HTML file. Here are some ways.
Check if your JavaScript file is really UTF-8-encoded (see also this question).

international characters in Javascript

I am working on a web application, where I transfer data from the server to the browser in XML.
Since I'm danish, I quickly run into problems with the characters æøå.
I know that in html, I use the "&aelig;&oslash;&aring;" for æøå.
however, as soon as the chars pass through JavaScript, I get black boxes with "?" in them when using æøå, and "æøå" is printed as is.
I've made sure to set it to utf-8, but that isn't helping much.
Ideally, I want it to work with any special characters (naturally).
The example that isn't working is included below:
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" "http://www.w3.org/TR/html4/strict.dtd">
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<title>Untitled Document</title>
<script type="text/javascript" charset="utf-8">
alert("æøå");
alert("æøå");
</script>
</head>
<body>
</body>
</html>
What am I doing wrong?
Ok, thanks to Grapefrukts answer, I got it working.
I actually needed it for data coming from an MySQL server. Since the saving of the files in UTF-8 encoding only solves the problem for static content, I figure I'd include the solution for strings from a MySQL server, pulled out using PHP:
utf8_encode($MyStringHere)
If you ever can't set the response encoding, you can use \u escape sequence in the JavaScript string literal to display these characters.
alert("\u00e6\u00f8\u00e5")
Just specifying UTF-8 in the header is not enough. I'd bet you haven't saved your file as UTF-8. Any reasonably advanced text editor will have this option. Try that and I'm sure it'll work!
You can also use String.fromCharCode() to output a character from a numeric entity.
e.g. String.fromCharCode( 8226 ) will create a bullet character.
I get "æøå" for the first one and some junk characters for the next. Could it be that the javascript is not mangling (or mojibake) your letters but the alert dialog uses the system default font, and the font is incapable of displaying the letters?
I use the code like this with Thai language. It's fine.
$message is my PHP variable.
echo("<html><head><meta charset='utf-8'></head><body><script type='text/javascript'>alert('" . $message . "');</script></body></html>");
Hope this can help. Thank you.
(I cannot post image of what I did as the system said "I don't have enough reputation", so I leave the image, here. http://goo.gl/9P3DtI Sorry for inconvenience.)
Sorry for my weak English.
This works as expected for me:
alert("æøå");
... creates an alert containing the string "æøå" whereas
alert("æøå");
... creates an alert with the non-ascii characters.
Javascript is pretty utf-8 clean and doesn't tend to put obstacles in your way.
Maybe you're putting this on a web server that serves it as ISO-8859-1? If you use Apache, in your Apache config file (or in .httaccess, if you can override), you should have a line
AddCharset utf-8 .js
(Note: edited to escape the ampersands... otherwise it didn't make sense.)

Categories

Resources