International language characters

Thalassicus

Bytes and Nibblers
Joined
Nov 9, 2005
Messages
11,057
Location
Texas
I recently learned it's possible to include international language characters like "ñ" directly, without using their decimal representations. To do this:

  1. Download notepad++
  2. Open your XML or SQL file.
  3. Encoding menu -> Convert to UTF-8 without BOM (byte order mark).

This appears to work correctly, with one caveat. The regular windows notepad appears to not save in the proper UTF-8 format, and adds a byte order mark. This messes up the file. MS Notepad is the default editor for such files, something people need to be aware of if they want to edit your files.
 
The standard on MS-Windows is to add a BOM to the start of Unicode files. Parsers should gracefully handle the presence or absence of any such BOM - they are trivial to detect and handle.

The fact that Civ does not gracefully handle this does NOT mean that MS-Notepad is broken, but exactly the opposite, yet again, Civ is broken.
 
The standard on MS-Windows is to add a BOM to the start of Unicode files. Parsers should gracefully handle the presence or absence of any such BOM - they are trivial to detect and handle.

The fact that Civ does not gracefully handle this does NOT mean that MS-Notepad is broken, but exactly the opposite, yet again, Civ is broken.

This is how it is in most things I've worked with. Generally, it's safe to not include bom.
 
Well I'm confused. Does it work or not? I pretty sure many players from different countries would apreciate possibility to write with their characters not using that stupid "use_extended font=1"
 
These were already working for me in XML files (with characters pasted into an XML file created in ModBuddy) and have been for the last 2 yrs. But wasn't working for SQL files (created in ModBuddy), so thanks for the fix.
 
I tried that with central europian characters (like. ě,š,č,ů,ę,ś,ą,ł...) and it wasn't working. Interesting is, that letter Ř,ř is working as default.
 
When importing Japanese text into another game, Notepad++ UTF without BOM doesn't work - it turns into a combination of random complex kanji and the odd square root sign or post symbol, with almost no kana, even though the original was mainly in kana. Is it the same for Civ 5?
 
hello, I'm translating CIV5 in ARABIC, and since arabic is written from right to left, I want to allign the long entries like the loading text for leaders right to left to avoid adding the [newline] every single time the "correct" line ending in arabic ends. because if I leave this as it is (without [newline]) it will cause the text to be in complete disorder and in the wrong allignement.
I found CRLF in text XML file.
How can I modify the xml file to force the text to be alligned right to left?
Thank you.
 
Back
Top Bottom