Unicode characters are not correctly parsed from byte[] if default charset is not UTF-8

The `TestUtf8.supportI18nBytes` test is currently broken on windows systems. Actually in my IDE (IntelliJ) the test is green. but the maven test target fails both in the IDE and on command line:

```
[ERROR] Failures:
[ERROR] TestUtf8.supportI18nBytes:62 Parsing bytes[] Amharic text ==> expected: <አማርኛ> but was: <áŠ áˆ›áˆ­áŠ›>
[ERROR] TestUtf8.supportI18nBytes:62 Parsing bytes[] Malayalam text ==> expected: <മലയാളം> but was: <à´®à´²à´¯à´¾à´³à´‚>
[ERROR] TestUtf8.supportI18nBytes:62 Parsing bytes[] Assyrian Neo-Aramaic text ==> expected: <ܐܬܘܪܝܐ> but was: <Ü?Ü¬Ü˜ÜªÜ?Ü?>
[ERROR] TestUtf8.supportI18nBytes:62 Parsing bytes[] Georgian text ==> expected: <მარგალური> but was: <áƒ›áƒ?áƒ áƒ’áƒ?áƒšáƒ£áƒ áƒ˜>
[ERROR] TestUtf8.supportI18nBytes:62 Parsing bytes[] Sinhala text ==> expected: <සිංහල ජාතිය> but was: <à·ƒà·’à¶‚à·„à¶½ à¶¢à·?à¶­à·’à¶º>
[ERROR] TestUtf8.supportI18nBytes:62 Parsing bytes[] Japanese text ==> expected: <日本語> but was: <æ—¥æœ¬èªž>
[ERROR] TestUtf8.supportI18nBytes:62 Parsing bytes[] Russian text ==> expected: <Русский> but was: <Ð ÑƒÑ?Ñ?ÐºÐ¸Ð¹>
[ERROR] TestUtf8.supportI18nBytes:62 Parsing bytes[] Farsi text ==> expected: <فارسی> but was: <Ù?Ø§Ø±Ø³ÛŒ>
[ERROR] TestUtf8.supportI18nBytes:62 Parsing bytes[] Korean text ==> expected: <한국어> but was: <í•œêµ­ì–´>
[ERROR] TestUtf8.supportI18nBytes:62 Parsing bytes[] Armenian text ==> expected: <Հայերեն> but was: <Õ€Õ¡ÕµÕ¥Ö€Õ¥Õ¶>
[ERROR] TestUtf8.supportI18nBytes:62 Parsing bytes[] Hindi text ==> expected: <हिन्दी> but was: <à¤¹à¤¿à¤¨à¥?à¤¦à¥€>
[ERROR] TestUtf8.supportI18nBytes:62 Parsing bytes[] Hebrew text ==> expected: <עברית> but was: <×¢×‘×¨×™×ª>
[ERROR] TestUtf8.supportI18nBytes:62 Parsing bytes[] Chinese text ==> expected: <中文> but was: <ä¸­æ–‡>
```

The problem seems to be the creation of `String` using the system's default charset in `JSONParserByteArray.extractString`

https://github.com/netplex/json-smart-v2/blob/604281d6b085f02b745c1ef8f04b056035acf127/json-smart/src/main/java/net/minidev/json/parser/JSONParserByteArray.java#L62

and `JSONParserByteArray.extractStringTrim`

https://github.com/netplex/json-smart-v2/blob/604281d6b085f02b745c1ef8f04b056035acf127/json-smart/src/main/java/net/minidev/json/parser/JSONParserByteArray.java#L74

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Unicode characters are not correctly parsed from byte[] if default charset is not UTF-8 #73

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Unicode characters are not correctly parsed from byte[] if default charset is not UTF-8 #73

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions