Comment by 🚂 MrSVCD

Re: "It seems that CJK (Chinese-Japanese-Korean) posts are…"

In: s/AskGemini

@ColonelThirtyTwo That is true but the most common C&K characters have their own entries in unicode.

I think that unicode is trying to go precent encoded to not go to 5 bytes of utf-8.

🚂 MrSVCD

Mar 12 · 8 weeks ago

3 Later Comments ↓

🍵 tacomanator [OP] · Mar 12 at 23:58:

@skyjake thank you for your help. From there I found a way to post long text from the draft page after enabling Titan in the BBS settings.

The help mentions a ":" command to enter long text mode. I haven't figured how to get that to work yet, but for now I'm happy to have least one have one working method!

🚬 sy · Mar 13 at 15:47:

Maybe this (RFC2718 §2.2.5) should be explicitly allowed in gemini specification:

Unless there is some compelling reason for a particular scheme to do otherwise, translating character sequences into UTF-8 and then subsequently using the %HH encoding for *unsafe* octets is recommended.

Apparently most servers –including BBS and station– already allow it.

— Test with more than 300 kanji characters

🚂 MrSVCD · Mar 13 at 18:04:

Thanks @sy, that explains the difference between what I thought and what op said.

Original Post

🌒 s/AskGemini

🍵 tacomanator:

It seems that CJK (Chinese-Japanese-Korean) posts are effectively limited to about 100 characters due to limit of 1024 bytes for URIs in Gemini (each character is 9 bytes after encoding). Has there been discussion on this matter? It constrains CJK posts to about 100 characters: a sentence or two.

💬 10 comments · Mar 11 · 8 weeks ago