U++ forum: Welcome to the forum

Status & Roadmap

Authors & License

Funding Ultimate++

Search on this site

Search in forums

Home » Developing U++ » U++ Developers corner » Choosing the best way to go full UNICODE

Show: Today's Messages :: Show Polls :: Message Navigator
E-mail to friend

Return to the default flat view

Create a new topic

Submit Reply

Re: Choosing the best way to go full UNICODE [message #48180 is a reply to message #48179]

Wed, 31 May 2017 11:00

mirek is currently offline

mirek
Messages: 13975
Registered: November 2005

Ultimate Member

cbpporter wrote on Wed, 31 May 2017 10:30

It looks like there are many possible ways to go forward. We can try several things and probably a lot of things will work.

As long as we understand that there is no universal way to make Unicode indexable, but on a case by case basis, you can. The only thing you can universally do is to iterate linearly over Unicode.

If you can iterate linearly, Unicode is indexable...

Quote:

But I still think I gave you a partial solution so many years ago.

To reiterate:
1. Utf8 to Utf16 and vice-versa must be fixed under all scenarios. We also need to add Utf8 to Utf32, but that is trivial compared to Utf16. So proper error recovery must be implemented and 4 byte long sequences must be converted to surrogate pairs.

Agreed.

Quote:

2. The Unicode table must be expanded to more than 2048 characters. Maybe not full range, but Unicode is based on blocks. We can move a bit closer to the CJK block, because for CJK, nobody expects more than the basics. Probably 8k at least.

Not so sure about this - not that important IMO at this point. So I will not get correct ToUpper for many characters - that has little impact on most applications.

Quote:

4. Implement DString? I don't know yet. On the other hand, implementing DSting is easy and it can be dropped in its own header and available for use.

I am now leaning against it. Vector<int> is good enough for utf32 - what we eventually need to do with it.

I am now really thinking that "multibyte" String is the solution. The one that returns a variable sequence of bytes for each position. I am now even thinking this does not need to be bound to graphemes only.

The longterm point with that is to replace WString as processing facility in editors.

Report message to a moderator

Send a private message to this user

[Message index]

		Choosing the best way to go full UNICODE By: mirek on Sat, 27 May 2017 16:58
		Re: Choosing the best way to go full UNICODE By: Zbych on Sat, 27 May 2017 20:02
		Re: Choosing the best way to go full UNICODE By: mirek on Sat, 27 May 2017 20:48
		Re: Choosing the best way to go full UNICODE By: mirek on Sat, 27 May 2017 20:51
		Re: Choosing the best way to go full UNICODE By: cbpporter on Mon, 29 May 2017 14:36
		Re: Choosing the best way to go full UNICODE By: mirek on Mon, 29 May 2017 19:40
		Re: Choosing the best way to go full UNICODE By: cbpporter on Tue, 30 May 2017 10:31
		Re: Choosing the best way to go full UNICODE By: mirek on Tue, 30 May 2017 11:03
		Re: Choosing the best way to go full UNICODE By: cbpporter on Tue, 30 May 2017 11:23
		Re: Choosing the best way to go full UNICODE By: mirek on Tue, 30 May 2017 11:45
		Re: Choosing the best way to go full UNICODE By: cbpporter on Wed, 31 May 2017 10:30
		Re: Choosing the best way to go full UNICODE By: mirek on Wed, 31 May 2017 11:00
		Re: Choosing the best way to go full UNICODE By: cbpporter on Wed, 31 May 2017 11:30
		Re: Choosing the best way to go full UNICODE By: mirek on Wed, 31 May 2017 12:07
		Re: Choosing the best way to go full UNICODE By: cbpporter on Wed, 31 May 2017 12:26
		Re: Choosing the best way to go full UNICODE By: mirek on Wed, 31 May 2017 12:40
		Re: Choosing the best way to go full UNICODE By: cbpporter on Wed, 31 May 2017 13:12
		Re: Choosing the best way to go full UNICODE By: mirek on Wed, 31 May 2017 13:20
		Re: Choosing the best way to go full UNICODE By: cbpporter on Wed, 31 May 2017 13:43
		Re: Choosing the best way to go full UNICODE By: mirek on Wed, 31 May 2017 14:41
		Re: Choosing the best way to go full UNICODE By: cbpporter on Wed, 31 May 2017 15:06
		Re: Choosing the best way to go full UNICODE By: mirek on Wed, 31 May 2017 15:25
		Re: Choosing the best way to go full UNICODE By: mirek on Wed, 31 May 2017 15:34
		Re: Choosing the best way to go full UNICODE By: cbpporter on Wed, 31 May 2017 15:38
		Re: Choosing the best way to go full UNICODE By: mirek on Wed, 31 May 2017 15:50
		Re: Choosing the best way to go full UNICODE By: mirek on Mon, 05 June 2017 17:51
		Re: Choosing the best way to go full UNICODE By: cbpporter on Tue, 06 June 2017 09:28
		Re: Choosing the best way to go full UNICODE By: mirek on Tue, 06 June 2017 10:41
		Re: Choosing the best way to go full UNICODE By: cbpporter on Tue, 06 June 2017 11:18
		Re: Choosing the best way to go full UNICODE By: mirek on Tue, 06 June 2017 13:21
		Re: Choosing the best way to go full UNICODE By: cbpporter on Tue, 06 June 2017 13:39
		Re: Choosing the best way to go full UNICODE By: cbpporter on Tue, 06 June 2017 13:58
		Re: Choosing the best way to go full UNICODE By: cbpporter on Thu, 08 June 2017 10:00
		Re: Choosing the best way to go full UNICODE By: mirek on Thu, 08 June 2017 10:26
		Re: Choosing the best way to go full UNICODE By: cbpporter on Thu, 08 June 2017 10:43
		Re: Choosing the best way to go full UNICODE By: cbpporter on Thu, 08 June 2017 11:22
		Re: Choosing the best way to go full UNICODE By: cbpporter on Thu, 08 June 2017 13:00
		Re: Choosing the best way to go full UNICODE By: mirek on Sun, 11 June 2017 13:57
		Re: Choosing the best way to go full UNICODE By: cbpporter on Mon, 12 June 2017 09:39
		Re: Choosing the best way to go full UNICODE By: mirek on Mon, 12 June 2017 10:13
		Re: Choosing the best way to go full UNICODE By: cbpporter on Mon, 12 June 2017 10:21
		Re: Choosing the best way to go full UNICODE By: mirek on Mon, 12 June 2017 10:28
		Re: Choosing the best way to go full UNICODE By: mirek on Mon, 12 June 2017 10:31
		Re: Choosing the best way to go full UNICODE By: cbpporter on Mon, 12 June 2017 10:53
		Re: Choosing the best way to go full UNICODE By: mirek on Mon, 12 June 2017 10:57
		Re: Choosing the best way to go full UNICODE By: cbpporter on Mon, 12 June 2017 11:37
		Re: Choosing the best way to go full UNICODE By: mirek on Mon, 12 June 2017 11:41
		Re: Choosing the best way to go full UNICODE By: cbpporter on Mon, 12 June 2017 12:50
		Re: Choosing the best way to go full UNICODE By: cbpporter on Mon, 12 June 2017 13:06
		Re: Choosing the best way to go full UNICODE By: cbpporter on Mon, 12 June 2017 14:20
		Re: Choosing the best way to go full UNICODE By: cbpporter on Tue, 13 June 2017 16:31
		Re: Choosing the best way to go full UNICODE By: cbpporter on Wed, 14 June 2017 11:07
		Re: Choosing the best way to go full UNICODE By: cbpporter on Wed, 14 June 2017 12:07
		Re: Choosing the best way to go full UNICODE By: mirek on Wed, 14 June 2017 12:30
		Re: Choosing the best way to go full UNICODE By: cbpporter on Wed, 14 June 2017 12:42
		Re: Choosing the best way to go full UNICODE By: mirek on Wed, 14 June 2017 19:09
		Re: Choosing the best way to go full UNICODE By: cbpporter on Wed, 14 June 2017 23:19
		Re: Choosing the best way to go full UNICODE By: cbpporter on Wed, 14 June 2017 23:31
		Re: Choosing the best way to go full UNICODE By: mirek on Mon, 19 June 2017 10:03
		Re: Choosing the best way to go full UNICODE By: cbpporter on Mon, 19 June 2017 10:22
		Re: Choosing the best way to go full UNICODE By: mirek on Mon, 19 June 2017 10:40
		Re: Choosing the best way to go full UNICODE By: cbpporter on Mon, 19 June 2017 10:51
		Re: Choosing the best way to go full UNICODE By: mirek on Mon, 19 June 2017 10:58
		Re: Choosing the best way to go full UNICODE By: cbpporter on Mon, 19 June 2017 11:07
		Re: Choosing the best way to go full UNICODE By: cbpporter on Mon, 19 June 2017 10:23
		Re: Choosing the best way to go full UNICODE By: mirek on Wed, 14 June 2017 12:17

Previous Topic:	Some addition proposals
Next Topic:	Help needed with link errors (serversocket)

Goto Forum:

-=] Back to Top [=-

[ Syndicate this forum (XML) ] [

] [

PDF

]

Current Time: Mon May 06 14:09:13 CEST 2024

Total time taken to generate the page: 0.03010 seconds