Draft proposal: Clarify guidance for use of a BOM as a UTF-8 encoding signature

Hans Åberg haberg-1 at telia.com
Tue Oct 13 13:32:26 CDT 2020


There is only a U+FEFF ZERO WIDTH NO-BREAK SPACE, and if somebody wants to use it to mean something else, that is something Unicode should not worry about.


> On 13 Oct 2020, at 19:45, Tom Honermann <tom at honermann.net> wrote:
> 
> On 10/13/20 4:57 AM, Hans Åberg wrote:
>> It would be best if stated that its use is a type of metadata, and such, Unicode has no opinion on its use.
> 
> I'm interpreting that as an endorsement for the first suggested resolution in the paper.
> 
> Tom.
> 
>> 
>> 
>>> On 10 Oct 2020, at 20:54, Tom Honermann via Unicode <unicode at unicode.org> wrote:
>>> 
>>> Attached is a draft proposal for the Unicode standard that intends to clarify the current recommendation regarding use of a BOM in UTF-8 text.  This is follow up to discussion on the Unicode mailing list back in June.
>>> 
>>> Feedback is welcome.  I plan to submit this to the UTC in a week or so pending review feedback.
>>> 
>>> Tom.
>>> 
>>> <Unicode-BOM-guidance.pdf>
> 
> 




More information about the Unicode mailing list