Skip to content

[lex] Provide unicode name for all control characters #7404

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
[lex] Provide unicode name for all control characters
This commit does not touch the new-line character as paper
P2348.  It resricts itself to consistent use of the unicode
character name for space, horizontal tab, and vertical tab.
Compared to PR #7359 it deliberately does not touch the
grammar that would necessitate a review by core review.
The intent is to rebase that PR if this one lands.
  • Loading branch information
AlisdairM committed Jul 17, 2025
commit bd462972090242542cf88dd66002484f9af23ee7
7 changes: 4 additions & 3 deletions source/lex.tex
Original file line number Diff line number Diff line change
Expand Up @@ -140,9 +140,9 @@
would arise from a source file ending with an unclosed \tcode{/*}
comment.
\end{footnote}
Each comment\iref{lex.comment} is replaced by one space character. New-line characters are
Each comment\iref{lex.comment} is replaced by one \unicode{0020}{space} character. New-line characters are
retained. Whether each nonempty sequence of whitespace characters other
than new-line is retained or replaced by one space character is
than new-line is retained or replaced by one \unicode{0020}{space} character is
unspecified.
As characters from the source file are consumed
to form the next preprocessing token
Expand Down Expand Up @@ -882,7 +882,8 @@
\end{footnote}
operators, and other separators.
\indextext{whitespace}%
Blanks, horizontal and vertical tabs, newlines, formfeeds, and comments
Comments and the characters \unicode{0020}{space}, \unicode{0009}{character tabulation},
\unicode{000b}{line tabulation}, \unicode{000c}{form feed}, and new-line
(collectively, ``whitespace''), as described below, are ignored except
as they serve to separate tokens.
\begin{note}
Expand Down
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy