[DOC] Tweaks for String#dump #13883

BurdetteLamar · 2025-07-14T19:47:17Z

No description provided.

nobu · 2025-07-15T09:05:48Z

doc/string/dump.rdoc

+  s        # => "\a\b\t\n\v\f\r"
+  s.dump   # => "\"\\a\\b\\t\\n\\v\\f\\r\""
+
+Multi-byte characters are rendered in unicode notation:


It is only for Unicode encodings.

Does this mean that there are multi-byte characters that are not in Unicode encodings? If so, I'll need examples.
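To make the question concrete, here is a minimal sketch that dumps the same text in UTF-8 and in two multi-byte encodings that are not Unicode encodings. EUC-JP and Shift_JIS are my own choice of examples and are not mentioned in this thread; the sketch only reports which escape style appears and does not claim exact output.

```ruby
# Sketch only: EUC-JP and Shift_JIS are illustrative choices, not taken
# from the pull request.
text = 'こんにちは'

%w[UTF-8 EUC-JP Shift_JIS].each do |name|
  encoded = text.encode(name)
  dumped  = encoded.dump

  # Single-quoted '\u' and '\x' are a literal backslash plus one letter.
  unicode_escapes = dumped.include?('\u')
  hex_escapes     = dumped.include?('\x')
  round_trips     = dumped.undump == encoded

  puts "#{name}: #{encoded.bytesize} bytes for #{encoded.size} characters"
  puts "  unicode escapes: #{unicode_escapes}"
  puts "  hex escapes:     #{hex_escapes}"
  puts "  undump restores: #{round_trips}"
end
```

Each character occupies more than one byte in all three encodings, so comparing the reported escape styles shows whether "multi-byte" alone decides the notation.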

BurdetteLamar · 2025-07-16T15:22:31Z

@peterzhu2118, I'll need help with this:

I think I need examples of multi-byte characters that will not dump in Unicode notation.
Old doc notwithstanding, I found no character that dumps in hexadecimal notation. Are there any?

peterzhu2118 · 2025-07-16T15:34:03Z

I think I need examples of multi-byte characters that will not dump in Unicode notation.

For example:

'тест'.dump # => "\"\\u0442\\u0435\\u0441\\u0442\""
'тест'.encode('utf-16le').dump # => "\"B\\x045\\x04A\\x04B\\x04\".dup.force_encoding(\"UTF-16LE\")"

Old doc notwithstanding, I found no character that dumps in hexadecimal notation. Are there any?

Sorry, I don't understand what you mean by this.

BurdetteLamar · 2025-07-16T15:37:20Z

Thanks, @peterzhu2118. What you've written above answers both questions (however poorly they're posed).
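As a quick check of the two calls in the example above, the following sketch dumps the same text in UTF-8 and UTF-16LE and then reverses each dump with String#undump. The variable names are mine, and the round-trip check is an addition for illustration rather than something proposed in this pull request.

```ruby
# Sketch: dump the same text in two encodings and verify that undump
# restores the original string, including its encoding.
utf8  = 'тест'
utf16 = utf8.encode('utf-16le')

[utf8, utf16].each do |original|
  dumped   = original.dump
  restored = dumped.undump

  puts "#{original.encoding}: #{dumped}"
  puts "  ASCII only:      #{dumped.ascii_only?}"
  puts "  undump restores: #{restored == original}"
  puts "  encoding kept:   #{restored.encoding == original.encoding}"
end
```

In both cases the dumped form is plain ASCII; the difference lies in the escape notation and in the trailing force_encoding call for UTF-16LE.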

peterzhu2118 · 2025-07-18T17:55:26Z

doc/string/dump.rdoc

+  s = 'hello'
+  s.encoding                # => #<Encoding:UTF-8>
+  s.dump                    # => "\"hello\""
+  s.encode('utf-16').dump   # => "\"\\xFE\\xFF\\x00h\\x00e\\x00l\\x00l\\x00o\".dup.force_encoding(\"UTF-16\")"
+  s.encode('utf-16le').dump # => "\"h\\x00e\\x00l\\x00l\\x00o\\x00\".dup.force_encoding(\"UTF-16LE\")"
+
+  s = 'тест'
+  s.encoding                # => #<Encoding:UTF-8>
+  s.dump                    # => "\"\\u0442\\u0435\\u0441\\u0442\""
+  s.encode('utf-16').dump   # => "\"\\xFE\\xFF\\x04B\\x045\\x04A\\x04B\".dup.force_encoding(\"UTF-16\")"
+  s.encode('utf-16le').dump # => "\"B\\x045\\x04A\\x04B\\x04\".dup.force_encoding(\"UTF-16LE\")"
+
+  s = 'こんにちは'
+  s.encoding                # => #<Encoding:UTF-8>
+  s.dump                    # => "\"\\u3053\\u3093\\u306B\\u3061\\u306F\""
+  s.encode('utf-16').dump   # => "\"\\xFE\\xFF0S0\\x930k0a0o\".dup.force_encoding(\"UTF-16\")"
+  s.encode('utf-16le').dump # => "\"S0\\x930k0a0o0\".dup.force_encoding(\"UTF-16LE\")"


I think it would be better to move the examples of non-UTF-8 encodings to a separate section, with some text describing the behavior (e.g. using hexadecimal format and appending .dup.force_encoding(<encoding name>)). This is because non-UTF-8 is more of an edge case than a commonly used case.
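The two traits named in this suggestion (hexadecimal escapes and the appended force_encoding call) can be observed with a small helper. This is a sketch under stated assumptions: describe_dump is a hypothetical name of mine, not part of Ruby, and its substring checks are deliberately naive (a string that itself contains a backslash followed by "u" or "x" would be misreported).

```ruby
# Hypothetical helper (not part of Ruby): reports which escape notation
# String#dump used and whether it appended a force_encoding suffix.
def describe_dump(str)
  dumped = str.dump
  suffix = ".force_encoding(\"#{str.encoding.name}\")"

  {
    encoding:        str.encoding.name,
    unicode_escapes: dumped.include?('\u'), # literal backslash + "u"
    hex_escapes:     dumped.include?('\x'), # literal backslash + "x"
    encoding_suffix: dumped.end_with?(suffix)
  }
end

['hello', 'тест', 'тест'.encode('utf-16le')].each do |s|
  p describe_dump(s)
end
```

Grouping strings by the returned flags is one way to decide which examples belong in the main section and which belong in a separate non-UTF-8 section.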

I've moved the cited lines to the end. I think you want other changes, but I'm not sure what exactly is needed. Can you fix up one, as a guide for me?

@peterzhu2118, I'll take another shot at this; marking as Draft in the interim.

@peterzhu2118, I take it back. I don't know what to do with this.

I opened #13965

[DOC] Tweaks for String#dump

7559f7e

BurdetteLamar requested a review from peterzhu2118 July 14, 2025 19:47

BurdetteLamar added the Documentation Improvements to documentation. label Jul 14, 2025

nobu reviewed Jul 15, 2025

[DOC] Tweaks for String#dump

3816f4f

BurdetteLamar requested a review from nobu July 16, 2025 16:41

peterzhu2118 reviewed Jul 18, 2025

[DOC] Tweaks for String#dump

d52854b

BurdetteLamar requested a review from peterzhu2118 July 19, 2025 12:36

BurdetteLamar marked this pull request as draft July 21, 2025 14:09

BurdetteLamar marked this pull request as ready for review July 21, 2025 14:39

BurdetteLamar mentioned this pull request Jul 21, 2025

[DOC] Docs for String#dump #13965

Merged

BurdetteLamar closed this by deleting the head repository Jul 22, 2025
