[DOC] Tweaks for String#dump #13883

BurdetteLamar · 2025-07-14T19:47:17Z

No description provided.

nobu · 2025-07-15T09:05:48Z

doc/string/dump.rdoc

+  s        # => "\a\b\t\n\v\f\r"
+  s.dump   # => "\"\\a\\b\\t\\n\\v\\f\\r\""
+
+Multi-byte characters are rendered in unicode notation:


It is only for Unicode encodings.

Does this mean that there are multi-byte characters that are not in Unicode encodings? If so, I'll need examples.

BurdetteLamar · 2025-07-16T15:22:31Z

@peterzhu2118, I'll need help with this:

I think I need examples of multi-byte characters that will not dump in Unicode notation.
Old doc notwithstanding, I found to character than dumps in hexadecimal notation. Are there any?

peterzhu2118 · 2025-07-16T15:34:03Z

I think I need examples of multi-byte characters that will not dump in Unicode notation.

For example:

'тест'.dump # => "\"\\u0442\\u0435\\u0441\\u0442\""
'тест'.encode('utf-16le').dump # => "\"B\\x045\\x04A\\x04B\\x04\".dup.force_encoding(\"UTF-16LE\")"

Old doc notwithstanding, I found to character than dumps in hexadecimal notation. Are there any?

Sorry, I don't understand what you mean by this.

BurdetteLamar · 2025-07-16T15:37:20Z

Thanks, @peterzhu2118. What you've written above answers both questions (however poorly they're posed).

peterzhu2118 · 2025-07-18T17:55:26Z

doc/string/dump.rdoc

+  s = 'hello'
+  s.encoding                # => #<Encoding:UTF-8>
+  s.dump                    # => "\"hello\""
+  s.encode('utf-16').dump   # => "\"\\xFE\\xFF\\x00h\\x00e\\x00l\\x00l\\x00o\".dup.force_encoding(\"UTF-16\")"
+  s.encode('utf-16le').dump # => "\"h\\x00e\\x00l\\x00l\\x00o\\x00\".dup.force_encoding(\"UTF-16LE\")"
+
+  s = 'тест'
+  s.encoding                # => #<Encoding:UTF-8>
+  s.dump                    # => "\"\\u0442\\u0435\\u0441\\u0442\""
+  s.encode('utf-16').dump   # => "\"\\xFE\\xFF\\x04B\\x045\\x04A\\x04B\".dup.force_encoding(\"UTF-16\")"
+  s.encode('utf-16le').dump # => "\"B\\x045\\x04A\\x04B\\x04\".dup.force_encoding(\"UTF-16LE\")"
+
+  s = 'こんにちは'
+  s.encoding                # => #<Encoding:UTF-8>
+  s.dump                    # => "\"\\u3053\\u3093\\u306B\\u3061\\u306F\""
+  s.encode('utf-16').dump   # => "\"\\xFE\\xFF0S0\\x930k0a0o\".dup.force_encoding(\"UTF-16\")"
+  s.encode('utf-16le').dump # => "\"S0\\x930k0a0o0\".dup.force_encoding(\"UTF-16LE\")"


I think it would be better to move the examples of non-UTF8 encodings to a separate section with some text describing it (e.g. using hexadecimal format and adding dup.force_encoding(<encoding name>). This is because non-UTF8 is more of an edge case rather than a commonly used case.

I've moved the cited lines to the end. I think you want other changes, but I'm not sure what exactly is needed. Can you fix up one, as a guide for me?

[DOC] Tweaks for String#dump

7559f7e

BurdetteLamar requested a review from peterzhu2118 July 14, 2025 19:47

BurdetteLamar added the Documentation Improvements to documentation. label Jul 14, 2025

nobu reviewed Jul 15, 2025

View reviewed changes

[DOC] Tweaks for String#dump

3816f4f

BurdetteLamar requested a review from nobu July 16, 2025 16:41

peterzhu2118 reviewed Jul 18, 2025

View reviewed changes

[DOC] Tweaks for String#dump

d52854b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[DOC] Tweaks for String#dump #13883

[DOC] Tweaks for String#dump #13883

BurdetteLamar commented Jul 14, 2025

Uh oh!

nobu Jul 15, 2025

Uh oh!

BurdetteLamar Jul 15, 2025

Uh oh!

BurdetteLamar Jul 16, 2025

Uh oh!

BurdetteLamar commented Jul 16, 2025

Uh oh!

peterzhu2118 commented Jul 16, 2025

Uh oh!

BurdetteLamar commented Jul 16, 2025

Uh oh!

peterzhu2118 Jul 18, 2025

Uh oh!

BurdetteLamar Jul 18, 2025

Uh oh!

Uh oh!

[DOC] Tweaks for String#dump #13883

Are you sure you want to change the base?

[DOC] Tweaks for String#dump #13883

Conversation

BurdetteLamar commented Jul 14, 2025

Uh oh!

nobu Jul 15, 2025

Choose a reason for hiding this comment

Uh oh!

BurdetteLamar Jul 15, 2025

Choose a reason for hiding this comment

Uh oh!

BurdetteLamar Jul 16, 2025

Choose a reason for hiding this comment

Uh oh!

BurdetteLamar commented Jul 16, 2025

Uh oh!

peterzhu2118 commented Jul 16, 2025

Uh oh!

BurdetteLamar commented Jul 16, 2025

Uh oh!

peterzhu2118 Jul 18, 2025

Choose a reason for hiding this comment

Uh oh!

BurdetteLamar Jul 18, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!