untokenize() does not round-trip for code containing line breaks (`\` + `\n`)

# Bug report

### Bug description:

Code which contains line breaks is not round-trip invariant:
```python
import tokenize, io

source_code = r"""
1 + \
    2
"""

tokens = list(tokenize.generate_tokens(io.StringIO(source_code).readline))
x = tokenize.untokenize(tokens)
print(x)
# 1 +\
#     2
```

Notice that the space between `+` and `\` is now missing. The current tokenizer code simply inserts a backslash when it encounters two subsequent tokens with a differeing row offset:

https://github.com/python/cpython/blob/9c2bb7d551a695f35db953a671a2ddca89426bef/Lib/tokenize.py#L179-L182

I think this should be fixed. The docstring of `tokenize.untokenize` says:
> Round-trip invariant for full input:
        Untokenized source will match input source exactly

To fix this, it will probably be necessary to inspect the raw line contents and count how much whitespace there is at the end of the line.

### CPython versions tested on:

CPython main branch

### Operating systems tested on:

Linux


### Linked PRs
* gh-126010
* gh-129153
* gh-130579

	row_offset = row - self.prev_row
	if row_offset:
	self.tokens.append("\\\n" * row_offset)
	self.prev_col = 0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

untokenize() does not round-trip for code containing line breaks (`\` + `\n`) #125553

Bug report

Bug description:

CPython versions tested on:

Operating systems tested on:

Linked PRs

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

untokenize() does not round-trip for code containing line breaks (\ + \n) #125553

Description

Bug report

Bug description:

CPython versions tested on:

Operating systems tested on:

Linked PRs

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions

untokenize() does not round-trip for code containing line breaks (`\` + `\n`) #125553