Skip to content

Commit 5449d5b

Browse files
committed
Fix infinite wait when reading a partially written WAL record
If a crash occurs while writing a WAL record that spans multiple pages, the recovery process marks the page with the XLP_FIRST_IS_OVERWRITE_CONTRECORD flag. However, logical decoding currently attempts to read the full WAL record based on its expected size before checking this flag, which can lead to an infinite wait if the remaining data is never written (e.g., no activity after crash). This patch updates the logic first to read the page header and check for the XLP_FIRST_IS_OVERWRITE_CONTRECORD flag before attempting to reconstruct the full WAL record. If the flag is set, decoding correctly identifies the record as incomplete and avoids waiting for WAL data that will never arrive. Discussion: https://postgr.es/m/CAAKRu_ZCOzQpEumLFgG_%2Biw3FTa%2BhJ4SRpxzaQBYxxM_ZAzWcA%40mail.gmail.com Discussion: https://postgr.es/m/CALDaNm34m36PDHzsU_GdcNXU0gLTfFY5rzh9GSQv%3Dw6B%2BQVNRQ%40mail.gmail.com Author: Vignesh C <vignesh21@gmail.com> Reviewed-by: Hayato Kuroda <kuroda.hayato@fujitsu.com> Reviewed-by: Dilip Kumar <dilipbalaut@gmail.com> Reviewed-by: Michael Paquier <michael@paquier.xyz> Reviewed-by: Alexander Korotkov <aekorotkov@gmail.com> Backpatch-through: 13
1 parent 27c7c11 commit 5449d5b

File tree

1 file changed

+15
-5
lines changed

1 file changed

+15
-5
lines changed

src/backend/access/transam/xlogreader.c

Lines changed: 15 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -723,11 +723,12 @@ XLogDecodeNextRecord(XLogReaderState *state, bool nonblocking)
723723
/* Calculate pointer to beginning of next page */
724724
targetPagePtr += XLOG_BLCKSZ;
725725

726-
/* Wait for the next page to become available */
727-
readOff = ReadPageInternal(state, targetPagePtr,
728-
Min(total_len - gotlen + SizeOfXLogShortPHD,
729-
XLOG_BLCKSZ));
730-
726+
/*
727+
* Read the page header before processing the record data, so we
728+
* can handle the case where the previous record ended as being a
729+
* partial one.
730+
*/
731+
readOff = ReadPageInternal(state, targetPagePtr, SizeOfXLogShortPHD);
731732
if (readOff == XLREAD_WOULDBLOCK)
732733
return XLREAD_WOULDBLOCK;
733734
else if (readOff < 0)
@@ -776,6 +777,15 @@ XLogDecodeNextRecord(XLogReaderState *state, bool nonblocking)
776777
goto err;
777778
}
778779

780+
/* Wait for the next page to become available */
781+
readOff = ReadPageInternal(state, targetPagePtr,
782+
Min(total_len - gotlen + SizeOfXLogShortPHD,
783+
XLOG_BLCKSZ));
784+
if (readOff == XLREAD_WOULDBLOCK)
785+
return XLREAD_WOULDBLOCK;
786+
else if (readOff < 0)
787+
goto err;
788+
779789
/* Append the continuation from this page to the buffer */
780790
pageHeaderSize = XLogPageHeaderSize(pageHeader);
781791

0 commit comments

Comments
 (0)