Validate UTF-16 surrogate pairs before combining by jarvis24young · Pull Request #187 · postgresql-interfaces/psqlodbc

jarvis24young · 2026-05-12T03:42:49Z

SQLWCHAR-to-UTF-8 conversion currently treats any UTF-16 high surrogate as the start of a surrogate pair. It then advances to the next code unit and reads it unconditionally.

That can read past the caller-supplied length when a wide-character ODBC API receives a dangling high surrogate at the end of its input. The new regression test exercises this through the public SQLPrepareW() path with a guarded one-code-unit SQLWCHAR buffer, so the old implementation faults deterministically if it reads wstr[1].
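As a rough illustration of what a "guarded one-code-unit buffer" can look like, the sketch below places a single 16-bit code unit immediately before an inaccessible page, so any read of the following element faults. This is an assumption about the technique, not the code in the regression test: the helper name `alloc_guarded_unit` is hypothetical, `uint16_t` stands in for a 2-byte `SQLWCHAR`, and it relies on POSIX `mmap`/`mprotect`.

```c
#define _DEFAULT_SOURCE         /* for MAP_ANONYMOUS on glibc in strict modes */
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <sys/mman.h>
#include <unistd.h>

/*
 * Hypothetical helper: map two pages, make the second one inaccessible,
 * and return a pointer to the last 16-bit slot of the first page.
 * Reading element [1] through the returned pointer touches the guard page.
 * The caller releases the mapping with munmap(*base_out, *map_len_out).
 */
static uint16_t *alloc_guarded_unit(void **base_out, size_t *map_len_out)
{
    long page = sysconf(_SC_PAGESIZE);
    assert(page > 0);

    size_t len = (size_t) page * 2;
    unsigned char *base = mmap(NULL, len, PROT_READ | PROT_WRITE,
                               MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (base == MAP_FAILED)
        return NULL;

    /* Second page becomes the guard: any access raises SIGSEGV. */
    if (mprotect(base + page, (size_t) page, PROT_NONE) != 0)
    {
        munmap(base, len);
        return NULL;
    }

    *base_out = base;
    *map_len_out = len;
    return (uint16_t *) (void *) (base + page - sizeof(uint16_t));
}
```

Storing a lone high surrogate such as 0xD83D in the returned slot and passing it with a length of one code unit gives an input where any access to the second element is an immediate fault rather than a silent over-read.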

Fix this by only taking the surrogate-pair path when:

- the current code unit is a high surrogate,
- there is another code unit within ilen, and
- the next code unit is a low surrogate.

Otherwise the existing non-pair path is used, avoiding the out-of-bounds read.
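The three conditions can be sketched as a small standalone decoder step. This is a minimal illustration, not the patch itself: the names `is_high_surrogate`, `is_low_surrogate`, and `decode_utf16_unit` are hypothetical, `uint16_t` stands in for `SQLWCHAR`, and the non-pair branch here simply passes the code unit through, which may differ from what the driver's existing non-pair path does.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helpers; not taken from win_unicode.c. */
static int is_high_surrogate(uint16_t u) { return (u & 0xFC00u) == 0xD800u; }
static int is_low_surrogate(uint16_t u)  { return (u & 0xFC00u) == 0xDC00u; }

/*
 * Decode one code point starting at wstr[i], where ilen is the number of
 * valid code units. Stores the result in *cp and returns the number of
 * code units consumed (1 or 2).
 */
static size_t decode_utf16_unit(const uint16_t *wstr, size_t i, size_t ilen,
                                uint32_t *cp)
{
    assert(wstr != NULL && i < ilen);

    /* The bounds check comes before the read of wstr[i + 1]. */
    if (is_high_surrogate(wstr[i]) &&
        i + 1 < ilen &&
        is_low_surrogate(wstr[i + 1]))
    {
        *cp = 0x10000u
            + ((uint32_t) (wstr[i] & 0x3FFu) << 10)
            + (uint32_t) (wstr[i + 1] & 0x3FFu);
        return 2;
    }

    /* Non-pair path (assumed behavior): treat it as a single code unit. */
    *cp = wstr[i];
    return 1;
}
```

Because `&&` short-circuits, `wstr[i + 1]` is only read once `i + 1 < ilen` has been established, which is the ordering that keeps a dangling high surrogate at the end of the input from triggering an out-of-bounds read.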

Reproduction on the old implementation, using the same black-box test with ASan and a guarded buffer:

ERROR: AddressSanitizer: SEGV on unknown address
The signal is caused by a READ memory access.
#0 ucs2_to_utf8 win_unicode.c:191
#1 SQLPrepareW odbcapiw.c:439
#2 SQLPrepareW libodbc.so.2
#3 main test/src/surrogate-pair-test.c:109

Tested after the fix:

cd ~/psqlodbc-surrogate-oob-build/test
ODBCSYSINI=. ODBCINSTINI=./odbcinst.ini ODBCINI=./odbc.ini ./runsuite surrogate-pair --inputdir=.
TAP version 13
1..1
ok 1 - surrogate-pair

Also tested the target binary directly under ASan/UBSan with detect_leaks=0; it returns normally.

Validate UTF-16 surrogate pairs before combining

2135193

jarvis24young wants to merge 1 commit into postgresql-interfaces:main from jarvis24young:fix-surrogate-pair-boundary
