pretext: vectorize field on pipeline + PUT /pipelines/{id}/vectorize #151
Open
beckyconning wants to merge 1 commit into
vectorization-spec-v4 / LD13 + LD15:
- pipeline schema gains `vectorize: boolean` (nullable, optional). When
true, loads of datasets in the pipeline produce 384-dim IBM Granite-30M
embeddings landed in a sibling `PRETEXT` table on the destination.
Absent or false preserves pre-feature behaviour byte-identically.
- New endpoint PUT /pipelines/{id}/vectorize for runtime opt-in/out at
pipeline scope (mirrors the pii-config pattern).
- POST /pipelines and the new PUT both return 400 when vectorize=true is
set against a destination that does not support the Snowflake VECTOR
column type (LD15 — Snowflake-family destinations only).
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
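For reviewers, the nullable field and the 400 rule described in the bullets above can be sketched roughly as follows. This is only an illustration, not the code in this PR: `parse_vectorize`, `vectorize_status`, `VECTOR_CAPABLE_KINDS`, the destination kind strings, and the 200 success status are all hypothetical names and values chosen for the example.

```python
from typing import Optional

# Hypothetical: destination kinds that support the Snowflake VECTOR column type.
# The PR only says "Snowflake-family destinations"; this set is a placeholder.
VECTOR_CAPABLE_KINDS = frozenset({"snowflake"})


def parse_vectorize(body: dict) -> Optional[bool]:
    """Read the optional, nullable `vectorize` field from a request body."""
    value = body.get("vectorize")  # absent and explicit null both yield None
    if value is None or isinstance(value, bool):
        return value
    raise ValueError("vectorize must be a boolean or null")


def vectorize_status(vectorize: Optional[bool], destination_kind: str) -> int:
    """Return the HTTP status the validation rule would produce.

    Only vectorize=True against a destination without VECTOR support is
    rejected; None and False pass through unchanged. The 200 success
    status is an assumption made for this sketch.
    """
    if vectorize is True and destination_kind not in VECTOR_CAPABLE_KINDS:
        return 400
    return 200
```

Under these assumptions, `vectorize_status(parse_vectorize({"vectorize": True}), "postgres")` gives 400, while an absent or false field is accepted for any destination, which matches the "absent or false preserves pre-feature behaviour" bullet.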
Summary
vectorization-spec-v4 / LD13 + LD15 API surface:
- `vectorize: boolean` (nullable, optional). When true, loads of datasets in the pipeline produce 384-dim embeddings landed in a sibling `PRETEXT` table on the destination. Absent or false preserves pre-feature behaviour byte-identically.
- `PUT /pipelines/{id}/vectorize` for runtime opt-in/out at pipeline scope (mirrors the `pii-config` pattern).
- `POST /pipelines` and the new `PUT` both return 400 when `vectorize=true` is set against a destination that does not support the Snowflake `VECTOR` column type (LD15: Snowflake-family destinations only).
Companion PRs
- `SnowflakeKind.parseConfigValues`
Verification
End-to-end verified on staging via new pipeline `customer service vectorize` (Intercom Conversations source, `vectorize: true`):
Merge-readiness summary: Linear PRD-212.
Test plan
- `vectorize` field is correctly typed as nullable boolean.
- `PUT /pipelines/{id}/vectorize` request/response shapes match what the implementation in precog/services#3478 ships.
🤖 Generated with Claude Code