fix(policy): handle OSError from is_file() on macOS PATH_MAX limit by chaobo8484 · Pull Request #860 · microsoft/apm

chaobo8484 · 2026-04-23T01:54:55Z

When load_policy() receives a YAML string that exceeds ~1023 bytes, the is_file() syscall fails on macOS (PATH_MAX ≈ 1024) with OSError [Errno 63] File name too long. Wrap the call in try/except to gracefully fall back to treating the input as a YAML string.

Fixes #848

Description

Brief description of changes and motivation.

Fixes # (issue)

Type of change

Bug fix
New feature
Documentation
Maintenance / refactor

Testing

Tested locally
All existing tests pass
Added tests for new functionality (if applicable)

When load_policy() receives a YAML string that exceeds ~1023 bytes, the is_file() syscall fails on macOS (PATH_MAX ≈ 1024) with OSError [Errno 63] File name too long. Wrap the call in try/except to gracefully fall back to treating the input as a YAML string. Fixes microsoft#848

Copilot

Pull request overview

Fixes load_policy() to avoid crashing on macOS when given a long YAML string that triggers Path.is_file() to raise OSError: [Errno 63] File name too long, preserving the intended "path or YAML content" input behavior.

Changes:

Wraps the Path.is_file() probe in try/except OSError and falls back to treating the input as YAML content when the probe fails.

Show a summary per file

File	Description
`src/apm_cli/policy/parser.py`	Adds defensive handling around `Path.is_file()` so long YAML inputs do not raise on macOS PATH_MAX limits.

Copilot's findings

Files reviewed: 1/1 changed files
Comments generated: 2

danielmeppiel

Per Copilot reviewer

* docs(ci): document merge-gate architecture and ruleset context gotcha - Document merge-gate.yml as the single-authority PR-time aggregator - Mark ci-integration-pr-stub.yml as DEPRECATED (slated for deletion) - Renumber workflow list (now 6 entries, was misnumbered with two #3s) - New section: Branch Protection & Required Checks - Ruleset 'context' field MUST match check-run name ('gate'), not the UI display string ('Merge Gate / gate'). Storing the display string causes permanent 'Expected - Waiting for status to be reported' that blocked PR #860 today - Adding new required checks goes through EXPECTED_CHECKS in merge-gate.yml, not the ruleset Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> * chore(ci): remove deprecated PR-time stub workflow The four PR stubs (Build/Smoke/Integration/Release Validation - Linux) were a holdover from the pre-merge-gate model where branch protection required each Tier 2 check name directly. After #867, branch protection requires only the single 'gate' check from merge-gate.yml, so the stubs are dead weight that fire on every PR for no reason. Changes: - Delete .github/workflows/ci-integration-pr-stub.yml - Reduce EXPECTED_CHECKS in merge-gate.yml to just 'Build & Test (Linux)' (the only PR-time check we still emit) - Update merge-gate.yml + ci-integration.yml header comments - Update cicd.instructions.md (drop DEPRECATED entry, renumber to 5 workflows) - Drop stale CODEOWNERS reference to the deleted file - CHANGELOG entry under [Unreleased] > Removed Stacked on #874 (docs). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

chaobo8484 · 2026-04-23T16:09:21Z

@danielmeppiel I have made adjustments according to Copilot's suggestion.Have a good day！

github-actions · 2026-04-23T19:50:01Z

DevX UX Review — PR #860

Scope: CLI error ergonomics & silent-failure risk. No command surface / flag / help-text changes in this PR — review is limited to internal load_policy() behaviour and its downstream user-visible effects.

Findings

✅ Error recovery is solid. When ENAMETOOLONG fires, the code falls back to treating the input as YAML. If that YAML is malformed, yaml.safe_load raises YAMLError, which is wrapped into a PolicyValidationError with a human-readable message. The user still gets a clear "YAML parse error: ..." diagnostic — no silent swallowing of bad input.
✅ _looks_like_yaml_content() is a reasonable fast-path. The heuristics (newlines, ---, - , {, [, : ) match what every YAML-literate developer would expect. This avoids the filesystem probe entirely for the vast majority of inline YAML, which is the right default. No UX concern here.
⚠️ Minor: a valid-looking file path that doesn't exist is silently parsed as YAML. This is pre-existing behaviour, not introduced by this PR, but worth noting: load_policy("apm-policy.yml") where the file is missing will try to YAML-parse the string "apm-policy.yml" and return {name: apm-policy.yml} (bare YAML scalar coerced to a mapping? — actually it would hit "Policy must be a YAML mapping"). So the failure mode is noisy enough. No new risk from this PR.
✅ No user-facing output changes. apm audit --policy <path> UX is unaffected — callers in discovery.py already pass either a Path object or pre-read content strings. The fix only matters when raw YAML strings happen to exceed PATH_MAX, which is a programmatic edge case (not a CLI-surface issue).
✅ Test coverage is appropriate. The new test constructs a >1 KiB YAML payload and asserts it parses without OSError. Good enough for a defensive fix — no over-engineering.

Verdict: Ship it. Clean defensive fix, no UX regressions, no silent failures introduced. 👍

Generated by PR Review Panel for issue #860 · ● 4M · ◷

github-actions · 2026-04-24T19:17:58Z

APM Review Panel Verdict

Disposition: APPROVE (with one minor pre-merge fix: CHANGELOG entry)

Per-persona findings

Python Architect: Routine bug-fix PR. One helper extracted (_looks_like_yaml_content), one function refactored (load_policy). Two mermaid diagrams follow.

OO / class diagram

classDiagram
    class ApmPolicy {
        <<dataclass>>
        +name: str
        +version: str
        +enforcement: str
    }
    class load_policy:::touched {
        <<function>>
        +source: Union[str, Path]
        +returns: Tuple[ApmPolicy, List[str]]
    }
    class _looks_like_yaml_content:::touched {
        <<function>>
        +source: str
        +returns: bool
    }
    class _build_policy {
        <<function>>
        +data: dict
        +returns: ApmPolicy
    }
    load_policy:::touched ..> _looks_like_yaml_content:::touched : calls (new guard)
    load_policy:::touched ..> _build_policy : calls
    _build_policy ..> ApmPolicy : creates
    note for _looks_like_yaml_content "Guard: avoids Path.is_file() on large\nstrings (macOS ENAMETOOLONG fix)"
    note for load_policy "Fixed: OSError ENAMETOOLONG caught\nand treated as is_file=False"

Execution-flow diagram

flowchart TD
    A["load_policy(source)"] --> B{"isinstance(source, Path)?"}
    B -- Yes --> C["source.read_text() if is_file else str(source)"]
    B -- No --> D{"_looks_like_yaml_content(source)?<br/>(newline, YAML marker, key: val)"}
    D -- Yes --> E["raw = source (skip filesystem probe)"]
    D -- No --> F["path = Path(source)"]
    F --> G["try: path.is_file()"]
    G -- "OSError ENAMETOOLONG" --> H["is_file = False"]
    G -- "other OSError" --> I["re-raise"]
    G -- success --> J{"is_file?"}
    J -- Yes --> K["raw = path.read_text()"]
    J -- No --> L["raw = source"]
    H --> L
    E --> M["yaml.safe_load(raw)"]
    K --> M
    L --> M
    M --> N["_build_policy(data)"]
    N --> O["return ApmPolicy, warnings"]

Design patterns: Guard clause (_looks_like_yaml_content short-circuits before filesystem IO), defensive programming (specific errno.ENAMETOOLONG catch, re-raise on all others), single-responsibility extraction (heuristic is its own named, independently testable function). All patterns are correct and proportionate to the fix scope.

Coverage gap (not a blocker): The test exercises only the _looks_like_yaml_content() fast path. The yaml_str contains \n (the trailing newline of long_comment), so "\n" in source returns True and Path.is_file() is never called. The errno.ENAMETOOLONG catch branch has no explicit coverage.

CLI Logging Expert: No output, logging, CommandLogger, DiagnosticCollector, or STATUS_SYMBOLS changes. Non-ENAMETOOLONG OSError exceptions are re-raised, preserving the existing failure signal path. No concerns.

DevX UX Expert: No CLI command surface, flag, help text, or user-facing error message changes. The fix is transparent to users (silent crash eliminated on macOS). No documentation update required -- no command surface changed. No UX regression.

Supply Chain Security Expert: _looks_like_yaml_content() determines whether a string is treated as a file path or inline YAML. Misclassifying a file path as YAML prevents a file read rather than enabling one -- security-neutral at worst. yaml.safe_load() is unchanged; no arbitrary-object deserialization risk. No new path traversal surface (no new Path() construction from user-controlled strings). Fixing the crash is security-positive: an unhandled ENAMETOOLONG could cause policy loading to abort entirely, potentially triggering fail-open behavior upstream. No concerns.

Auth Expert: Not activated -- changed files (src/apm_cli/policy/parser.py, tests/unit/policy/test_parser.py) are in the policy parsing subsystem with no auth, token, credential, host-classification, or AuthResolver semantics.

OSS Growth Hacker: macOS is the primary developer platform for APM's target audience. A crash when loading large inline policy strings is a trust regression that disproportionately hits enterprise users (who are more likely to have complex, multi-line policies). The fix is correct and the test is appropriate. One missing item: no CHANGELOG entry. Per repo convention, every code change needs a Fixed line under ## [Unreleased]. Side-channel to CEO: this is a 1-line add that protects the quality signal in release notes and demonstrates active platform parity maintenance to the community.

CEO arbitration

The panel is unanimous and the findings are clean. This is a targeted, non-breaking, correct bug fix for a real macOS platform crash. There are no architectural concerns, no security regressions, no UX changes, and no auth implications. The only gap is the missing CHANGELOG.md entry -- a repo convention requirement, not a nice-to-have -- and an optional (non-blocking) coverage gap for the errno.ENAMETOOLONG catch branch. I ratify APPROVE with one required pre-merge action.

Required actions before merge

Add a Fixed entry to CHANGELOG.md under ## [Unreleased] -> ### Fixed. Suggested line:

- `load_policy()` no longer raises `OSError: [Errno 63] ENAMETOOLONG` on macOS when called with large inline YAML strings longer than PATH_MAX. (#860)

Optional follow-ups

Add an explicit unit test for the errno.ENAMETOOLONG branch by patching Path.is_file to raise OSError(errno.ENAMETOOLONG, "File name too long") -- the current test only exercises the _looks_like_yaml_content() fast path (newline check).
Audit other Path.is_file() / Path.exists() call sites in src/apm_cli/policy/discovery.py for the same macOS ENAMETOOLONG exposure if they can receive user-supplied strings.

Generated by PR Review Panel for issue #860 · ● 510.3K · ◷

chaobo8484 requested review from danielmeppiel and sergio-sisternes-epam as code owners April 23, 2026 01:54

chaobo8484 and others added 3 commits April 23, 2026 15:44

Merge branch 'main' into fix/issue-848-macos-path-max-crash

f9ec121

Merge branch 'main' into fix/issue-848-macos-path-max-crash

c32ac6a

Merge branch 'main' into fix/issue-848-macos-path-max-crash

1670641

danielmeppiel closed this Apr 23, 2026

danielmeppiel reopened this Apr 23, 2026

danielmeppiel requested review from Copilot April 23, 2026 11:17

Copilot started reviewing on behalf of danielmeppiel April 23, 2026 11:18 View session

Copilot AI reviewed Apr 23, 2026

View reviewed changes

Comment thread src/apm_cli/policy/parser.py Outdated

Comment thread src/apm_cli/policy/parser.py Outdated

danielmeppiel requested changes Apr 23, 2026

View reviewed changes

danielmeppiel mentioned this pull request Apr 23, 2026

docs(ci): improve Branch Protection section per review feedback #874

Merged

chaobo8484 and others added 4 commits April 23, 2026 19:48

Merge branch 'main' into fix/issue-848-macos-path-max-crash

9e59697

Merge branch 'main' into fix/issue-848-macos-path-max-crash

d7fc5a7

fix: address review comments

e1ed429

Merge branch 'main' into fix/issue-848-macos-path-max-crash

93ae735

danielmeppiel added the panel-review Trigger the apm-review-panel gh-aw workflow label Apr 23, 2026

danielmeppiel mentioned this pull request Apr 23, 2026

Handle OSError when checking file existence for long paths on macOS #880

Closed

danielmeppiel and others added 3 commits April 23, 2026 21:51

Merge branch 'main' into fix/issue-848-macos-path-max-crash

12ca532

Merge branch 'main' into fix/issue-848-macos-path-max-crash

2b14e5d

Merge branch 'main' into fix/issue-848-macos-path-max-crash

d44dfdb

danielmeppiel added panel-review Trigger the apm-review-panel gh-aw workflow and removed panel-review Trigger the apm-review-panel gh-aw workflow labels Apr 24, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(policy): handle OSError from is_file() on macOS PATH_MAX limit#860

fix(policy): handle OSError from is_file() on macOS PATH_MAX limit#860
chaobo8484 wants to merge 11 commits intomicrosoft:mainfrom
chaobo8484:fix/issue-848-macos-path-max-crash

chaobo8484 commented Apr 23, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

danielmeppiel left a comment

Uh oh!

chaobo8484 commented Apr 23, 2026

Uh oh!

github-actions Bot commented Apr 23, 2026

Uh oh!

github-actions Bot commented Apr 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

chaobo8484 commented Apr 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of change

Testing

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Copilot's findings

Uh oh!

Uh oh!

Uh oh!

danielmeppiel left a comment

Choose a reason for hiding this comment

Uh oh!

chaobo8484 commented Apr 23, 2026

Uh oh!

github-actions Bot commented Apr 23, 2026

DevX UX Review — PR #860

Findings

Uh oh!

github-actions Bot commented Apr 24, 2026

APM Review Panel Verdict

Per-persona findings

CEO arbitration

Required actions before merge

Optional follow-ups

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

chaobo8484 commented Apr 23, 2026 •

edited

Loading