Skill: backend/observability-audit Skill

Skill: backend/observability-audit

Improve and standardize observability so the system is operable after changes:

This skill is a pass over changed behavior, not feature work.

Improved logging:
- include correlation id
- include primary identifiers (request id, user id where safe, entity id)
- include outcome + error codes
Metrics updates if repo uses them:
- success/fail counters
- latency timing
- queue depth / retries (for jobs)
Tracing spans or propagation fixes if repo uses tracing
Audit entries for protected actions if applicable

Identify repo observability standards (profile/docs).
Add structured logs at critical boundaries:
- request start/end
- job start/end
- external call start/end (adapter boundary)
Ensure errors are observable:
- log error codes/taxonomy, not raw stack spam only
- include retry decisions (retrying vs terminal)
Add metrics if the repo uses them:
- count outcomes
- measure latency
Ensure correlation ids propagate:
- inbound request → domain → adapter → logs
Add audit entries where required:
- authz-protected actions
- fund movement / order placement / credential updates (example categories)
Run validations.

No logging standard → follow conservative structured logging and document assumptions.
High-cardinality metrics risk → avoid unbounded labels.
Sensitive data risk → redact or omit; prefer identifiers over payloads.

Log: