Pipeline
Each validtr run executes this sequence:
- Task analysis
- Stack recommendation
- Container provisioning and execution
- Test generation and test execution
- Scoring
- Retry decision
Retry Behavior
If score is below threshold (default 95), validtr adjusts stack strategy and retries until:
- score reaches threshold, or
max_retriesis reached.
