OptPilot¶

OptPilot is a lightweight orchestration layer for iterative optimization studies. It connects a user-owned optimization method to a user-owned evaluation environment, runs candidate solutions, records objective results, and keeps the evidence needed to inspect, compare, or reproduce a study.

OptPilot does not try to become your simulator, dataset evaluator, LLM agent, Bayesian optimizer, RL trainer, or metaheuristic. Those pieces stay in your code. OptPilot provides the contract and runtime around them:

what a method must return
how the environment evaluates it
how each trial workspace is prepared
how metrics, records, output files, and provenance are stored
how compatible environments and methods are discovered and launched

What You Build¶

Most OptPilot integrations have three authored YAML files:

File role	Main question
`config: environment`	What can be evaluated, what candidate format is valid, and how are metrics returned?
`config: method`	How are candidates proposed, and which environment contracts can the method target?
`config: study`	Which environment and method should run together, with which objective, budget, and runtime?

Environment and method configs are reusable components. Study configs are concrete run plans.

The boundary between environment and method is the candidate contract. That contract is the first thing to understand when adding a new integration.

Who Owns What?¶

Piece	Owned by	What it does
Environment	You	Evaluates one candidate and returns metrics.
Method	You	Proposes candidates from the current study state and evidence.
Study	You	Binds one environment to one method for a run.
Runner	OptPilot	Validates, materializes, evaluates, and records each trial.
Evidence store	OptPilot	Stores what happened: observations, artifacts, method calls, events, and summaries.

Core Loop¶

Every OptPilot run follows the same loop:

method proposes candidate
runner validates and materializes candidate
environment evaluates materialized candidate
runner records evidence

That loop supports parameter search, file/code evolution, simulator control, metaheuristics, Bayesian optimization, LLM agents, LLM-assisted methods, and coarse-grained wrappers around existing search repositories.

flowchart LR
  subgraph Configs["Public YAML configs"]
    Env["EnvironmentConfig\ncandidate contract + evaluator"]
    Method["MethodConfig\nentrypoint + compatibility"]
    Study["StudyConfig\nobjective + budget + runtime"]
  end

  subgraph Runtime["Run time"]
    Runner["OptPilot runner"]
    MethodCode["User method"]
    Candidate["Candidate\nparameters | files | opaque"]
    Trial["Trial workspace"]
    Eval["User evaluator"]
    Evidence["Evidence store"]
  end

  Study --> Env
  Study --> Method
  Study --> Runner
  Env --> Runner
  Method --> Runner
  Runner --> MethodCode
  MethodCode --> Candidate
  Candidate --> Trial
  Trial --> Eval
  Eval --> Evidence
  Evidence --> MethodCode

Candidate Contracts Are The Spine¶

An environment declares a candidate contract:

Environment candidate-contract fragment:

candidate:
  format: parameters
  parameters:
    schema:
      x:
        valueType: float

A method declares what it can target:

Method compatibility fragment:

accepts:
  formats: [parameters]
  requires:
    context: [candidate.parameters.schema]

Some methods are schema-general: their code first reads the environment's schema, then decides what candidate to return. For example, if one environment asks for {x, mode} and another asks for {learning_rate, batch_size}, the same method can read the schema and fill in either set of fields. A random sampler, Bayesian optimizer, or LLM parameter proposer can be written this way.

Other methods are specific: their code is written to return one known candidate shape. For example, a route solver might always return {route: [...]} and a schedule solver might always return {solutions: ...}. These methods can declare produces, which means "this is the candidate shape my method promises to return." OptPilot then checks that promised shape against the environment's candidate contract before the run starts.

See Candidate Contracts for the full model and examples.

Ways To Use OptPilot¶

Use the CLI when you want a simple validate/run loop:

uv run optpilot validate examples/studies/job_shop_rule_parameters_baseline.yaml
uv run optpilot run examples/studies/job_shop_rule_parameters_baseline.yaml

Use OptPilot Studio when you want a local GUI for browsing reusable components, opening workspaces, launching studies, inspecting run evidence, and asking the assistant for help:

uv run optpilot ui --open-browser

Studio scans examples/ and user_catalog/ by default. It also has an assistant-enabled mode backed by OpenHands and per-workspace Code Server containers; see UI for setup.

Start Here¶

Run the first example with Getting Started.
Read Candidate Contracts for the environment/method boundary.
Read Concepts for the vocabulary.
Read How A Run Works and Evidence when you want the runtime model.
Use Examples and Job-Shop Environment to choose a method track.
Use User Catalog and Configuration when you start writing your own integrations.

For personal or team use, put reusable environments, methods, and resources under user_catalog/. Keep study YAML files where you draft or launch them; they are run plans rather than reusable catalog entries.