Training framework with trajectory capture, data synthesis, evolution pipelines, and VeRL integration.

Installation

```bash
pip install "orbiter-train @ git+https://github.com/Midsphere-AI/orbiter-ai.git#subdirectory=packages/orbiter-train"

# For VeRL integration:
pip install "orbiter-train[verl] @ git+https://github.com/Midsphere-AI/orbiter-ai.git#subdirectory=packages/orbiter-train"
```
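
To confirm that the install succeeded, the standard library's `importlib.metadata` can report the installed version. The sketch below is not part of orbiter-train: the helper name `installed_version` is hypothetical, and it assumes the distribution is registered under the name `orbiter-train` used in the commands above.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str = "orbiter-train") -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        # The distribution is not installed in the current environment.
        return None


if __name__ == "__main__":
    found = installed_version()
    print(f"orbiter-train: {found or 'not installed'}")
```

Returning `None` rather than raising keeps the check usable in setup scripts that decide whether to run the `pip install` step.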

Module path

```python
import orbiter.train
```

Package exports

| Export | Module | Description |
| --- | --- | --- |
| `TrajectoryItem` | `orbiter.train.trajectory` | A single step in an agent execution trajectory |
| `TrajectoryDataset` | `orbiter.train.trajectory` | Dataset of trajectory items with capture and export |
| `TrajectoryStrategy` | `orbiter.train.trajectory` | ABC for building trajectory items from messages |
| `DefaultStrategy` | `orbiter.train.trajectory` | Default strategy extracting input/output/tool_calls |
| `TrajectoryError` | `orbiter.train.trajectory` | Error during trajectory operations |
| `Trainer` | `orbiter.train.trainer` | Abstract base class for agent training |
| `TrainConfig` | `orbiter.train.trainer` | Minimal training configuration |
| `TrainerState` | `orbiter.train.trainer` | Trainer lifecycle state enum |
| `TrainMetrics` | `orbiter.train.trainer` | Metrics produced by training or evaluation |
| `TrainerError` | `orbiter.train.trainer` | Error during training operations |
| `DataSynthesiser` | `orbiter.train.synthesis` | ABC for custom data synthesisers |
| `TemplateSynthesiser` | `orbiter.train.synthesis` | Generate items via template transforms |
| `SynthesisPipeline` | `orbiter.train.synthesis` | Orchestrates data synthesis from trajectory items |
| `SynthesisConfig` | `orbiter.train.synthesis` | Configuration for a synthesis pipeline run |
| `SynthesisResult` | `orbiter.train.synthesis` | Output of a synthesis pipeline run |
| `SynthesisStrategy` | `orbiter.train.synthesis` | Strategy enum (llm, template, augment) |
| `SynthesisError` | `orbiter.train.synthesis` | Error during data synthesis |
| `augment_swap_io` | `orbiter.train.synthesis` | Augment by swapping input/output |
| `augment_add_noise` | `orbiter.train.synthesis` | Augment by adding noise to input |
| `filter_by_score` | `orbiter.train.synthesis` | Filter items by minimum score |
| `deduplicate` | `orbiter.train.synthesis` | Remove duplicate items |
| `split_dataset` | `orbiter.train.synthesis` | Split items into train/test sets |
| `EvolutionPipeline` | `orbiter.train.evolution` | Multi-epoch evolution pipeline |
| `EvolutionConfig` | `orbiter.train.evolution` | Configuration for an evolution run |
| `EvolutionStrategy` | `orbiter.train.evolution` | ABC for evolution phase strategies |
| `EvolutionState` | `orbiter.train.evolution` | Pipeline state enum |
| `EvolutionPhase` | `orbiter.train.evolution` | Evolution epoch phases enum |
| `EpochResult` | `orbiter.train.evolution` | Metrics for a single evolution epoch |
| `EvolutionResult` | `orbiter.train.evolution` | Aggregate result of a full evolution run |
| `EvolutionError` | `orbiter.train.evolution` | Error during evolution operations |
| `VeRLTrainer` | `orbiter.train.verl` | Concrete trainer integrating with VeRL |
| `VeRLConfig` | `orbiter.train.verl` | VeRL-specific training configuration |
| `VeRLAlgorithm` | `orbiter.train.verl` | Supported VeRL RL algorithms enum |
| `RewardSpec` | `orbiter.train.verl` | Descriptor for a reward function |
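
Because each export in the table lives in exactly one submodule, a name can be resolved lazily, which may be handy when the optional VeRL extra is not installed. The following is a minimal sketch and not part of the library: `EXPORT_MODULES` copies a few rows from the table, `resolve_export` is a hypothetical helper, and it assumes that a missing package or optional dependency surfaces as `ImportError`.

```python
import importlib
from typing import Any, Mapping, Optional

# A few rows copied from the exports table (export name -> defining submodule).
EXPORT_MODULES = {
    "TrajectoryDataset": "orbiter.train.trajectory",
    "Trainer": "orbiter.train.trainer",
    "SynthesisPipeline": "orbiter.train.synthesis",
    "EvolutionPipeline": "orbiter.train.evolution",
    "VeRLTrainer": "orbiter.train.verl",
}


def resolve_export(
    name: str, modules: Mapping[str, str] = EXPORT_MODULES
) -> Optional[Any]:
    """Import the submodule that defines `name` and return the attribute.

    Returns None when the submodule cannot be imported or lacks the attribute.
    Raises KeyError when `name` is not listed in `modules`.
    """
    module_path = modules[name]
    try:
        module = importlib.import_module(module_path)
    except ImportError:
        # Assumption: an absent package or optional extra raises ImportError.
        return None
    return getattr(module, name, None)


if __name__ == "__main__":
    for export in EXPORT_MODULES:
        status = "available" if resolve_export(export) is not None else "unavailable"
        print(f"{export}: {status}")
```

Run as a script, this prints one availability line per listed export, so it works as a quick check of which parts of the package are usable in the current environment.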

Submodules