bluebird_gymnasium documentation
A suite of gymnasium API compliant environments for air traffic control (ATC). The environments are based on the digital twin simulator, bluebird-dt, serving as a wrapper around it. The environments support research in learning-based agents (e.g., reinforcement learning and imitation learning) for ATC. It supports either single agent (aka centralized) or multi-agent (aka decentralized) agent set up.
In addition to the environment specification, the package also contains a set of state spaces (encodings), reward functions, and discrete action spaces definitions that are configurable, thus enabling users to set up custom problems according to their defined specifications. Furthermore, users develop new reward functions. See Rewards for more information.
Installation
bluebird-gymnasium is available on pypi, therefore it can be installed using
or, if using UV, you can add it to your environment using
Simple usage
To instantiate an environment in the suite, for example, the X sector environment, enter the commands below (after the installation and activation of the virtual environment).
For more details, please see the Getting started section below.
Sample agent
Below, an example agent that takes random actions.
import gymnasium as gym
import bluebird_gymnasium
env = gym.make("SectorXEnv-v0")
obs, info = env.reset()
done = False
while not done:
action = env.action_space.sample()
obs, reward, done, truncated, info = env.step(action)
Additional examples
Additional examples are available as rendered jupyter notebooks.