IROS 2022 Safe Robot Learning Competition

For the IROS 2022 Safe Robot Learning Competition, check out branch beta-iros-competition

IROS 2022 Safe Robot Learning Competition

(The image above links to the video of a flight example)

Note: beta release subject to change throughout the month of August 2022; register for updates

Official Webpage
IROS Competition Page
GitHub Discussions
Google Form (to register your interest and receive e-mail updates)

Description

The task is to design a controller/planner that enables a quadrotor (Crazyflie 2.x) to safely fly through a set of gates and reach a predefined target despite uncertainties in the robot dynamics (e.g., mass and inertia) and the environment (e.g., wind and position of the gates). The algorithms will be evaluated regarding their safety (e.g., no collisions) and performance (e.g., time to target). We encourage participants to explore both control and reinforcement learning approaches (e.g., robust, adaptive, predictive, learning-based and optimal control, and model-based/model-free reinforcement learning). The controller/planner has access to the position and attitude measurements provided by a motion capture system and the noisy pose of the closest next gate. The controller can send position, velocity, acceleration and heading references to an onboard position controller.

Install on Ubuntu/macOS

We recommend Ubuntu 20.04 on a mid-tier laptop and GPU (e.g., a Lenovo P52 with i7-8850H/Quadro P2000)

git clone https://github.com/utiasDSL/safe-control-gym.git
cd safe-control-gym
git checkout beta-iros-competition

Create and access a Python 3.8 environment using conda

conda create -n safe python=3.8
conda activate safe

Install the safe-control-gym repository

pip install --upgrade pip
pip install -e .

Install `pycffirmware`

cd ..
git clone https://github.com/utiasDSL/pycffirmware.git
cd pycffirmware/
git submodule update --init --recursive

On Ubuntu

sudo apt update
sudo apt -y install swig
sudo apt install build-essential
cd wrapper/
chmod +x build_linux.sh
conda activate safe
./build_linux.sh

On macOS

Install brew, then

brew install swig
brew install gcc            # Also run `xcode-select --install` if prompted
brew install make
cd wrapper/
chmod +x build_osx.sh       # Assumes `gcc` is at `/usr/local/bin/gcc-12`
conda activate safe
./build_osx.sh

Also see how to install SWIG and pycffirmware's README.md
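
After the build steps above, a quick sanity check is to confirm that Python can locate both packages from inside the `safe` conda environment. The sketch below assumes the import names are `safe_control_gym` and `pycffirmware` (taken from the repository names, not verified here); it only looks the modules up and does not import them.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be located (without importing it)."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # ASSUMPTION: these are the import names of the two installed packages.
    for name in ("safe_control_gym", "pycffirmware"):
        status = "found" if is_installed(name) else "MISSING"
        print(f"{name}: {status}")
```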

Getting Started

Run the scripts in competition/

cd ../../safe-control-gym/competition/
python3 getting_started.py --overrides ./getting_started.yaml

Modify the file edit_this.py to customize your controller, which is based on Crazyswarm's Crazyflie interface

Development and Evaluation Scenarios

A complete problem is specified by a YAML file, e.g. getting_started.yaml

Proposed solutions will be evaluated in 5 scenarios with different challenges:

| Evaluation Scenario | Constraints | Rand. Inertial Properties | Randomized Obstacles, Gates | Rand. Between Episodes | Notes |
| :---: | :---: | :---: | :---: | :---: | :---: |
| `level0.yaml` | Yes | No | No | No | Perfect knowledge |
| `level1.yaml` | Yes | Yes | No | No | Adaptive |
| `level2.yaml` | Yes | Yes | Yes | No | Learning, re-planning |
| `level3.yaml` | Yes | Yes | Yes | Yes | Robustness |
| sim2real | Yes | Real-life hardware | Yes, injected | No | Sim2real transfer |

"Rand. Between Episodes" (governed by the argument reseed_on_reset) states whether the randomized properties and positions vary across episodes or are kept constant (by re-seeding the random number generator on each env.reset()).

Note 1: the random seed used to score solutions will be picked at the time of the competition

Note 2: if the base scenarios do not produce a unique winner, we will progressively raise the difficulty by, alternately, (i) adding intermediate gates and (ii) increasing the parameters of the random distributions and input/dynamics disturbances by 50% (except in level0).
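
The effect of reseed_on_reset on "Rand. Between Episodes" can be illustrated with a toy environment. This is a minimal sketch, not the competition environment: `ToyGateEnv`, the nominal position, and the error range are all made up for illustration; only the re-seed-on-reset idea comes from the text above.

```python
import random


class ToyGateEnv:
    """Hypothetical stand-in for the real environment (not its actual API)."""

    def __init__(self, seed: int, reseed_on_reset: bool):
        self.seed = seed
        self.reseed_on_reset = reseed_on_reset
        self.rng = random.Random(seed)

    def reset(self) -> float:
        # Re-seeding replays the same random draw on every reset,
        # so the randomized gate position stays constant across episodes.
        if self.reseed_on_reset:
            self.rng = random.Random(self.seed)
        nominal_x = 0.5  # illustrative nominal gate x-position (m)
        return nominal_x + self.rng.uniform(-0.1, 0.1)


fixed = ToyGateEnv(seed=42, reseed_on_reset=True)
varying = ToyGateEnv(seed=42, reseed_on_reset=False)
fixed_draws = [fixed.reset() for _ in range(3)]      # same value three times
varying_draws = [varying.reset() for _ in range(3)]  # a new value each episode
```

With re-seeding, a controller can learn the exact layout over repeated episodes; without it, the controller has to be robust to a new layout every time.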

Implement Your Controller/Solution

Methods to Re-implement in edit_this.py

You can also use/create additional custom modules such as example_custom_utils.py

Required (1 of 2)

edit_this.py : Controller.__init__(initial_obs, initial_info)           # Initialize the controller

    Args:
        initial_obs (ndarray): The initial observation of the quadrotor's state
            [x, x_dot, y, y_dot, z, z_dot, phi, theta, psi, p, q, r].

        initial_info (dict): The a priori problem information as a dictionary with keys

            - 'ctrl_timestep'                   Control time step (in seconds)
            - 'ctrl_freq'                       Control frequency (in Hz)
            - 'episode_len_sec'                 Maximum duration of an episode (in seconds)
            - 'nominal_physical_parameters'     *Nominal* mass and inertia of the quadrotor

            - 'gate_dimensions'                 Shape and measurements of the gates
            - 'obstacle_dimensions'             Shape and measurements of the obstacles
            - 'nominal_gates_pos_and_type'      *Nominal* pose and type (tall, low, etc.) of the gates (NOTE: this is the ordering in which the gates must be traversed)
            - 'nominal_obstacles_pos'           *Nominal* pose of the obstacles (NOTE: these are provided once, through the controller's constructor, and might differ from the exact positions if `randomized_gates_and_obstacles` is True; the exact positions will also change across episodes if `reseed_on_reset` is False)
            - 'x_reference'                     Final position to reach/hover at

            - 'initial_state_randomization'     Distributions of the randomized additive error on the initial pose
            - 'inertial_prop_randomization'     Distributions of the randomized additive error on the inertial properties
            - 'gates_and_obs_randomization'     Distributions of the randomized additive error on the gates and obstacles positions
            - 'disturbances'                    Distributions of the dynamics and input disturbances  

            - 'symbolic_model'                  CasADi's 3D quadrotor dynamics
            - 'symbolic_constraints'            CasADi's constraints

    Returns: N/A
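
As a rough illustration of how a constructor might consume these arguments, the sketch below reads a few of the keys listed above. The class name `SketchController`, the sample values, and the assumed layouts of the gate entries (starting with x, y) and of 'x_reference' (same ordering as the observation) are assumptions made for this example, not the documented formats.

```python
class SketchController:
    """Illustrative constructor only; not the competition's reference code."""

    def __init__(self, initial_obs, initial_info):
        # Timing information, using keys listed in the docstring above.
        self.ctrl_dt = initial_info["ctrl_timestep"]
        self.ctrl_freq = initial_info["ctrl_freq"]
        # Initial position from [x, x_dot, y, y_dot, z, z_dot, ...].
        self.start_pos = (initial_obs[0], initial_obs[2], initial_obs[4])
        # ASSUMPTION: each gate entry starts with its x, y coordinates.
        gates = initial_info["nominal_gates_pos_and_type"]
        self.gate_xy = [(g[0], g[1]) for g in gates]
        # ASSUMPTION: 'x_reference' is ordered like the observation vector.
        ref = initial_info["x_reference"]
        self.goal_pos = (ref[0], ref[2], ref[4])


# Made-up sample inputs, for illustration only.
sample_obs = [-0.9, 0, -2.9, 0, 0.03, 0, 0, 0, 0, 0, 0, 0]
sample_info = {
    "ctrl_timestep": 1 / 30,
    "ctrl_freq": 30,
    "nominal_gates_pos_and_type": [[0.5, -2.5, 0, 0, 0, -1.57, 0]],
    "x_reference": [-0.5, 0, 2.9, 0, 0.75, 0, 0, 0, 0, 0, 0, 0],
}
ctrl = SketchController(sample_obs, sample_info)
```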

Required (2 of 2)

edit_this.py : Controller.cmdFirmware(time, obs, reward, done, info)    # Select the next command for the quadrotor

    Args:
        time (float): Episode's elapsed time, in seconds.
        obs (ndarray): The quadrotor's pose from PyBullet or Vicon
            [x, 0, y, 0, z, 0, phi, theta, psi, 0, 0, 0].

        reward (float, optional): The reward signal.
        done (bool, optional): Whether the episode has terminated.
        info (dict, optional): Current step information as a dictionary with keys

            - 'current_target_gate_id'          ID of the next gate (-1 when all gates have been traveled through)
            - 'current_target_gate_type'        Type of the next gate (0: tall, 1: low)
            - 'current_target_gate_in_range'    Boolean, whether the next gate is close enough (i.e., <= VISIBILITY_RANGE == 0.45m) for perfect visibility (affects the value of the next key 'current_target_gate_pos')
            - 'current_target_gate_pos'         *Nominal* or **exact** position of the next gate (depending on the value of the key above, 'current_target_gate_in_range')
            
            - 'at_goal_position'                Boolean, whether the quadrotor is at the final position ('x_reference')
            - 'task_completed'                  Boolean, whether the quadrotor stayed at the final position ('x_reference') for 2 seconds

            - 'constraint_values'               Constraint evaluations
            - 'constraint_violation'            Boolean, whether any of the constraints is violated
            - 'collision'                       Collision, as a tuple (collided object ID, boolean); when the boolean is False, the ID is None

    Returns:
        Command: selected type of command (FINISHED, NONE, FULLSTATE, TAKEOFF, LAND, STOP, GOTO, NOTIFYSETPOINTSTOP, see Enum-like class `Command`).
        List: arguments for the type of command
            - FINISHED's args: []
            - NONE's args: []
            - FULLSTATE's args: [pos (3 val), vel (3 val), acc (3 val), yaw, rpy_rates (3 val), curr_time] 
            - TAKEOFF's args: [height, duration]
            - LAND's args: [height, duration]
            - STOP's args: []
            - GOTO's args: [x, y, z, yaw, duration, relative (bool)]
            - NOTIFYSETPOINTSTOP's args: []

        Also see: https://github.com/utiasDSL/safe-control-gym/blob/beta-iros-competition/competition/competition_utils.py#L10
        and: https://crazyswarm.readthedocs.io/en/latest/api.html#crazyflie-class
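
The return convention above (a command type plus its argument list) can be sketched as follows. This is a deliberately naive policy under stated assumptions: the `Command` enum here is a local stand-in whose member values are placeholders, `cmd_firmware` is a free function rather than the real method, the gate position is assumed to start with x, y, and returning FINISHED once 'task_completed' is True is a guess at sensible usage.

```python
from enum import Enum, auto


class Command(Enum):
    """Stand-in for the competition's `Command` class; values are placeholders."""
    FINISHED = auto()
    NONE = auto()
    FULLSTATE = auto()
    TAKEOFF = auto()
    LAND = auto()
    STOP = auto()
    GOTO = auto()
    NOTIFYSETPOINTSTOP = auto()


def cmd_firmware(time, obs, info, goal_pos=(0.0, 0.0, 1.0),
                 height=1.0, takeoff_sec=2.0):
    """Return (Command, args); `obs` is unused by this naive sketch."""
    if info.get("task_completed", False):
        return Command.FINISHED, []
    if time == 0.0:
        return Command.TAKEOFF, [height, takeoff_sec]
    if time < takeoff_sec:
        return Command.NONE, []  # let the takeoff finish
    if info.get("current_target_gate_id", -1) == -1:
        target = list(goal_pos)  # all gates passed: head for the goal
    else:
        # ASSUMPTION: the gate position starts with x, y.
        gate = info["current_target_gate_pos"]
        target = [gate[0], gate[1], height]
    # FULLSTATE args: [pos, vel, acc, yaw, rpy_rates, curr_time]
    vel, acc, rpy_rates = [0.0] * 3, [0.0] * 3, [0.0] * 3
    return Command.FULLSTATE, [target, vel, acc, 0.0, rpy_rates, time]
```

A real solution would replace the fixed setpoint with a planned trajectory and re-plan when 'current_target_gate_in_range' reveals the exact gate position.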

Optional, recommended for learning, adaptive control (1 of 2)

edit_this.py : Controller.interStepLearn(...)       # Update the controller's internal state at each step

    Args:
        N/A

    Leverage the data in `self.action_buffer`, `self.obs_buffer`, `self.reward_buffer`, `self.done_buffer`, `self.info_buffer`

    Returns: N/A
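
A per-step update can be as simple as keeping running statistics from the newest buffer entries. In the sketch below, only the buffer names and the 'constraint_violation' key come from the text above; the class `StepStats` and its attributes `episode_return` and `violation_steps` are hypothetical.

```python
class StepStats:
    """Hypothetical sketch of a per-step update using the data buffers."""

    def __init__(self):
        self.reward_buffer = []
        self.info_buffer = []
        self.episode_return = 0.0  # running sum of rewards
        self.violation_steps = 0   # steps with a constraint violation

    def interStepLearn(self):
        # Nothing to learn from before the first step is recorded.
        if not self.info_buffer or not self.reward_buffer:
            return
        self.episode_return += self.reward_buffer[-1]
        if self.info_buffer[-1].get("constraint_violation", False):
            self.violation_steps += 1


stats = StepStats()
for reward, violated in [(1.0, False), (0.5, True), (-1.0, False)]:
    stats.reward_buffer.append(reward)
    stats.info_buffer.append({"constraint_violation": violated})
    stats.interStepLearn()
```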

Optional, recommended for learning, adaptive control (2 of 2)

edit_this.py : Controller.interEpisodeLearn(...)    # Update the controller's internal state between episodes

    Args:
        N/A

    Leverage the data in `self.action_buffer`, `self.obs_buffer`, `self.reward_buffer`, `self.done_buffer`, `self.info_buffer`

    Returns: N/A
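
Between episodes, one simple strategy is to fly faster after a clean episode and back off after an unsafe one. The sketch below assumes a hypothetical tuning parameter `segment_duration` (seconds allotted per gate) and made-up scaling factors; the 'constraint_violation' and 'collision' keys follow the step information described earlier, and clearing the buffer at the end is a choice of this sketch only.

```python
class EpisodeTuner:
    """Hypothetical sketch of a between-episodes update."""

    def __init__(self, segment_duration=4.0, min_duration=1.0):
        self.segment_duration = segment_duration  # hypothetical knob (s)
        self.min_duration = min_duration
        self.info_buffer = []

    def interEpisodeLearn(self):
        # 'collision' is a tuple (collided object ID, boolean).
        unsafe = any(
            info.get("constraint_violation", False)
            or info.get("collision", (None, False))[1]
            for info in self.info_buffer
        )
        if unsafe:
            self.segment_duration *= 1.2  # slow down after an unsafe run
        else:
            self.segment_duration = max(
                self.min_duration, 0.9 * self.segment_duration
            )  # speed up after a clean run
        self.info_buffer.clear()  # sketch-only choice: start the next episode fresh


tuner = EpisodeTuner(segment_duration=4.0)
tuner.info_buffer = [{"constraint_violation": False, "collision": (None, False)}]
tuner.interEpisodeLearn()
after_clean = tuner.segment_duration
tuner.info_buffer = [{"constraint_violation": False, "collision": (3, True)}]
tuner.interEpisodeLearn()
after_crash = tuner.segment_duration
```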

Submission

Fork this repository (help)
Check out this branch (beta-iros-competition)
Implement your solution by modifying edit_this.py
Create a Pull Request into utiasDSL/safe-control-gym:beta-iros-competition from your fork (help)
Mention in the Pull Request's Conversation tab (i) how many num_episodes you want your solution to use in each level (mandatory) and (ii) what method(s) you used and results you obtained (optional)
Tag @JacopoPan in the Pull Request's Conversation tab

Note: Private Submissions

If you prefer not to publicly disclose your solution implementation, you can instead create a private import of this repository to develop it and give access to @JacopoPan and @utiasDSLadmin, or even send your edit_this.py to jacopo.panerati@utoronto.ca. In this case, please submit early to allow time for extra checks.

Scoring (v0.3)

A) For ALL levels (0-3, sim2real), solutions will be evaluated—on the last episode—by:

Safety: avoid ALL collisions with gates & obstacles and constraint violations—i.e., only runs with 0 collisions/violations will count as TASK COMPLETION
Performance: minimizing the task time (in sec.) required to complete the task (fly through all the gates and reach the goal)

B) For ALL levels (0-3, sim2real), solutions that accomplish A) will be evaluated—across all episodes—by:

Data & compute efficiency: minimizing the total simulation/flight-clock time (in sec.) of all the episodes used, plus the overall wall-clock learning time (in sec.) spent in interStepLearn() and interEpisodeLearn() to improve performance
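
Read literally, criterion B) adds two quantities measured in seconds. A minimal sketch of that sum, with a hypothetical function name and made-up numbers (the organizers' actual evaluation script may differ):

```python
def data_compute_cost(episode_durations_sec, learning_wall_clock_sec):
    """Total simulated/flight time over all episodes plus learning wall-clock time."""
    return sum(episode_durations_sec) + learning_wall_clock_sec


# Three episodes of 12.0 s, 10.5 s and 9.5 s, plus 3.0 s spent learning.
cost = data_compute_cost([12.0, 10.5, 9.5], 3.0)  # 35.0 seconds
```

Requesting more episodes or spending longer in the learning methods both raise this cost, so extra learning only pays off if it is needed to complete the task.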

C) For ALL levels (0-3, sim2real), the top 3 solutions ranked by the criteria in A) and the top 3 solutions ranked by the criteria in B) will score 20, 10, and 5 points respectively. The sum of these points will determine the final classification.
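
The point allocation in C) can be sketched for a single level as follows. The team names and rankings are invented for illustration, and `award_points` is a hypothetical helper rather than the organizers' scoring code.

```python
POINTS = (20, 10, 5)  # points for 1st, 2nd and 3rd place in each ranking


def award_points(ranking_a, ranking_b):
    """Sum the points a team earns from the A) and B) rankings of one level."""
    totals = {}
    for ranking in (ranking_a, ranking_b):
        # zip() stops after the top 3, so lower-ranked teams earn nothing.
        for team, points in zip(ranking, POINTS):
            totals[team] = totals.get(team, 0) + points
    return totals


# Made-up rankings: A) by task time, B) by data & compute efficiency.
totals = award_points(
    ["team_x", "team_y", "team_z"],
    ["team_y", "team_z", "team_x"],
)
# team_x: 20 + 5 = 25, team_y: 10 + 20 = 30, team_z: 5 + 10 = 15
```

Repeating this over all levels and summing the per-level totals would give the final classification.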

Important Dates

IROS Conference and Competition days: October 24-26, 2022

Prizes

Kindly supported by Bitcraze:

1st: Bitcraze Crazyflie AI Bundle
2nd: Bitcraze Crazyflie STEM Ranging Bundle
3rd: Bitcraze Crazyflie STEM Bundle

The authors of the best solutions will be invited to co-author a "lessons learned" paper with the organizers.

Organizers

Angela Schoellig (Technische Universität München, University of Toronto, Vector Institute)
Davide Scaramuzza (University of Zurich)
Vijay Kumar (University of Pennsylvania)
Nicholas Roy (Massachusetts Institute of Technology)
Todd Murphey (Northwestern University)
Sebastian Trimpe (RWTH Aachen University)
Wolfgang Hönig (TU Berlin)
Mark Muller (University of California Berkeley)
Jose Martinez-Carranza (INAOE)
SiQi Zhou (University of Toronto, Vector Institute)
Melissa Greeff (University of Toronto, Vector Institute)
Jacopo Panerati (University of Toronto, Vector Institute)
Wenda Zhao (University of Toronto, Vector Institute)
Spencer Teetaert (University of Toronto)
Yunlong Song (University of Zurich)
Leticia Oyuki Rojas Pérez (INAOE)
Adam W. Hall (University of Toronto, Vector Institute)
Justin Yuan (University of Toronto, Vector Institute)
Lukas Brunke (University of Toronto, Vector Institute)
Antonio Loquercio (UC Berkeley)

University of Toronto's Dynamic Systems Lab / Vector Institute for Artificial Intelligence
