Object Pose Estimation Tutorial: Part 4

In Part 1 of the tutorial, we learned how to create our Scene in the Unity Editor. In Part 2, we set up the Scene for data collection. In Part 3, we learned:

  • How to collect the data
  • How to train the deep learning model

In this part, we will use our trained deep learning model to predict the pose of the cube, and pick it up with our robot arm.

If you have followed Parts 1 and 2 correctly, you should already have cloned the repository, whether you chose to use the Unity project we provide or to build it from scratch.

Note: This project uses Git Submodules to pull in the ROS package dependencies for the universal_robot, moveit_msgs, ros_tcp_endpoint, and robotiq folders. If you cloned the project and forgot to use --recurse-submodules, or if any submodule in this directory is missing its content (e.g. moveit_msgs or ros_tcp_endpoint), you can run the following command to fetch the Git submodules:

cd /PATH/TO/Robotics-Object-Pose-Estimation &&
git submodule update --init --recursive

In your ROS/src folder, you should now have five subdirectories: moveit_msgs, robotiq, ros_tcp_endpoint, universal_robot and ur3_moveit.

Here you have two options for the model:

Option A: Use Our Pre-trained Model

  1. To save time, you may use the model we have trained. Download this UR3_single_cube_model.tar file, which contains the pre-trained model weights.

Option B: Use Your Own Model

  1. You can also use the model you have trained in Part 3. However, be sure to rename your model file to UR3_single_cube_model.tar, since the script that loads the model expects this name.

Moving the Model to the ROS Folder

  1. Go inside the ROS/src/ur3_moveit folder and create a folder called models. Then copy your model file (.tar) into it.

Note: This project has been developed with Python 3 and ROS Noetic.

The provided ROS files require several packages to be installed. The following section steps through configuring a Docker container, which serves as the ROS workspace for this tutorial and includes these packages. If you would like to manually set up your own ROS workspace with the provided files instead, follow the steps in Part 0: ROS Setup.

Building this Docker container will install the necessary packages for this tutorial.

  1. Install the Docker Engine if it is not already installed, and start the Docker daemon. To check that the daemon is running, open your Docker application: you should see a green dot in the bottom-left corner with the word running at the foot of the window.

  2. In the terminal, ensure the current location is at the root of the Robotics-Object-Pose-Estimation directory. Build the provided ROS Docker image as follows:
docker build -t unity-robotics:pose-estimation -f docker/Dockerfile .

Note: The provided Dockerfile uses the ROS Noetic base image. Building the image will install the necessary packages, copy the provided ROS packages and submodules into the container, pre-download and cache the VGG16 model, and build the catkin workspace.

  3. Start the newly built Docker container:
docker run -it --rm -p 10000:10000 -p 5005:5005 unity-robotics:pose-estimation /bin/bash

The docker build step concludes by printing Successfully tagged unity-robotics:pose-estimation. Once the container is running, the console should open into a bash shell at the ROS workspace root, e.g. root@8d88ed579657:/catkin_ws#.

Note: If you encounter issues with Docker, check the Troubleshooting Guide for potential solutions.

  4. Source your ROS workspace:
source devel/setup.bash

The ROS workspace is now ready to accept commands!

Note: The Docker-related files (Dockerfile, bash scripts for setup) are located in Robotics-Object-Pose-Estimation/docker.


If your Pose Estimation Tutorial Unity project is not already open, select and open it from the Unity Hub.

We will work in the same Scene that was created in Parts 1 and 2, so if you have not done so already, complete those parts to set up the Unity project.

Connecting with ROS

Prefabs have been provided for the UI elements and the Trajectory Planner for convenience. These are grouped under the parent ROSObjects GameObject.

  1. In the Project tab, go to Assets/TutorialAssets/Prefabs/Part4 and drag and drop the ROSObjects Prefab into the Hierarchy tab.

  2. The ROS TCP connection needs to be created. In the top menu bar in Unity Editor, select Robotics -> ROS Settings. Find the IP address of your ROS machine.

    • If you are going to run ROS services with the Docker container introduced above, fill ROS IP Address and Override Unity IP with the loopback IP address 127.0.0.1. If you will be running ROS services via a non-Dockerized setup, you will most likely want to leave the Override Unity IP field blank, which lets the Unity IP be determined automatically.

    • If you are not going to run ROS services with the Docker container, e.g. if you are using a dedicated Linux machine or VM instead, open a terminal window in this ROS workspace. Set the ROS IP Address field in Unity Editor to the output of the following command:

    hostname -I
  3. Ensure that ROS Port is set to 10000 and Unity Port is set to 5005.

Opening the ROS Settings window creates a ROSConnectionPrefab asset in the Assets/Resources folder, with the user-input settings. When the static ROSConnection.instance is referenced in a script, if a ROSConnection instance is not already present, the Prefab will be instantiated in the Unity scene, and the connection will begin.

Note: While using the ROS Settings window is the suggested workflow, you may still manually create a GameObject with an attached ROSConnection component.
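
For reference, here is a minimal sketch of how a script obtains the connection. It assumes the ROS-TCP-Connector version used in this tutorial, where ROSConnection lives in the Unity.Robotics.ROSTCPConnector namespace; the script name ConnectionExample is hypothetical.

using Unity.Robotics.ROSTCPConnector;
using UnityEngine;

public class ConnectionExample : MonoBehaviour
{
    ROSConnection ros;

    void Start()
    {
        // Referencing the static instance finds an existing ROSConnection in the
        // Scene, or instantiates the ROSConnectionPrefab from Assets/Resources
        // and begins the connection using the settings entered above.
        ros = ROSConnection.instance;
    }
}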

The provided script Assets/TutorialAssets/Scripts/TrajectoryPlanner.cs contains the logic to invoke the motion planning services, as well as the logic to control the gripper and end effector tool. This has been adapted from the Pick-and-Place tutorial. The component has been added to the ROSObjects/Publisher object.

In the TrajectoryPlanner script, two functions are defined but not yet implemented. InvokePoseEstimationService() and PoseEstimationCallback() will create a ROS Service request and handle the ROS Service response, respectively. The following steps provide the code and explanations for these functions.

  1. Open the TrajectoryPlanner.cs script in an editor. Find the empty InvokePoseEstimationService(byte[] imageData) function definition, starting at line 165. Replace the empty function with the following:
private void InvokePoseEstimationService(byte[] imageData)
{
    // Use the dimensions of the RenderTexture the screenshot was taken from
    uint imageHeight = (uint)renderTexture.height;
    uint imageWidth = (uint)renderTexture.width;

    // Package the raw RGBA bytes into a sensor_msgs/Image message
    RosMessageTypes.Sensor.Image rosImage = new RosMessageTypes.Sensor.Image(new RosMessageTypes.Std.Header(), imageWidth, imageHeight, "RGBA", isBigEndian, step, imageData);

    // Wrap the image in a service request and send it to the pose_estimation_srv service
    PoseEstimationServiceRequest poseServiceRequest = new PoseEstimationServiceRequest(rosImage);
    ros.SendServiceMessage<PoseEstimationServiceResponse>("pose_estimation_srv", poseServiceRequest, PoseEstimationCallback);
}

The InvokePoseEstimationService function will be called upon pressing the Pose Estimation button in the Unity Game view. It takes a screenshot of the Scene as input, instantiates a new RGBA sensor_msgs/Image with the defined dimensions, and finally instantiates and sends a new Pose Estimation service request to ROS.

Note: The C# scripts for the necessary ROS msg and srv files in this tutorial have been generated via the ROS-TCP-Connector and provided in the project's Assets/TutorialAssets/RosMessages directory.
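
For context, the imageData argument passed to InvokePoseEstimationService comes from a screenshot of the same renderTexture whose dimensions are used above. Below is a hypothetical sketch of that capture step; TrajectoryPlanner.cs ships with its own implementation, so treat the method body here as illustrative rather than code to add.

public void PoseEstimation()
{
    // Read the current camera view out of the RenderTexture into a Texture2D
    var previousActive = RenderTexture.active;
    RenderTexture.active = renderTexture;
    var texture = new Texture2D(renderTexture.width, renderTexture.height, TextureFormat.RGBA32, false);
    texture.ReadPixels(new Rect(0, 0, renderTexture.width, renderTexture.height), 0, 0);
    texture.Apply();
    RenderTexture.active = previousActive;

    // The raw RGBA bytes become the payload of the sensor_msgs/Image request
    byte[] imageData = texture.GetRawTextureData();
    InvokePoseEstimationService(imageData);
}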

Next, the function that is called to manage the Pose Estimation service response needs to be implemented.

  2. Still in the TrajectoryPlanner script, find the empty PoseEstimationCallback(PoseEstimationServiceResponse response) function definition. Replace the empty function with the following:
void PoseEstimationCallback(PoseEstimationServiceResponse response)
{
    if (response != null)
    {
        // The position output by the model is the position of the cube relative to the camera, so we transform it to obtain its global position
        var estimatedPosition = Camera.main.transform.TransformPoint(response.estimated_pose.position.From<RUF>());
        var estimatedRotation = Camera.main.transform.rotation * response.estimated_pose.orientation.From<RUF>();

        // Forward the estimated pose to the MoveIt trajectory planning request
        PublishJoints(estimatedPosition, estimatedRotation);

        // Display the estimate in the UI
        EstimatedPos.text = estimatedPosition.ToString();
        EstimatedRot.text = estimatedRotation.eulerAngles.ToString();
    }
    // Re-enable the UI buttons once the response has been handled
    InitializeButton.interactable = true;
    RandomizeButton.interactable = true;
}

This callback is automatically run when the Pose Estimation service response arrives. This function simply converts the incoming pose into UnityEngine types and updates the UI elements accordingly. Once converted, the estimated position and rotation are sent to PublishJoints, which will send a formatted request to the MoveIt trajectory planning service.

Note: The incoming position and rotation are converted From<RUF>, i.e. Unity's coordinate space, in order to cleanly convert from a geometry_msgs/Point and geometry_msgs/Quaternion to UnityEngine.Vector3 and UnityEngine.Quaternion, respectively. This is equivalent to creating a new Vector3(response.estimated_pose.position.x, response.estimated_pose.position.y, response.estimated_pose.position.z), and so on. This functionality is provided via the ROSGeometry component of the ROS-TCP-Connector package.
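
To make that equivalence concrete, here is an illustrative fragment as it might appear inside PoseEstimationCallback; it is not additional code you need to add.

// Both lines produce the same Vector3: From<RUF> reads the message's x, y, z
// directly into Unity's right-up-forward coordinate space. The (float) casts
// are shown in case the message fields are doubles.
Vector3 viaRosGeometry = response.estimated_pose.position.From<RUF>();
Vector3 byHand = new Vector3(
    (float)response.estimated_pose.position.x,
    (float)response.estimated_pose.position.y,
    (float)response.estimated_pose.position.z);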

Note: The Randomize Cube button calls the RandomizeCube() method. This randomizes the position and orientation of the cube, the position of the goal, and the color, intensity, and position of the light. This is achieved through running the Randomizers defined in the Pose Estimation Scenario component of the Simulation Scenario GameObject. If you want to learn more about how we created a modified Scenario so that we could trigger Iterations manually at inference time, check out the PoseEstimationScenario.cs script.

Note that the TrajectoryPlanner component exposes its member variables in the Inspector window; these need to be assigned.

  1. Return to Unity. Select the ROSObjects/Publisher GameObject. Assign the ur3_with_gripper GameObject to the Robot field. Drag and drop the Cube GameObject from the Hierarchy onto the Target field in the Inspector. Drag and drop Goal to the Goal field. Finally, assign the Simulation Scenario object to the Scenario field.

Switching to Inference Mode

  1. On the Simulation Scenario GameObject, uncheck the Automatic Iteration property of the Pose Estimation Scenario, as we are no longer in the Data Collection part. If you want to collect new data in the future, you can always enable Automatic Iteration and disable ROSObjects.

  2. On the Main Camera GameObject, uncheck the Perception Camera script component, since we do not need it anymore.

Also note the UI elements provided in ROSObjects/Canvas, including the Event System that Unity adds by default. In ROSObjects/Canvas/ButtonPanel, the OnClick callbacks have been pre-assigned in the Prefab. These buttons set the robot to its upright default position, randomize the cube position and rotation, randomize the target, and call the Pose Estimation service.

Run the following roslaunch command in order to start roscore, set the ROS parameters, start the server endpoint, start the Mover Service and Pose Estimation nodes, and launch MoveIt.

  1. In the terminal window of your ROS workspace opened in Set up the ROS side, run the provided launch file:
roslaunch ur3_moveit pose_est.launch

This launch file also loads all relevant files and starts ROS nodes required for trajectory planning for the UR3 robot (demo.launch). The launch files for this project are available in the package's launch directory, i.e. src/ur3_moveit/launch.

This launch will print various messages to the console, including the set parameters and the nodes launched. The final message should confirm You can start planning now!.

Note: The launch file may throw errors regarding [controller_spawner-5] process has died. These are safe to ignore as long as the final You can start planning now! message appears. This confirmation may take up to a minute to appear.

  2. Return to Unity, and press Play.

Note: If you encounter connection errors such as a SocketException or don't see a completed TCP handshake between ROS and Unity in the Console window, return to the Connecting with ROS section above to update the ROS Settings and generate the ROSConnectionPrefab.

Note: If you encounter a SocketException on Ubuntu, check the Troubleshooting Guide for potential solutions.

Note that the robot arm must be in its default position, i.e. standing upright, before performing Pose Estimation. To return it to this position, simply click the Reset Robot Position button after each run.

  3. Press the Pose Estimation button to send the image to ROS.

This will grab the current camera view, generate a sensor_msgs/Image message, and send a new Pose Estimation Service Request to the ROS node running pose_estimation_service.py. This will run the trained model and return a Pose Estimation Service Response containing an estimated pose, which is subsequently converted and sent as a new Mover Service Request to the mover.py ROS node. Finally, MoveIt calculates and returns a list of trajectories to Unity, and the poses are executed to pick up and place the cube.

The target object and empty goal object can be moved around during runtime for different trajectory calculations, or can be randomized using the Randomize Cube button.

Note: You may encounter a UserWarning: CUDA initialization: Found no NVIDIA driver on your system. warning upon the first image prediction attempt. This warning can be safely ignored.

Note: If you encounter issues with the connection between Unity and ROS, check the Troubleshooting Guide for potential solutions.


Congrats! You did it!

Click here to go back to Part 3.