Follow Person RR

Goal

The objective of this practice is to implement the logic of a navigation algorithm that will be used to follow a person in a hospital using a R-CNN (Region based Convolutional Neural Network) called SSD (Single Shot Detector)

Follow Person cover — Follow Person Cover

Note: If you haven’t, take a look at the user guide to understand how the installation is made, how to launch a RoboticsBackend and how to perform the exercises.

Simulated Turtlebot 2 (ROS Humble)

The robot that we will use is a Turtlebot2 (a circular mobile robot) implemented and developed for ROS Foxy and ROS 2 Humble. It has a RGBD camera so that we can detect objects or people, and it has a laser 360º for implement algorithms as VFF if you need to avoid obstacles.

Person model teleoperation mode

The web template includes a teleoperation mode that allows you to control a person within the hospital. To enable this, switch to the “Follow Person Teleop” universe and then you will can use the WASD keys to move the model.

W: forward movement
S: backward movement
A: left rotation
D: right rotation

If it doesn’t react, click on the area where the image is shown and try again.

Frequency API

import Frequency - to import the Frequency library class. This class contains the tick function to regulate the execution rate.
Frequency.tick(ideal_rate) - regulates the execution rate to the number of Hz specified. Defaults to 50 Hz.

Robot API

import HAL - to import the HAL(Hardware Abstraction Layer) library class. This class contains the functions that sends and receives information to and from the Hardware(Gazebo).
import WebGUI - to import the WebGUI (Web Graphical User Interface) library class. This class contains the functions used to view the debugging information, like image widgets.
HAL.getImage() - to obtain the current frame of the camera robot.
HAL.getPose3d().x - to get the position of the robot (x coordinate)
HAL.getPose3d().y - to obtain the position of the robot (y coordinate)
HAL.getPose3d().yaw - to get the orientation of the robot with regarding the map
HAL.getLaserData() - it allows to obtain the data of the laser sensor. It returns a list of 180 laser measurements (0 - 180 degrees)
HAL.setV() - to set the linear speed
HAL.setW() - to set the angular velocity
HAL.getBoundingBoxes() - this method calls a detect() neural network’s method to obtain a list of detected objets from an image passed as argument.
WebGUI.showImage() - to show an opencv image in the web template

Laser attributes

HAL.getLaserData() returns an instance of a Class with the following attributes:

minAngle - Start angle of the scan [rad]
maxAngle - End angle of the scan [rad]
minRange - minimum range value [m]
maxRange - maximum range value [m]
values - A list of 180 measurements [m] (Note: values < minRange or > maxRange should be discarded)

Bounding Box attributes

HAL.getBoundingBoxes() returns an instance a list of Bounding Box Classes with the following attributes:

id - identifier of the type of object (1, 2, 3)
class-id - name of the object (1->person, 2->bicycle, 3->car, …). It uses a coco_names.py file which you can see in this link: (TODO)
xmin - x value of the top left point of the bounding box
ymin - y value of the top left point of the bounding box
xmax - x value of the bottom right point of the bounding box
ymax - y value of the bottom right point of the boudning box

Example of use

# Move forward
HAL.setV(0.3)
HAL.setW(0.0)

while True:
    # -- Read from sensors
    img = HAL.getImage()
    bounding_boxes = HAL.getBoundingBoxes(img)
    laser_data = HAL.getLaserData()

    # -- Process sensors data (bounding boxes, laser ...).

    # -- Send commands to actuators.

    # -- Show some results
    WebGUI.showImage(img)

Theory

When we are designing a robotic application that knows how to follow a person, the most important mission is knowing how to detect it and not lose it.

The first step is the detection of people; we perform this first task using a Region-based Convolutional Neural Network (R-CNN). CNNs are a type of network where the first neurons capture groups of pixels and these neurons form new groups for next layers by doing convolutions with Kernel filters. The neurons of the output layer return the percentage probability of an image that belong to a given class (classification). For more information, see this link. With an R-CNN we use a CNN on many regions of the image and we select those regions with a higher probability of success. There are many types of architectures based on R-CNN such as Yolo or SSD. In this exercise you will use an SSD trained model. If you want to know how SSD works, you can access this link.

Once we have detected all the people in the image, we can establish several criteria to decide which person are we going to follow.

In order not to lose our target, we can use tracking algorithms. A homemade method that usually works well, consists on locating the centroid of every bounding box in each iteration and comparing it with the chosen Centroid of the previous frame. We will stay with that bounding box that have the closest centroid and the most similar area to the chosen bounding box of the previous frame.

The second step is to use the Kobuki base actuators to move and get closer to the person. To achieve this goal, we look at the location of the centroid of the candidate bounding box. Depending on the position, we will establish a certain angular speed.

An easy method to implement this is by discretized case-based behavior. We take the width of an image and divide it into X number of columns. We assign an specific angular velocity to each range, and, depending on where the centroid is, we will apply the corresponding velocity.

Another method, a bit more complicated but more efficient, is to implement a PID controller for the angular velocity. With a good design, we will obtain a more precise and less oscillatory turning response.

However, the robot moves through an environment where there may be obstacles. There is an algorithm called VFF (Virtual Force Field) that allows us to avoid collisions while following a target. It is based on the sum of attraction and repulsion vectors that determine the direction of movement.

Virtual Force Field

The Virtual Force Field Algorithm works as follows:

The robot assigns an Attraction Vector to the objective (person). With an image, you will have to use the Field of View of the camera (60 degrees) to know the angle of each pixel with the center of the image. You can set a fixed module vector with a 2D camera.
The robot assigns a Repulsion Vector to the obstacle, according to its sensor readings that points away from the waypoint. This is done by adding all the vectors that are translated from the sensor readings.
The robot follows the Final Vector obtained by adding the attraction and repulsion vector.

Virtual Force Field (VFF) — Virtual Force Field

PID Controller

To understand PID Control, let us first understand what is Control in general.

Control System

A system or set of devices, that manages, commands, directs or regulates the behavior of other devices or systems to achieve the desired results. Simply speaking, a system which controls other systems. Control Systems help a robot to execute a set of commands precisely, in the presence of unforeseen errors.

Types of Control System

Open Loop Control System

A control system in which the control action is completely independent of the output of the system. A manual control system is on Open Loop System.

Closed Loop Control System

A control system in which the output has an effect on the input quantity, in such a manner that the input will adjust itself based on the output generated. An open loop system can be converted to a closed one by providing feedback.

PID Control

A control loop mechanism employing feedback. A PID Controller continuously calculates an error value as the difference between desired output and the current output and applies a correction based on proportional, integral and derivative terms(denoted by P, I, D respectively).

Proportional

Proportional Controller gives an output which is proportional to the current error. The error is multiplied with a proportionality constant to get the output. And hence, 0 if the error is 0.

Integral

Integral Controller provides a necessary action to eliminate the offset error which is accumulated by the P Controller. It integrates the error over a period of time until its value reaches zero.

Derivative

Derivative Controller gives an output depending on the rate of change or error with regard to time. It gives the kick start for the output, thereby increasing system response.

Types of Control Systems — Control Systems and PID

Videos

Example of use of Person Teleoperator

Reference solution of Simulated Follow Person

Contributors

Contributors: Carlos Caminero Abad, Jose María Cañas, Lucía Lishan Chen Huang
Maintained by Carlos Caminero Abad, Lucía Lishan Chen Huang.

References

1 2 3