Implementing BEVDet in Autoware #4635

cyn-liu · 2024-04-18T04:08:37Z

Checklist

I've read the contribution guidelines.
I've searched other issues and no duplicate issues were found.
I've agreed with the maintainers that I can plan this task.

Description

BEVDet is a bird's-eye-view (BEV) perception algorithm based on surround-view (panoramic) cameras. It unifies multi-view images into a common BEV representation for 3D object detection, which differs from the current 3D perception features of Autoware.
BEVDet code repos

Purpose

This task integrates BEVDet into Autoware for 3D object detection based on multi-view images. It is related to the Sensing & Perception task.

Possible approaches

BEVDet is a 3D object detection model trained on the nuScenes dataset using images from 6 surround-view cameras. Together the 6 cameras cover a 360-degree field of view, with overlap between adjacent cameras. Mapping from 2D to 3D requires the camera intrinsic parameters and the extrinsic parameters between each camera and the ego vehicle.
Integrating BEVDet into Autoware involves placing and calibrating the 6 cameras. The BEVDet model is then converted to ONNX format for deployment in Autoware.
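
To illustrate why both sets of calibration parameters are needed, here is a minimal sketch of the reverse mapping (a 3D point in the ego frame projected into one camera image) using a plain pinhole model. The function name, the camera-to-ego convention for the extrinsic matrix, and the numbers in the usage note are assumptions made for illustration; they are not taken from BEVDet or Autoware.

```python
import numpy as np


def project_ego_point(point_ego, cam_to_ego, intrinsics):
    """Project a 3D point given in the ego frame into one camera image.

    point_ego  : (x, y, z) in the ego frame.
    cam_to_ego : 4x4 homogeneous transform, camera frame -> ego frame
                 (assumed convention for the extrinsic calibration).
    intrinsics : 3x3 pinhole camera matrix K.

    Returns (u, v, depth), or None if the point lies behind the camera.
    """
    # Invert the extrinsic to move the point from the ego frame to the camera frame.
    ego_to_cam = np.linalg.inv(np.asarray(cam_to_ego, dtype=float))
    p_cam = ego_to_cam @ np.append(np.asarray(point_ego, dtype=float), 1.0)

    depth = p_cam[2]
    if depth <= 0.0:
        return None  # not visible in this camera

    # Apply the intrinsics and normalise by depth to obtain pixel coordinates.
    uvw = np.asarray(intrinsics, dtype=float) @ p_cam[:3]
    return uvw[0] / depth, uvw[1] / depth, depth
```

For example, with an identity extrinsic and K = [[1000, 0, 800], [0, 1000, 450], [0, 0, 1]], the point (1.0, 0.5, 10.0) lands at pixel (900, 500) with depth 10. Repeating this for each of the 6 cameras with its own intrinsics and extrinsics shows which images observe a given 3D location.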

Definition of done

Place and calibrate the 6 cameras
Convert the BEVDet model into ONNX format
Deploy the BEVDet model on the device using TensorRT
Adapt the BEVDet output to Autoware topics

liuXinGangChina · 2024-04-18T07:38:22Z

Great, maybe you can make a to-do task list first and see which tasks others can take part in.

cyn-liu · 2024-05-21T08:01:49Z

We followed this project and successfully ran it on our own machine.
We used an RTX 3080 GPU and TensorRT FP16 inference with the BEVDet-R50-4DLongterm-Depth model. For the mAP and inference speed of the TensorRT version of BEVDet-R50-4DLongterm-Depth, refer to this project link.
The following are the results on our machine:

v1.mp4

The following is the inference speed on our machine:

v2.mp4

Next, we will port the ROS1 node in this project to a ROS2 node. Then we will test it with TIER IV's dataset; we hope this dataset can be provided in ROS2 bag format.

Our plan for integrating the BEVDet ROS2 node into Autoware:

define a bevdet_node in the Autoware perception module
convert the 3D box results into the autoware_perception_msgs::msg::DetectedObjects type
feed the output of bevdet_node into the object_merger node and fuse it with the detection results of other models
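
The second step of the plan can be sketched as follows. This is only an illustration under stated assumptions: SimpleDetectedObject and its fields are hypothetical stand-ins, not the actual autoware_perception_msgs definitions, and the input box layout (a dict with center, size, yaw, label, and score) and the score threshold are also assumed. The part that carries over to a real node is the conversion of a yaw angle into a quaternion for the object pose.

```python
import math
from dataclasses import dataclass


@dataclass
class SimpleDetectedObject:
    """Hypothetical stand-in for one entry of a DetectedObjects message."""
    label: str
    score: float
    position: tuple          # (x, y, z) of the box center
    orientation_xyzw: tuple  # quaternion (x, y, z, w)
    dimensions: tuple        # box size, copied from the input as-is


def yaw_to_quaternion(yaw):
    """Quaternion (x, y, z, w) for a rotation of `yaw` radians about the z axis."""
    return (0.0, 0.0, math.sin(yaw / 2.0), math.cos(yaw / 2.0))


def boxes_to_objects(boxes, score_threshold=0.3):
    """Convert raw detector boxes into SimpleDetectedObject entries.

    Each box is assumed to be a dict with the keys
    'center', 'size', 'yaw', 'label' and 'score'.
    Boxes below the (assumed) score threshold are dropped.
    """
    objects = []
    for box in boxes:
        if box["score"] < score_threshold:
            continue
        objects.append(
            SimpleDetectedObject(
                label=box["label"],
                score=box["score"],
                position=tuple(box["center"]),
                orientation_xyzw=yaw_to_quaternion(box["yaw"]),
                dimensions=tuple(box["size"]),
            )
        )
    return objects
```

In an actual ROS2 node, the same fields would be written into the real message type before publishing, and the result would then be passed on to object_merger.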

cyn-liu · 2024-05-29T08:59:18Z

Environment:
CUDA 11.3.1
cudnn-linux-x86_64-8.8.1.3_cuda11
TensorRT-8.5.1.7.Linux.x86_64-gnu

liuXinGangChina · 2024-06-18T07:31:11Z

Maybe try with AWSIM data

liuXinGangChina · 2024-07-02T07:54:36Z

Please list the CUDA environment here.

cyn-liu · 2024-07-03T12:23:46Z

We ran inference with the BEVDet model on the TIER IV dataset and found that the model generalizes poorly to it.

Visualization results on TIER IV data:

(1)

(2)

Visualization results on nuScenes data:

liuXinGangChina · 2024-07-04T02:35:41Z

It looks like the original pre-trained model (trained on the nuScenes dataset) does not generalize to the TIER IV dataset as well as we expected. The obstacles' directions are almost right, but the depth of them ge

We plan to close this task once the node has been tested, and then create a new task, "retrain the model", to see whether the new model's performance on the TIER IV dataset improves.

cyn-liu · 2024-07-11T02:04:45Z

Our plan for integrating the BEVDet ROS2 node into Autoware:

define a bevdet_node in the Autoware perception module

convert the 3D box results into the autoware_perception_msgs::msg::DetectedObjects type

feed the output of bevdet_node into the object_merger node and fuse it with the detection results of other models

Running the multi-camera BEV 3D detection algorithm and the LiDAR-based 3D detection algorithm simultaneously is too heavy a load. We have therefore decided not to merge the BEVDet results with the LiDAR output, but to create a new perception_mode instead: when perception_mode = camera, bevdet_node is launched.
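
The mode switch described above amounts to choosing one detection pipeline instead of running both. A minimal sketch of that selection logic follows, written as plain Python rather than an actual ROS2 launch file. Only the value "camera" and the name bevdet_node come from this thread; the function name and the placeholder for the existing pipeline are assumptions.

```python
def detection_nodes_for_mode(perception_mode):
    """Return the detection nodes to launch for a given perception_mode.

    Only the 'camera' branch reflects the decision in this thread; the
    fallback entry is a placeholder for whatever pipeline is already
    configured for the other modes.
    """
    if perception_mode == "camera":
        # Camera-only mode: run BEVDet alone, without the LiDAR detector.
        return ["bevdet_node"]
    # Any other mode: keep the existing pipeline and do not start BEVDet.
    return ["existing_detection_pipeline"]  # hypothetical placeholder name
```

Keeping the two branches mutually exclusive is what avoids the combined GPU load of running both detectors at once.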

cyn-liu · 2024-07-22T02:17:05Z

@xmfcx The PR related to this issue has been successfully tested in the newer Autoware Docker image.
Environment information for this image:

CUDA==12.3
libnvinfer==8.6.1.6

Note: Outside Docker, I had to upgrade my NVIDIA GPU driver to a version that supports a maximum CUDA version >= 12.3.
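
One way to check this requirement on the host is to read the "CUDA Version" field that nvidia-smi prints in its header, which reports the highest CUDA version the installed driver supports. The sketch below parses that text; the function names are made up for illustration, and the header strings in the usage note are illustrative samples rather than output captured from the machine used here.

```python
import re


def max_supported_cuda(nvidia_smi_output):
    """Extract (major, minor) from the 'CUDA Version: X.Y' field, or None if absent."""
    match = re.search(r"CUDA Version:\s*(\d+)\.(\d+)", nvidia_smi_output)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def driver_supports(nvidia_smi_output, required=(12, 3)):
    """True if the driver's maximum supported CUDA version is at least `required`."""
    found = max_supported_cuda(nvidia_smi_output)
    return found is not None and found >= required
```

For instance, driver_supports("Driver Version: 550.54.14   CUDA Version: 12.4") is True, while a header reporting "CUDA Version: 11.4" yields False. On a real host, the text can be obtained with subprocess.run(["nvidia-smi"], capture_output=True, text=True).stdout.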

cyn-liu added the component:calibration (Calibration of sensors and hardware), component:perception (Advanced sensor data processing and environment understanding), and component:sensing (Data acquisition from sensors, drivers, preprocessing) labels Apr 18, 2024

cyn-liu added this to Autoware Labs Apr 18, 2024

cyn-liu moved this to Todo in Autoware Labs Apr 18, 2024

xmfcx assigned cyn-liu Apr 19, 2024

liuXinGangChina moved this from Todo to In Progress in Autoware Labs May 21, 2024

cyn-liu linked a pull request Jul 11, 2024 that will close this issue

feat(autoware_tensorrt_bevdet): add new 3d object detection method autowarefoundation/autoware.universe#7956

Open

cyn-liu linked a pull request Jul 16, 2024 that will close this issue

feat(autoware_tensorrt_bevdet): add new 3d object detection method autowarefoundation/autoware.universe#7956

Open

xmfcx removed this from Autoware Labs Oct 4, 2024

xmfcx added this to Software Working Group Oct 4, 2024
