Method | F1_0 | F1_1 | F1_2 | F1_3 | F1_val | F1_test |
---|---|---|---|---|---|---|
hand-crafted features + LGBM | 0.86 | 0.19 | 0.65 | 0.97 | 0.706 | 0.6307 |
DCNN features + Resnet101 | 0.88 | 0.21 | 0.65 | 0.98 | 0.714 | |
hand-crafted features + DCNN features + Resnet101 | 0.90 | 0.18 | 0.66 | 0.98 | 0.716 | 0.6170 |
Directly classify scene images into several traffic statuses (unimpeded, congested, and slow) based on deep convolutional features.
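As a rough illustration of this frame-level approach, below is a minimal sketch, assuming a torchvision ResNet-50 backbone and 4 status classes; it is not the repository's actual model, which additionally aggregates frame features with a GRU (see the notes below).

```python
import torch
import torch.nn as nn
import torchvision

class FrameClassifier(nn.Module):
    """Per-frame deep-feature classifier (illustrative sketch only)."""
    def __init__(self, num_classes=4):
        super().__init__()
        backbone = torchvision.models.resnet50(pretrained=True)
        # keep the convolutional feature extractor + global pooling, drop the ImageNet fc
        self.features = nn.Sequential(*list(backbone.children())[:-1])
        self.fc = nn.Linear(2048, num_classes)

    def forward(self, x):                      # x: (B, 3, H, W)
        feat = self.features(x).flatten(1)     # (B, 2048) deep convolutional features
        return self.fc(feat)                   # (B, num_classes) traffic-status logits

logits = FrameClassifier()(torch.randn(2, 3, 224, 224))
```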
Backbone | F1_0 | F1_1 | F1_2 | F1_3 | score |
---|---|---|---|---|---|
Resnet50 | 0.00 | 0.00 | 0.00 | 0.67 | 0.268 |
Resnet50 + re-weighting | 0.00 | 0.00 | 0.00 | 0.67 | 0.268 |
Resnet50 + oversampling | 0.44 | 0.26 | 0.42 | 0.65 | 0.483 |
Resnet101 + oversampling | 0.44 | 0.46 | 0.57 | 0.76 | 0.610 |
Resnet101 + oversampling + GRU | 0.60 | 0.48 | 0.52 | 0.91 | 0.676 |
Note:
- All of the above results were obtained before fixing a preprocessing error.
Method | F1_0 | F1_1 | F1_2 | F1_3 | score |
---|---|---|---|---|---|
Resnet101 | 0.88 | 0.21 | 0.65 | 0.98 | 0.714 |
Resnet101 + feat_mask | 0.89 | 0.16 | 0.64 | 0.98 | 0.703 |
Resnet101 + feat_vector | 0.89 | 0.16 | 0.66 | 0.98 | 0.710 |
Resnet101 + feat_mask + feat_vector | 0.90 | 0.18 | 0.66 | 0.98 | 0.716 |
Resnet101 + feat_mask + feat_vector + fc-DP50 | 0.90 | 0.18 | 0.65 | 0.98 | 0.707 |
Resnet101 + FT3 + feat_mask + feat_vector | 0.91 | 0.12 | 0.70 | 0.98 | 0.716 |
Resnet50 + feat_mask + feat_vector | 0.89 | 0.14 | 0.66 | 0.98 | 0.708 |
Resnet50 + FT2 | 0.91 | 0.09 | 0.70 | 0.99 | 0.715 |
Resnet50 + FT2 + mixup | 0.87 | 0.20 | 0.66 | 0.99 | 0.720 |
Resnet50 + FT2 + BBN(LSTM rep) | 0.90 | 0.19 | 0.66 | 1.00 | 0.725 |
Resnet50 + FT2 + BBN(LSTM cls) | 0.89 | 0.17 | 0.67 | 1.00 | 0.722 |
Resnet50 + FT2 + BBN(LSTM rep) + data aug | 0.90 | 0.16 | 0.63 | 0.95 | 0.693 |
**Resnet50 + FT2 + BBN(LSTM rep) + ISR1 | 0.91 | 0.17 | 0.69 | 1.00 | 0.731 |
**Resnet50 + FT2 + BBN(LSTM rep) + ISR2 | 0.91 | 0.15 | 0.70 | 1.00 | 0.730 |
EfficientNetB4 + FT2(BS8) + BBN(LSTM rep) | 0.80 | 0.12 | 0.40 | 0.93 | 0.596 |
EfficientNetB4 + FT4(BS32) + BBN(LSTM rep) | 0.82 | 0.17 | 0.58 | 0.96 | 0.673 |
Resnet50 + FT1 | 0.87 | 0.19 | 0.62 | 0.99 | 0.708 |
*Resnet50 + FT2 + bilinear pooling | 0.85 | 0.19 | 0.57 | 0.97 | 0.679 |
*ResNeSt101 + feat_mask + feat_vector | 0.90 | 0.06 | 0.66 | 0.97 | 0.689 |
Note:
- All methods use oversampling (except for BBN) and a GRU.
- * denotes models trained and evaluated on the first fold only, and trained for 2 epochs.
- FT denotes fine-tuning. By default, the backbones are frozen during training; FTn denotes that the layers after the n-th layer are fine-tuned.
- LSTM rep denotes that the LSTM acts as part of the feature extractor, while LSTM cls denotes that it acts as part of the classifier.
- DP denotes dropout.
- ISR denotes Improved Sequence Representation. The vanilla sequence representation is an average over all LSTM hidden states, including both valid frames and invalid padding. Using the sequence information (key frame index and sequence length), ISR is a re-weighted average over the valid hidden states only, where the hidden state at the key frame is given twice the weight of the others (a minimal sketch follows these notes). ISR1 denotes ISR with the sequence length; ISR2 denotes ISR with both the sequence length and the key frame index.
- Only results denoted by ** are latest.
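A minimal sketch of the ISR pooling described in the notes above, assuming padded LSTM outputs of shape (B, T, C); the function and argument names are hypothetical, not the repository's actual implementation.

```python
import torch

def isr_pooling(hidden_states, seq_lens, key_idx=None):
    """Re-weighted average over valid LSTM hidden states (ISR sketch).

    hidden_states: (B, T, C) padded LSTM outputs
    seq_lens:      (B,) number of valid frames per sequence
    key_idx:       (B,) key-frame index per sequence, or None
    ISR1 corresponds to key_idx=None (sequence length only);
    ISR2 additionally passes the key-frame index.
    """
    B, T, _ = hidden_states.shape
    # 1 for valid frames, 0 for padded frames
    mask = (torch.arange(T, device=hidden_states.device)[None, :]
            < seq_lens[:, None]).float()                       # (B, T)
    weights = mask.clone()
    if key_idx is not None:
        # the key frame gets twice the weight of the other valid frames
        weights[torch.arange(B), key_idx] = 2.0
    weights = weights / weights.sum(dim=1, keepdim=True)
    return (hidden_states * weights[:, :, None]).sum(dim=1)    # (B, C)
```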
Ordinal logistic regression models the ordinal relationship between classes. It is commonly applied to ranking problems, e.g. age estimation and height estimation. Since congestion grows monotonically from unimpeded to congested, ordinal logistic regression is well suited to modeling the traffic status estimation problem. A classic approach to ordinal logistic regression is the K-rank model. The model makes K = num_classes - 1 predictions O = {O_0, ..., O_{K-1}}, where the i-th prediction O_i gives the probability that the class index is greater than i. The probability of the i-th class then equals O_{i-1} - O_i (with O_{-1} = 1 and O_K = 0). K-rank can be implemented with K binary classification models.
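Under the definitions above, decoding the K rank outputs into class probabilities can be sketched as follows (PyTorch; names and shapes are assumptions, not necessarily how this repo implements it).

```python
import torch

def krank_to_class_probs(rank_logits):
    """Convert K-rank logits into class probabilities (sketch).

    rank_logits: (B, K) logits with sigmoid(rank_logits[:, i]) = O_i = P(class > i)
    returns:     (B, K + 1) probabilities, P(class = i) = O_{i-1} - O_i
    """
    o = torch.sigmoid(rank_logits)                   # (B, K)
    ones = torch.ones_like(o[:, :1])                 # O_{-1} = 1
    zeros = torch.zeros_like(o[:, :1])               # O_K   = 0
    o_ext = torch.cat([ones, o, zeros], dim=1)       # (B, K + 2)
    probs = (o_ext[:, :-1] - o_ext[:, 1:]).clamp(min=0)
    return probs / probs.sum(dim=1, keepdim=True)    # renormalize if O is not monotone
```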
Method | F1_0 | F1_1 | F1_2 | F1_3 | score |
---|---|---|---|---|---|
Res50 + FT2 + 4-rank | 0.91 | 0.23 | 0.66 | 0.99 | 0.734 |
Res50 + FT2 + 2-rank + 2-cls | 0.92 | 0.09 | 0.73 | 0.99 | 0.727 |
Res50 + FT2 + 3-rank + 1-cls | 0.93 | 0.26 | 0.71 | 1.00 | 0.756 |
Res50 + FT2 + 3-rank + 1-cls + LS | 0.92 | 0.20 | 0.72 | 0.99 | 0.744 |
Res50 + FT2 + 3-rank + 1-cls + BBN | 0.89 | 0.19 | 0.52 | 0.99 | 0.679 |
Res50 + FT2 + 3-rank + 1-cls + FF | 0.93 | 0.29 | 0.71 | 1.00 | 0.764 |
Res50 + FT2 + 3-rank + 1-cls + FF + RF | 0.92 | 0.28 | 0.73 | 1.00 | 0.767 |
Res50 + FT2 + 3-rank + 1-cls + RF + RC | 0.84 | 0.25 | 0.63 | 0.91 | 0.685 |
Res50 + FT2 + 3-rank + 1-cls + aug | 0.91 | 0.25 | 0.73 | 0.99 | 0.759 |
Res50 + FT2 + 3-rank + 1-cls + FF + aug | 0.92 | 0.30 | 0.72 | 0.99 | 0.764 |
Res50 + FT2 + 3-rank + 1-cls + FF + aug + TTA | 0.90 | 0.29 | 0.70 | 0.99 | 0.755 |
Res50 + FT2 + 3-rank + 1-cls + FF + RF + TTA | 0.93 | 0.27 | 0.72 | 1.00 | 0.763 |
Res50 + FT2 + SORD | 0.91 | 0.16 | 0.64 | 0.99 | 0.710 |
Res50 + FT2 + SORD + 1-cls | 0.92 | 0.20 | 0.66 | 0.99 | 0.726 |
NOTE:
- By default, the oversampling strategy is adopted to handle the imbalanced class distribution.
- 3-rank + 1-cls denotes ranking the first 3 classes and classifying the last class separately. The ordinal regression over the first 3 classes is conditioned on the class not being the last one (a minimal decoding sketch follows these notes).
- LS denotes label smoothing. FF denotes feature fusion. RF denotes RandomFlip at training time. RC denotes RandomResizedCrop at training time. TTA denotes test-time augmentation.
- SORD denotes Soft Labels for Ordinal Regression.
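A minimal decoding sketch for the 3-rank + 1-cls head described above. It assumes the rank head emits two conditional P(class > i | class != 3) logits for the first three classes and the binary head emits a P(class = 3) logit; the actual head layout in the repository may differ.

```python
import torch

def three_rank_one_cls_probs(rank_logits, cls_logit):
    """Combine conditional rank outputs with a last-class binary classifier (sketch).

    rank_logits: (B, 2) with sigmoid(rank_logits[:, i]) = P(class > i | class != 3)
    cls_logit:   (B, 1) with sigmoid(cls_logit) = P(class == 3)
    returns:     (B, 4) class probabilities
    """
    p_last = torch.sigmoid(cls_logit)                             # (B, 1)
    o = torch.sigmoid(rank_logits)                                # conditional rank probs
    o_ext = torch.cat([torch.ones_like(p_last), o,
                       torch.zeros_like(p_last)], dim=1)          # (B, 4)
    cond = (o_ext[:, :-1] - o_ext[:, 1:]).clamp(min=0)            # P(class = i | class != 3)
    return torch.cat([cond * (1.0 - p_last), p_last], dim=1)      # marginalize the condition
```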
Method | F1_0 | F1_1 | F1_2 | F1_3 | score |
---|---|---|---|---|---|
Baseline | 0.90 | 0.26 | 0.69 | 0.99 | 0.746 |
+ sequence prediction only | 0.92 | 0.30 | 0.70 | 1.00 | 0.759 |
+ seq:key=1:2 | 0.91 | 0.29 | 0.71 | 0.99 | 0.762 |
NOTE:
- The baseline model comprises an R50 backbone, an FF neck, and two 3-rank + 1-cls heads for sequence classification and key-frame classification, respectively.
Before running, install mmclassification into the environment:
cd lib/mmclassification
pip install -e .
CUDA_VISIBLE_DEVICES=0 python -m torch.distributed.launch --nproc_per_node 1 train.py \
--config configs/classifiers/classifier_r50_or_bbn.py \
--img_root /path/to/amap_traffic_final_train_data \
--ann_file /path/to/amap_traffic_final_train_0906.json/or/enriched/one \
--lr 0.00025 --max_epoch 8 --milestones 8 --samples_per_gpu 8
python -u test.py \
--img_root ../data/amap_traffic_final_train_data \
--ann_file ../data/amap_traffic_final_train_0906.json \
--device cuda:0 --model_path /path/to/saved/model
python e2e_demo.py --img_root /tcdata/amap_traffic_final_test_data \
--ann_file /path/to/amap_traffic_final_test_0906.json/or/enriched/one \
--test_file /tcdata/amap_traffic_final_test_0906.json \
--device cuda:0 --model_path /path/to/saved/model
Paper | Year | Code |
---|---|---|
Visual Odometry Revisited: What Should Be Learnt? | ICRA2020 | Pytorch |
DeepVO : Towards Visual Odometry with Deep Learning | ICRA2017 | Pytorch |
Unsupervised Learning of Depth and Ego-Motion from Video | CVPR2017 | Pytorch TensorFlow |
Fast, Robust, Continuous Monocular Egomotion Computation | ICRA2016 | None |
Road lane detection based on the Hough line detection algorithm.
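For reference, a minimal OpenCV sketch of Hough-based lane line detection; the Canny and Hough parameters are illustrative assumptions, and the actual lane detection used here relies on the lanedet package described below.

```python
import cv2
import numpy as np

img = cv2.imread('/path/to/image')
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
edges = cv2.Canny(gray, 50, 150)
# Probabilistic Hough transform: each entry is a line segment (x1, y1, x2, y2).
lines = cv2.HoughLinesP(edges, rho=1, theta=np.pi / 180, threshold=50,
                        minLineLength=40, maxLineGap=20)
if lines is not None:
    for x1, y1, x2, y2 in lines[:, 0]:
        cv2.line(img, (int(x1), int(y1)), (int(x2), int(y2)), (0, 0, 255), 2)
cv2.imwrite('/path/to/output_image', img)
```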
Refer to awesome-lane-detection
lanedet is modified from Ultra-Fast-Lane-Detection for easier usage.
- The demo code is refactored.
- Build 3 APIs in inference.py (init_model, inference_model and show_result).
- Support single-image testing by running inference.py (see run.sh).
- The project is refactored to be a package for external calls.
from lanedet.utils.config import Config
from lanedet.inference import init_model, inference_model, show_result
from utils.geometry import split_rectangle, point_in_polygon

config_file = /path/to/config
config = Config.fromfile(config_file)
config.test_model = /path/to/model_weight
model = init_model(config, 'cuda:0')

img_file = /path/to/image
result = inference_model(model, img_file)
img = show_result(img_file, result)
img.save(/path/to/output_image)

# The lane detections are used to determine which is the main lane.
lines = [_[_[:, 0] > 0] for _ in result if len(_[_[:, 0] > 0]) > 2]  # filter high-quality lane detections
lanes = split_rectangle(lines, img.size)
w, h = img.size
main_lane = [point_in_polygon([w/2, h], lane) for lane in lanes].index(True)
Vehicle detection is performed with a general object detector pretrained on the MS COCO dataset, based on mmdetection.
Refer to mmdetection docs.
import cv2
import numpy as np
from mmdet.apis import inference
config = /path/to/config # e.g. mmdetection/configs/cascade_rcnn/cascade_rcnn_r50_fpn_1x_coco.py
checkpoint = /path/to/model/weight # Download from mmdetection model_zoo
detector = inference.init_detector(config, checkpoint=checkpoint, device='cuda:0')
img_file = /path/to/image
out = inference.inference_detector(detector, img_file)
vehicle_labels = ['car', 'motorcycle', 'bus', 'truck', ]
vehicle_ids = [detector.CLASSES.index(label) for label in vehicle_labels]
# keep detections only for the vehicle classes; all other classes stay empty
result = [np.empty((0, 5)) for i in range(len(out))]
for id in vehicle_ids:
    result[id] = out[id]
img = detector.show_result(img_file, result)
cv2.imwrite(/path/to/output_image, img)
- Overview

  Combining vehicle detection and lane detection, we can make a first simple traffic status hypothesis. The hypothesis follows a 4-step pipeline: generate lane areas (polygons), determine the main lane, filter main-lane vehicles, and predict the traffic status as a function of the distance of the closest main-lane vehicle.
- Generate lane areas

  We first generate lane areas from the detected lane markers: we regress the lane lines, then split the image along them, resulting in several lane areas represented by polygon vertices.
- Determine the main lane

  The lane areas are used to determine the main lane that the ego vehicle is driving in, and to filter the vehicles we care about. The main lane is determined by checking which lane area the bottom-center viewpoint (w/2, h) lies in.
- Filter main-lane vehicles

  The vehicle detection results contain vehicle bounding boxes (and semantic segmentation maps). The vehicles we care about are those located on the main lane. The filtering checks whether the bottom-center point of each vehicle bounding box lies inside the main-lane area.
- Predict traffic status

  Based on the key vehicles located on the main lane, we predict the traffic status as a heuristic function of the distance of the closest key vehicle from the camera. The distance is measured as the y-axis L1 distance in the image plane between the bottom-center point of the bounding box and the bottom-center viewpoint (w/2, h). The function is formulated with two parameterized thresholds thr1 and thr2: if the distance is below thr1, the traffic status is hypothesized as congested; if the distance is between thr1 and thr2, the status is slow; otherwise the status is unimpeded. A minimal sketch of this heuristic follows.
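The sketch below covers steps 3 and 4, reusing the point_in_polygon helper shown in the lane detection example above; the function name, the threshold values, and returning unimpeded when no main-lane vehicle is found are assumptions rather than the repository's exact logic.

```python
from utils.geometry import point_in_polygon  # helper used in the lane detection example

def predict_traffic_status(vehicle_boxes, main_lane, img_h, thr1, thr2):
    """Heuristic traffic status from the closest main-lane vehicle (sketch).

    vehicle_boxes: iterable of (x1, y1, x2, y2) detected vehicle boxes
    main_lane:     polygon (list of vertices) of the main-lane area
    img_h:         image height; the viewpoint is the bottom-center of the image
    thr1, thr2:    distance thresholds in pixels (thr1 < thr2), tuned on the data
    """
    distances = []
    for x1, y1, x2, y2 in vehicle_boxes:
        # step 3: keep vehicles whose bottom-center point lies inside the main lane
        bottom_center = [(x1 + x2) / 2, y2]
        if point_in_polygon(bottom_center, main_lane):
            # step 4: y-axis L1 distance to the bottom-center viewpoint (w/2, h)
            distances.append(img_h - y2)
    if not distances:
        return 'unimpeded'        # assumption: no main-lane vehicle means free flow
    d = min(distances)            # closest key vehicle
    if d < thr1:
        return 'congested'
    if d < thr2:
        return 'slow'
    return 'unimpeded'
```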