TX-cloud-models

FlashOCC

Prepare nuScenes dataset as introduced in nuscenes_det.md and create the pkl for FlashOCC by running:

python tools/create_data_bevdet.py

For Occupancy Prediction task, download (only) the 'gts' from CVPR2023-3D-Occupancy-Prediction
rename customer ops in projects/mmdet3d_plugin/ops to solve "ninja: error: build.ninja:26: multiple rules generate" issue:

cd projects/mmdet3d_plugin/ops
mv bev_pool/src/bev_sum_pool_cuda.cu bev_pool/src/bev_sum_pool_hip_cuda.cu
mv bev_pool/src/bev_max_pool_cuda.cu bev_pool/src/bev_max_pool_hip_cuda.cu
mv bev_pool_v2/src/bev_pool_cuda.cu bev_pool_v2/src/bev_pool_hip_cuda.cu
mv nearest_assign/src/nearest_assign_cuda.cu nearest_assign/src/nearest_assign_hip_cuda.cu

Modify setup.py the cuda file name.
Modify projects/mmdet3d_plugin/core/evaluation/ray_metrics.py L13

#dvr = load("dvr", sources=["lib/dvr/dvr.cpp", "lib/dvr/dvr.cu"], verbose=True, extra_cuda_cflags=['-allow-unsupported-compiler'])
dvr = load("dvr", sources=["lib/dvr/dvr.cpp", "lib/dvr/dvr.cu"], verbose=True, extra_cuda_cflags=[])

Modify the cuda interface specially for torch2.7

lib/dvr/dvr.cu L371: sigma.type() -> sigma.scalar_type()
lib/dvr/dvr.cu L683: sigma.type() -> sigma.scalar_type()
lib/dvr/dvr.cu L736: points.type() -> points.scalar_type()

Modify get_ego_coor in projects/mmdet3d_plugin/models/necks/view_transformer.py L153 to speedup torch.bmm.
Training

python tools/train.py projects/configs/flashocc/flashocc-r50.py

Sparse4D

Prepare data
rename customer ops in projects/mmdet3d_plugin/ops to solve "ninja: error: build.ninja:26: multiple rules generate" issue:

cd projects/mmdet3d_plugin/ops
mv src/deformable_aggregation_cuda.cu src/deformable_aggregation_hip_cuda.cu

modify setup.py the cuda file name
Download pre-trained weights and Training

bash local_train.sh sparse4dv3_temporal_r50_1x8_bs6_256x704

Deformable-DETR

Dataset preparation Please download COCO 2017 dataset and organize them as following:

code_root/
└── data/
    └── coco/
        ├── train2017/
        ├── val2017/
        └── annotations/
        	├── instances_train2017.json
        	└── instances_val2017.json

Compiling CUDA operators

cd ./models/ops
sh ./make.sh
python test.py

Modified: util/misc.py comment L30~L59
Training

GPUS_PER_NODE=8 ./tools/run_dist_launch.sh 8 ./configs/r50_deformable_detr.sh

CenterPoint

Dataset preparation

Check and follow the instruction in prepare_nuscenes.sh to set related variables properly and then run the sctipt to prepare the nuScenes dataset.
It will take a long time (a few hours) to download and process the dataset.

bash prepare_nuscenes.sh

Model training

Set config file, working directory properly, GPU numbers in run_centerpoint.sh then execute it for training.

bash run_centerpoint.sh

PointPillars

Dataset preparation

Check and follow the instruction in prepare_nuscenes.sh to set related variables properly and then run the sctipt to prepare the nuScenes dataset.
It will take a long time (a few hours) to download and process the dataset.

bash prepare_nuscenes.sh

Model training

Set config file, working directory properly, GPU numbers in run_pointpillars.sh then execute it for training.

bash run_pointpillars.sh

Issues

You will meet the following error under the docker of torch2.7/mmcv1.7.1/mmdet2.26.0/mmdet3d1.0.0rc4, try the solution provided below.

Error Messages:
[rank0]:   File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/nn/parallel/distributed.py", line 1654, in forward
[rank0]:     else self._run_ddp_forward(*inputs, **kwargs)
[rank0]:   File "/mmopenlab/mmcv/mmcv/parallel/distributed.py", line 160, in _run_ddp_forward
[rank0]:     self._use_replicated_tensor_module else self.module
[rank0]:   File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1938, in getattr
[rank0]:     raise AttributeError(
[rank0]: AttributeError: 'MMDistributedDataParallel' object has no attribute '_use_replicated_tensor_module'

Solution:
Replace the orifinal line160 of mmcv/mmcv/parallel/distributed.py with the modified version.
Original: self._use_replicated_tensor_module else self.module
Modified: getattr(self, '_use_replicated_tensor_module', False) else self.module

Mask2Former

Dataset preparation Please download COCO 2017 dataset and organize them as following:

code_root/
└── data/
    └── coco/
        ├── train2017/
        ├── val2017/
        └── annotations/
        	├── instances_train2017.json
        	└── instances_val2017.json

Modified: change the value of data_root to the value of code_root in ./configs/mask2former/mask2former_r50_lsj_8x2_50e_coco.py
Training

bash ./tools/dist_train.sh configs/mask2former/mask2former_r50_lsj_8x2_50e_coco.py 8

FCOS_EfficientNetB0

Modified: use COCO 2017 dataset, change the value of data_root to the value of code_root in ./configs/fcos/fcos_efficientnet_caffe_fpn_gn-head_1x_coco.py
Training

python ./tools/train.py configs/fcos/fcos_efficientnet_caffe_fpn_gn-head_1x_coco.py

MapTRv2

Please refer to the guide file in MapTRv2-rocm/READ_rocm_guide.md

BEVFormer

Install dependency

pip install einops fvcore seaborn iopath==0.1.9 timm==0.6.13  typing-extensions==4.5.0

Prepare data

# prepare data before install detectron2, otherwise cause path error.
python tools/create_data.py nuscenes --root-path ./data/nuscenes --out-dir ./data/nuscenes --extra-tag nuscenes --version v1.0 --canbus ./data

Install Detectron2

python -m pip install 'git+https://github.com/facebookresearch/detectron2.git'

Insert torch.multiprocessing.set_start_method('fork') before the main() call at line 271 of tools/train.py to fix TypeError: cannot pickle 'dict_keys' object.
Run the scripts:

./tools/dist_train.sh ./projects/configs/bevformer/bevformer_base.py 8

ResNet50

prepare code

cd resnet50
git clone https://github.com/pytorch/vision.git

MIOpen tuning (optional)

export MIOPEN_FIND_MODE=1
export MIOPEN_FIND_ENFORCE=4
bash run_train.sh

Train

unset MIOPEN_FIND_MODE=1
unset MIOPEN_FIND_ENFORCE=4
bash run_train.sh

Train with NHWC layout (optional)

export PYTORCH_MIOPEN_SUGGEST_NHWC=1
export PYTORCH_MIOPEN_SUGGEST_NHWC_BATCHNORM=1

also need to change layout of input and model:

data = data.to(memory_format=torch.channels_last) # in train.py line 28
model = model.to(memory_format=torch.channels_last) # in train.py line 248
model = torch.compile(model) # to get better performance

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TX-cloud-models

FlashOCC

Sparse4D

Deformable-DETR

CenterPoint

PointPillars

Mask2Former

FCOS_EfficientNetB0

MapTRv2

BEVFormer

ResNet50

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
BEVFormer		BEVFormer
CenterPoint		CenterPoint
Deformable-DETR		Deformable-DETR
FlashOCC		FlashOCC
MapTRv2-rocm		MapTRv2-rocm
PointPillars		PointPillars
Sparse4D		Sparse4D
UniAD		UniAD
mask2former-fcos_efficientnetB0/mmdetection		mask2former-fcos_efficientnetB0/mmdetection
resnet50		resnet50
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

TX-cloud-models

FlashOCC

Sparse4D

Deformable-DETR

CenterPoint

PointPillars

Mask2Former

FCOS_EfficientNetB0

MapTRv2

BEVFormer

ResNet50

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages