RuntimeError: CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemmStridedBatched( handle, opa, opb, m, n, k, &alpha, a, lda, stridea, b, ldb, strideb, &beta, c, ldc, stridec, num_batches) #47

Open
liu83 opened this issue Jul 30, 2024 · 1 comment

liu83 commented Jul 30, 2024

Hello,
I followed the README: I created a conda environment, activated it, and ran the demo with hero_model and the vdr dataset, as described in the "Setup" and "Running out of the box!" sections.
However, the run failed at the end with an error (RuntimeError: CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemmStridedBatched( handle, opa, opb, m, n, k, &alpha, a, lda, stridea, b, ldb, strideb, &beta, c, ldc, stridec, num_batches)).
Could you please help me figure out what I did wrong?
I have checked my setup: PyTorch version 1.10.0, CUDA version 11.3. My GPU is an NVIDIA GeForce RTX 3080 Ti (sm_86).
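
For reference, the version details above can be gathered in one place with a short script. This is only a sketch, not part of the repository: `collect_env` is a hypothetical helper name, and it assumes it is run inside the activated conda environment. It uses the standard `torch.version.cuda`, `torch.cuda.get_device_capability` and `torch.cuda.get_arch_list` calls and falls back gracefully when PyTorch or a GPU is not present.

```python
import platform

try:
    import torch
except ImportError:  # PyTorch is not installed in this environment
    torch = None


def collect_env():
    """Return a dict describing the Python / PyTorch / CUDA setup (hypothetical helper)."""
    info = {"python": platform.python_version(), "os": platform.platform()}
    if torch is None:
        info["torch"] = None
        return info

    info["torch"] = torch.__version__
    # CUDA toolkit version this PyTorch build was compiled against (None for CPU builds).
    info["cuda_build"] = torch.version.cuda
    info["cuda_available"] = torch.cuda.is_available()
    if info["cuda_available"]:
        major, minor = torch.cuda.get_device_capability(0)
        info["gpu"] = torch.cuda.get_device_name(0)
        info["compute_capability"] = f"sm_{major}{minor}"
        # GPU architectures this PyTorch build was compiled for.
        info["arch_list"] = torch.cuda.get_arch_list()
    return info


if __name__ == "__main__":
    for key, value in collect_env().items():
        print(f"{key}: {value}")
```

If `compute_capability` does not appear in `arch_list`, the installed PyTorch build was not compiled for this GPU, which is worth ruling out before looking elsewhere.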

Thank you!

Below is the log from my terminal.


/src/simplerecon(main)$ CUDA_VISIBLE_DEVICES=0 python test.py --name HERO_MODEL \
        --output_base_path OUTPUT_PATH \
        --config_file configs/models/hero_model.yaml \
        --load_weights_from_checkpoint weights/hero_model.ckpt \
        --data_config configs/data/vdr_dense.yaml \
        --num_workers 8 \
        --batch_size 2 \
        --fast_cost_volume \
        --run_fusion \
        --depth_fuser open3d \
        --fuse_color \
        --dump_depth_visualization;

########################### Options ###########################

 random_seed: 0
 name: HERO_MODEL
 log_dir: /home/aime/tmp/tensorboard
 notes: 
 log_interval: 100
 val_interval: 1000
 val_batches: 100
 dataset: vdr
 dataset_path: /home/aime/bliu_workspace/src/simplerecon/datasets/vdr
 num_workers: 8
 tuple_info_file_location: data_splits/vdr/
 mv_tuple_file_suffix: _eight_view_deepvmvs_dense.txt
 frame_tuple_type: dense
 model_num_views: 8
 num_images_in_tuple: 8
 dataset_scan_split_file: data_splits/vdr/scans.txt
 split: test
 image_width: 512
 image_height: 384
 shuffle_tuple: False
 test_keyframe_buffer_size: 30
 lr: 0.0001
 wd: 0.0001
 num_sanity_val_steps: 0
 max_steps: 110000
 batch_size: 2
 val_batch_size: 16
 gpus: 2
 precision: 16
 lr_steps: [70000, 80000]
 resume: None
 load_weights_from_checkpoint: weights/hero_model.ckpt
 image_encoder_name: efficientnet
 depth_decoder_name: unet_pp
 loss_type: log_l1
 matching_encoder_type: resnet
 matching_feature_dims: 16
 matching_scale: 1
 matching_num_depth_bins: 64
 min_matching_depth: 0.25
 max_matching_depth: 5.0
 cv_encoder_type: multi_scale_encoder
 feature_volume_type: mlp_feature_volume
 output_base_path: OUTPUT_PATH
 run_fusion: True
 fuse_color: True
 fusion_max_depth: 3.0
 fusion_resolution: 0.04
 depth_fuser: open3d
 single_debug_scan_id: None
 skip_frames: None
 skip_to_frame: None
 pc_fusion_z_thresh: 0.04
 n_consistent_thresh: 3
 voxel_downsample: 0.02
 mask_pred_depth: False
 cache_depths: False
 fusion_use_raw_lowest_cost: False
 high_res_validation: False
 fast_cost_volume: True
 standard_fps: 30
 dump_depth_visualization: True
 use_precomputed_partial_meshes: False
 viz_render_width: 640
 viz_render_height: 480
 cam_marker_size: 0.7
 back_face_alpha: 0.5

###############################################################

################################################################################
####################### VDR Dataset, number of scans: 2 ########################
################################################################################

################################################################################
######################### Running fusion! Using open3d #########################
Output directory:
OUTPUT_PATH/HERO_MODEL/vdr/dense/meshes/0.04_3.0_open3d_color
################################################################################

################################################################################
############################### Saving quick viz.###############################
#######Output directory:
OUTPUT_PATH/HERO_MODEL/vdr/dense/viz/quick_viz ########
################################################################################

WARNING - 2024-07-30 10:26:22,099 - warnings - /home/aime/miniconda3/envs/simplerecon/lib/python3.9/site-packages/timm/models/_factory.py:117: UserWarning: Mapping deprecated model name tf_efficientnetv2_s_in21ft1k to current tf_efficientnetv2_s.in21k_ft_in1k.
model = create_fn(

INFO - 2024-07-30 10:26:22,268 - _builder - Loading pretrained weights from Hugging Face hub (timm/tf_efficientnetv2_s.in21k_ft_in1k)
INFO - 2024-07-30 10:26:22,488 - _hub - [timm/tf_efficientnetv2_s.in21k_ft_in1k] Safe alternative available for 'pytorch_model.bin' (as 'model.safetensors'). Loading weights using safetensors.
WARNING - 2024-07-30 10:26:22,574 - _builder - Unexpected keys (bn2.bias, bn2.num_batches_tracked, bn2.running_mean, bn2.running_var, bn2.weight, classifier.bias, classifier.weight, conv_head.weight) found while loading pretrained weights. This may be expected if model is being adapted.
################################################################################
########################## Using FeatureVolumeManager ##########################
Number of source views: 7
Using all metadata.
Number of channels: [202, 128, 128, 1]
################################################################################

################################################################################
########################## Using FeatureVolumeManager ##########################
Number of source views: 7
Using all metadata.
Number of channels: [202, 128, 128, 1]
################################################################################

################################################################################
######################## Using FastFeatureVolumeManager ########################
Number of source views: 7
Using all metadata.
Number of channels: [202, 128, 128, 1]
################################################################################

0%| | 0/562 [00:04<?, ?it/s]
0%| | 0/2 [00:04<?, ?it/s]
Traceback (most recent call last):
  File "/home/aime/bliu_workspace/src/simplerecon/test.py", line 473, in <module>
    main(opts)
  File "/home/aime/bliu_workspace/src/simplerecon/test.py", line 270, in main
    outputs = model(
  File "/home/aime/miniconda3/envs/simplerecon/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1102, in _call_impl
    return forward_call(*input, **kwargs)
  File "/home/aime/bliu_workspace/src/simplerecon/experiment_modules/depth_model.py", line 328, in forward
    src_cam_T_cur_cam = src_cam_T_world @ cur_world_T_cam.unsqueeze(1)
RuntimeError: CUDA error: CUBLAS_STATUS_NOT_SUPPORTED when calling cublasSgemmStridedBatched( handle, opa, opb, m, n, k, &alpha, a, lda, stridea, b, ldb, strideb, &beta, c, ldc, stridec, num_batches)
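
The failing line is a broadcast batched matrix multiply of 4x4 pose matrices, so it may be possible to reproduce the error outside the model. The following is a minimal sketch under assumptions: the tensor shapes (batch size 2 and 7 source views, taken from the options and log above) and the use of identity matrices are guesses, and `batched_pose_matmul` is a hypothetical helper, not code from the repository. On a GPU, a float32 batched matmul like this is expected to go through the same cuBLAS batched GEMM path.

```python
try:
    import torch
except ImportError:  # PyTorch is not installed in this environment
    torch = None


def batched_pose_matmul(device="cpu", batch_size=2, num_src_views=7):
    """Mimic `src_cam_T_world @ cur_world_T_cam.unsqueeze(1)` with identity poses."""
    # Assumed shapes: source poses (B, V, 4, 4), current pose (B, 4, 4).
    src_cam_T_world = torch.eye(4, device=device).repeat(batch_size, num_src_views, 1, 1)
    cur_world_T_cam = torch.eye(4, device=device).repeat(batch_size, 1, 1)
    # unsqueeze(1) gives (B, 1, 4, 4), which broadcasts against (B, V, 4, 4).
    return src_cam_T_world @ cur_world_T_cam.unsqueeze(1)


if __name__ == "__main__":
    if torch is None:
        print("PyTorch is not installed")
    else:
        devices = ["cpu"] + (["cuda"] if torch.cuda.is_available() else [])
        for device in devices:
            out = batched_pose_matmul(device)
            if device == "cuda":
                # CUDA calls are asynchronous; synchronize so errors surface here.
                torch.cuda.synchronize()
            print(device, tuple(out.shape))
```

If this small script raises the same `CUBLAS_STATUS_NOT_SUPPORTED` error on `cuda` while succeeding on `cpu`, that would suggest the problem lies in the PyTorch/CUDA installation rather than in the model code.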


liu83 commented Jul 30, 2024

My PC runs Ubuntu 20.04.6 LTS.
