Question about input channels and semantic conditioning in MinkUNetDiff

Hi, thank you for open-sourcing this excellent work! The diffusion-based semantic scene completion for point clouds is truly an interesting and promising direction. I have a question regarding the model architecture. While exploring the code, I noticed that in the instantiation of `MinkUNetDiff`, the `in_channels` is set to 3:

```python
self.model = minknet.MinkUNetDiff(in_channels=3, out_channels=self.hparams['model']['out_dim'])
```
Given that the model performs semantic prediction, I was wondering: does the semantic information serve as an additional attribute/condition during the diffusion process, or is it handled differently in the network architecture? I would appreciate any clarification on how the semantic prediction is integrated into the diffusion framework. Looking forward to your insights!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about input channels and semantic conditioning in MinkUNetDiff #1

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Question about input channels and semantic conditioning in MinkUNetDiff #1

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions