Hi, thanks for the great work!
I've encountered some issues when running the demos. It seems that the number of GPUs should be a factor of the number of objects to avoid bugs in the distributed processing setup.
I also have a question about input point numbers: How does the number of input points influence the performance? In your paper, you mentioned using 100,000 points per shape. Would the performance degrade with fewer points?