Could you explain how you deal with distractor points included in the initial Gaussian point cloud?
I'm trying to reproduce your code. When processing real-world captured scenes, I found that when using the SfM point cloud obtained from COLMAP sparse reconstruction, if there are similar distractors in adjacent views, some points from these distractors may pass the SfM consistency check. As a result, the initialization point cloud for static Gaussians will contain points originating from distractors. This leads to artifacts in the static scene, as seen in the "spot" scene from the NeRFOnTheGo dataset.
Simple downsampling or random initialization of the point cloud doesn't resolve this issue.
How did you address it? This seems to significantly impact your method's final results.