Anyone can easily create browser-accessible photorealistic 3D scenes without code
DiverseShot is a local AI graphics client that allows you to easily create photorealistic 3D scenes and objects from videos and photos in minutes. DiverseShot uses a technique called "3D Gaussian Splatting." The 3D files produced by DiverseShot differ from traditional 3D file formats, as they do not include concepts like mesh or textures. Instead, they support display expressions suitable for opening and sharing in a browser.
As a non-graphics researcher, What can I do with DiverseShot?
To perform 3D reconstruction with DiverseShot, you need to capture a video of the Object from multiple angles (or take several photos from different angles). The recording device can be a phone or a camera. The more diverse the angles captured in the video, the better the training results.
Attention⚠️: There should be no dynamic objects in the video to be trained, otherwise it will affect the reconstruction effect
After successfully downloading and installing the client, the initial run will auto-download some libraries necessary for machine learning.
After download libraries,the user interface appears below ⬇️. Users need to create a project when using DiverseShot. If using the Windows client, and if the user does not customize the project's save location, project files will be saved by default in the "projects" folder📁 within the installation directory.
Drag the video (or images) into the project to start splat training. By default, the output location of the training results is the same as the current folder 📁 where the video is located, but users can customize the output location.
After starting splat training, the client will extract images from the video and reconstruct the camera 📷 trajectory. Upon completing ✅ the camera trajectory reconstruction, machine learning will generate the splat file. An "Entity" will be created in the hierarchy on the right. Selecting this entity will display its parameters and iteration count. The client remains in training mode until the training limit is reached. At the top🔝 of the interface, there is a pause ⏸️ button to pause the training.
During the filming 🎬 process, it is inevitable that background elements outside the main Object will be captured.
These background elements will be included in the machine learning process and reconstructed, using up training resources and time. After pausing the training, you can modify the splat file📃 in two ways:
There are two ways to export the 3D files📃 obtained after training:
We recommend using Spline as your final display platform. Spline is a free 3D design software with real-time collaboration to create web interactive experiences in the browser. It allowed user import .ply files and perform secondary editing such as 'Apply Cropping' / 'Make animation'.
Apart from Spline, if you have some programming skills, you can use the open-source WebGL library PlayCanvas to deploy a webpage for sharing .ply files. If you are working on a project using game engines like Unity or Unreal Engine (UE), you can also import .ply files into your project using the following plugins: 3D Gaussians Plugin for ue / UnityGaussianSplatting