Choose a task to start voting
Each match shows two 3D objects from different VLMs given the same brief. Pick the one that better matches.
Mode A
Text → 3D
The model received only a text description. Vote on which output better matches the words.
Mode B
Image → 3D
The model saw reference images. Vote on which output better matches the pictures.
Or see the live ratings on the Leaderboard.