|
|
|
|
|
|
|
|
|
|
|
Overview. Our approach combines a non-sequential autoregressive prior for 3D shapes with task-specific conditionals to generate multiple plausible and high-quality shapes consistent with input conditioning. We show the efficacy of our approach across diverse tasks such as (Left) shape completion, (Middle) single-view reconstruction and (Right) language-guided generation. |
Input
Multimodal Shape Completion
Input
Multimodal Shape Completion
|
Input Image Ground Truth Ours ResNet2Voxel ResNet2SDF Pix2Vox |
Input Image
Multimodal Shape Completion
Input Image
Multimodal Shape Completion
|
thin legs, thin arms | Stool, has a square floor mount | No holes in arms? |
no arm rest | kitchen chair | tall thinest legs |
cup shaped | lawn chair, two slats | Most ornate, rounded back with design |
Ours | JE | T2S |
curved looking one with four legs | ||
Ours | JE | T2S |
tall narrow back | ||
Ours | JE | T2S |
Single leg on square base | ||
Ours | JE | T2S |
Wide seat with armrest |
ShapeNet |
Ground-truth Reconstruction Ground-truth Reconstruction Ground-truth Reconstruction |
Pix3D |