|
|
|
|
|
|
|
|
|
|
|
Overview. Our approach combines a non-sequential autoregressive prior for 3D shapes with task-specific conditionals to generate multiple plausible and high-quality shapes consistent with input conditioning. We show the efficacy of our approach across diverse tasks such as (Left) shape completion, (Middle) single-view reconstruction and (Right) language-guided generation. |
|
|
|
|
|
Input
Multimodal Shape Completion
Input
Multimodal Shape Completion
|
|
|
|
|
|
Input Image
Ground Truth
Ours
ResNet2Voxel
ResNet2SDF
Pix2Vox |
|
Input Image
Multimodal Shape Completion
Input Image
Multimodal Shape Completion
|
|
|
|
| thin legs, thin arms | Stool, has a square floor mount | No holes in arms? |
|
|
|
| no arm rest | kitchen chair | tall thinest legs |
|
|
|
| cup shaped | lawn chair, two slats | Most ornate, rounded back with design |
|
|
|
| Ours | JE | T2S |
| curved looking one with four legs |
|
|
|
| Ours | JE | T2S |
| tall narrow back |
|
|
|
| Ours | JE | T2S |
| Single leg on square base |
|
|
|
| Ours | JE | T2S |
| Wide seat with armrest |
|
|
|
| ShapeNet |
Ground-truth
Reconstruction
Ground-truth
Reconstruction
Ground-truth
Reconstruction |
| Pix3D |
|
|
|