Multi-view Consistency as Supervisory Signal for Learning Shape and Pose Prediction

Shubham Tulsiani; Alexei A. Efros; Jitendra Malik

doi:10.48550/arxiv.1801.03910

Abstract

We present a framework for learning single-view shape and pose prediction without using direct supervision for either. Our approach allows leveraging multi-view observations from unknown poses as supervisory signal during training. Our proposed training setup enforces geometric consistency between the independently predicted shape and pose from two views of the same instance. We consequently learn to predict shape in an emergent canonical (view-agnostic) frame along with a corresponding pose predictor. We show empirical and qualitative results using the ShapeNet dataset and observe encouragingly competitive performance to previous techniques which rely on stronger forms of supervision. We also demonstrate the applicability of our framework in a realistic setting which is beyond the scope of existing techniques: using a training dataset comprised of online product images where the underlying shape and pose are unknown.
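
The consistency idea in the abstract can be illustrated with a small toy sketch. This is not the authors' implementation: it assumes a point-cloud shape, a pose reduced to a single azimuth angle, an orthographic silhouette renderer, and a squared-error loss, none of which are stated in the abstract. The names `rotation_y`, `render_mask`, and `consistency_loss` are hypothetical, and no learned predictors are involved; the arrays simply stand in for what a shape predictor (from one view) and a pose predictor (from another view) might output.

```python
import numpy as np

def rotation_y(azimuth):
    """Rotation matrix about the vertical (y) axis; azimuth in radians."""
    c, s = np.cos(azimuth), np.sin(azimuth)
    return np.array([[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]])

def render_mask(points, azimuth, res=16):
    """Orthographic silhouette of points (coordinates in [-1, 1]) seen at `azimuth`.

    Assumption: a binary mask is the observation; the abstract does not say so.
    """
    cam = points @ rotation_y(azimuth).T
    # Map x, y from [-1, 1] to pixel indices and drop the depth coordinate.
    ij = np.clip(((cam[:, :2] + 1.0) * 0.5 * res).astype(int), 0, res - 1)
    mask = np.zeros((res, res))
    mask[ij[:, 1], ij[:, 0]] = 1.0
    return mask

def consistency_loss(shape_from_view_a, pose_from_view_b, mask_b):
    """Mismatch between view B's mask and the view-A shape rendered at view B's pose."""
    rendered = render_mask(shape_from_view_a, pose_from_view_b)
    return float(np.mean((rendered - mask_b) ** 2))

# Toy instance: a box elongated along x, standing in for a canonical-frame shape.
rng = np.random.default_rng(0)
shape = rng.uniform(-1, 1, size=(2000, 3)) * np.array([0.8, 0.2, 0.2])

# View B is observed from a pose that is unknown at training time (here pi / 2).
true_pose = np.pi / 2
mask_b = render_mask(shape, true_pose)

loss_good = consistency_loss(shape, true_pose, mask_b)  # pose agrees with view B
loss_bad = consistency_loss(shape, 0.0, mask_b)         # pose disagrees with view B
print(loss_good, loss_bad)
```

In this toy setting the loss vanishes only when the shape and the pose jointly explain the second view, which is the kind of signal that could supervise both predictors without ground-truth shape or pose. A real training setup would need a differentiable renderer and learned networks in place of these fixed arrays.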
