E-mail to: zhouh@cs.uni-freiburg.de

1. Single-image depth estimation usually overfits. How does the proposed method achieve good generalization performance?
2. What datasets are used in this paper? How do the authors obtain the depth ground truth?
3. How do the authors handle different sources of data? What losses do they use?