What Makes Good Synthetic Training Data for Learning Disparity and Optical Flow Estimation?

International Journal of Computer Vision, 126(9): 942-960, 2018
Abstract: The finding that very large networks can be trained efficiently and reliably has led to a paradigm shift in computer vision from engineered solutions to learning formulations. As a result, the research challenge shifts from devising algorithms to creating suitable and abundant training data for supervised learning. How to efficiently create such training data? The dominant data acquisition method in visual recognition is based on web data and manual annotation. Yet, for many computer vision problems, such as stereo or optical flow estimation, this approach is not feasible because humans cannot manually enter a pixel-accurate flow field. In this paper, we promote the use of synthetically generated data for the purpose of training deep networks on such tasks. We suggest multiple ways to generate such data and evaluate the influence of dataset properties on the performance and generalization properties of the resulting networks. We also demonstrate the benefit of learning schedules that use different types of data at selected stages of the training process.
Paper Publisher's link Project page

See also

  • Online-published PDF version available here.

BibTex reference

  author       = "N. Mayer and E. Ilg and P. Fischer and C. Hazirbas and D. Cremers and A. Dosovitskiy and T. Brox",
  title        = "What Makes Good Synthetic Training Data for Learning Disparity and Optical Flow Estimation?",
  journal      = "International Journal of Computer Vision",
  number       = "9",
  volume       = "126",
  pages        = "942-960",
  month        = " ",
  year         = "2018",
  note         = "https://arxiv.org/abs/1801.06397",
  url          = "http://lmb.informatik.uni-freiburg.de/Publications/2018/MIFDB18"

Other publications in the database