Abstract
Shallow depth-of-field is commonly used by photographers to isolate a subject from a distracting background. However, standard cell phone cameras cannot produce such images optically, as their short focal lengths and small apertures capture nearly all-in-focus images. We present a system to computationally synthesize shallow depth-of-field images with a single mobile camera and a single button press. If the image is of a person, we use a person segmentation network to separate the person and their accessories from the background. If available, we also use dense dual-pixel auto-focus hardware, effectively a 2-sample light field with an approximately 1 millimeter baseline, to compute a dense depth map. These two signals are combined and used to render a defocused image. Our system can process a 5.4 megapixel image in 4 seconds on a mobile phone, is fully automatic, and is robust enough to be used by non-experts. The modular nature of our system allows it to degrade naturally in the absence of a dual-pixel sensor or a human subject.