Abstract
We present a lightweight network that infers grouping and boundaries,including curves, corners and junctions. It operates in a bottom-up fashion,analogous to classical methods for sub-pixel edge localization andedge-linking, but with a higher-dimensional representation of local boundarystructure, and notions of local scale and spatial consistency that are learnedinstead of designed. Our network uses a mechanism that we call boundaryattention: a geometry-aware local attention operation that, when applieddensely and repeatedly, progressively refines a pixel-resolution field ofvariables that specify the boundary structure in every overlapping patch withinan image. Unlike many edge detectors that produce rasterized binary edge maps,our model provides a rich, unrasterized representation of the geometricstructure in every local region. We find that its intentional geometric biasallows it to be trained on simple synthetic shapes and then generalize toextracting boundaries from noisy low-light photographs.