Abstract
We propose a novel dialogue modeling framework that uses binary hashcodes as compressed text representations, allowing for efficient similarity search. We also propose a novel lower bound on the mutual information between the hashcodes of the two dialogue agents, which serves as a model selection criterion for optimizing those representations towards better alignment between the dialogue participants and higher predictability of one response from another, facilitating better dialogue generation. Empirical evaluation on several datasets, from depression therapy sessions to Larry King TV show interviews and Twitter data, demonstrates that our hashing-based approach is competitive with state-of-the-art neural network-based dialogue generation systems, often significantly outperforming them in terms of response quality and computational efficiency, especially on relatively small datasets.