Abstract
We propose a demand estimation method that leverages unstructured text andimage data to infer substitution patterns. Using pre-trained deep learningmodels, we extract embeddings from product images and textual descriptions andincorporate them into a random coefficients logit model. This approach enablesresearchers to estimate demand even when they lack data on product attributesor when consumers value hard-to-quantify attributes, such as visual design orfunctional benefits. Using data from a choice experiment, we show that ourapproach outperforms standard attribute-based models in counterfactualpredictions of consumers' second choices. We also apply it across 40 productcategories on Amazon and consistently find that text and image data helpidentify close substitutes within each category.