Abstract
E-commerce platforms rely on structured product descriptions, in the form ofattribute/value pairs to enable features such as faceted product search andproduct comparison. However, vendors on these platforms often provideunstructured product descriptions consisting of a title and a textualdescription. To process such offers, e-commerce platforms must extractattribute/value pairs from the unstructured descriptions. State-of-the-artattribute/value extraction methods based on pre-trained language models (PLMs),such as BERT, face two drawbacks (i) the methods require significant amounts oftask-specific training data and (ii) the fine-tuned models have problems togeneralize to attribute values that were not part of the training data. Weexplore the potential of using large language models (LLMs) as a more trainingdata-efficient and more robust alternative to existing attribute/valueextraction methods. We propose different prompt templates for instructing LLMsabout the target schema of the extraction, covering both zero-shot and few-shotscenarios. In the zero-shot scenario, textual and JSON-based approaches forrepresenting information about the target attributes are compared. In thescenario with training data, we investigate (i) the provision of exampleattribute values, (ii) the selection of in-context demonstrations, (iii)shuffled ensembling to prevent position bias, and (iv) fine-tuning the LLM. Theprompt templates are evaluated in combination with hosted LLMs, such as GPT-3.5and GPT-4, and open-source LLMs based on Llama2 which can be run locally. Thebest average F1-score of 86% was reached by GPT-4 using an ensemble of shuffledprompts that combine attribute names, attribute descriptions, example values,and demonstrations. Given the same amount of training data, this prompt/modelcombination outperforms the best PLM baseline by an average of 6% F1.