A Deep Patent Landscaping Model using Transformer and Graph Embedding

Abstract

Patent landscaping is a method for finding related patents during a research and development (R&D) project. To avoid the risk of patent infringement and to follow current trends in technology development, patent landscaping is a crucial task that needs to be conducted during the early stages of an R&D project. Because the process requires several advanced resources and can be tedious, the demand for automated patent landscaping is gradually increasing. However, the shortage of well-defined benchmarking datasets and comparable models makes it difficult to find related research studies. In this paper, we propose an automated patent landscaping model based on deep learning. The proposed model comprises a modified transformer structure for analyzing the textual data in patent documents and a graph embedding method based on diffusion graphs, called Diff2Vec, for analyzing patent metadata. We also propose four patent landscaping benchmarking datasets, produced by querying Google BigQuery with search formulas written by a Korean patent attorney, for comparing related research studies. The results indicate that the proposed model, evaluated on these datasets, attains state-of-the-art performance compared with current patent landscaping models.
