Hu, Z., Zhou, X. and Lin, A. orcid.org/0000-0003-2331-083X (2023) Evaluation and identification of potential high-value patents in the field of integrated circuits using a multidimensional patent indicators pre-screening strategy and machine learning approaches. Journal of Informetrics, 17 (2). 101406. ISSN: 1751-1577
Abstract
Early identification of high-value patents has strategic and technological importance to firms, institutions, and governments. This study demonstrates the usefulness of the machine learning (ML) method for automatically evaluating and identifying potential high-value patents. The study collected 31,463 patents in the integrated circuits sector using the DII platform and used them to conduct experiments using five standard ML models. A multidimensional value indicator portfolio was established to measure patents’ legal, technological, competitiveness, and scientific values and construct feature vector space. The portfolio also formed a part of the pre-screening strategy providing a valid positive sample for identifying potential high-value patents. The results suggest that the multidimensional patent indicator portfolio effectively measured patent values. amongst all indicators, patent family size (legal value), first citation speed (technological value), forward citations and extended patent family size (competitiveness value), length of the patent document, non-patent reference count, and patent citation count (scientific value) play a significant informing role in identifying potential high-value patents. The proposed first-citation speed indicator proved valuable for measuring patents’ technological value. The Random Forest model had the best overall performance in classifying and recognizing potential high-value patents(PHVPs) with accuracy and precision rates above 95%.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2023 The Authors. This is an open access article under the CC BY-NC-ND license ( http://creativecommons.org/licenses/by-nc-nd/4.0/ ) |
Keywords: | High-value patents; Zero-cited patent; Machine learning; Integrated circuits; Patent indicator portfolio; Automatic classification; First-citation speed |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Social Sciences (Sheffield) > Information School (Sheffield) ?? Sheffield.IJC ?? |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 19 Sep 2025 15:34 |
Last Modified: | 19 Sep 2025 15:34 |
Status: | Published |
Publisher: | Elsevier BV |
Refereed: | Yes |
Identification Number: | 10.1016/j.joi.2023.101406 |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:231950 |