Published July 7, 2024 | Version v1
Dataset Open

Integrating big data with KNIME as an alternative without programming code: an application to the PATSTAT patent database [Data set]

Description

Dataset accompanying the publication "Integrating big data with KNIME as an alternative without programming code: an application to the PATSTAT patent database ". Accessing massive datasets can be challenging for users unfamiliar with programming codes. Combining Konstanz Information Miner (KNIME) and MySQL tools on standard configuration equipment allows for addressing this issue. This research proposal aims to present a methodology that describes the necessary configuration steps in both tools and the required manipulation in KNIME to transmit the information to the MySQL environment for further processing in a database management system (DBMS). In addition, we propose a procedure so that the use of this point-and-click software in research work can gain in reproducibility and, therefore, in credibility in the scientific community. To achieve this, we will use a big database regarding patent applications as a reference, the PATSTAT Global 2023, provided by the European Patent Office (EPO). As well known, patent data can be a valuable source for understanding innovation dynamics and technological trends, whether for studies on companies, sectors, nations or even regions, at aggregated and disaggregated levels.

Other

How to cite the database (APA style): Taques, F.H.; Chasco, C. & Taques, F. (2024) Integrating big data with KNIME as an alternative without programming code: an application to the PATSTAT patent database [Data set] (doi: 10.23728/b2share.645943e855924aa299a2e2dc873ce530) Source: Taques, F.H.; Chasco, C. & Taques, F. (2024) Integrating big data with KNIME as an alternative without programming code: an application to the PATSTAT patent database. Journal of Geographical Systems (doi: 10.1007/s10109-024-00445-0).

Files

TLS201_part01_modif.csv.zip

Files (44.5 MB)

Name Size Download all
Checksum: md5:9eb37e66ff276e57555e9c48c9cbae87

PID: http://hdl.handle.net/11304/d61a4537-f4c6-42cd-b551-8c32f2f86747
44.5 MB Preview Download
Checksum: md5:2dabbbd040d3410c2cfd3e9d862f8f79

PID: http://hdl.handle.net/11304/7b5a78a5-493d-4377-95f6-e309f91e21b2
11.6 kB Download

Additional details

Identifiers

B2SHARE Legacy Record ID
645943e855924aa299a2e2dc873ce530

Funding

Regional Studies Association (RSA)
Small Grant Scheme on Pandemics, Cities, Regions, and Industry

Temporal Coverage

Spans:

Span:2000-2022