Skip to content

Nanozymes – Catalytic Properties of Nanozyme Materials


Original Data

Title: Nanozymes dataset – Comprehensive information about nanozyme materials and their catalytic activity
Description: This dataset captures essential experimental information on nanozymes, including their chemical composition, structural and morphological properties, catalytic behavior, and reaction conditions. Data was manually extracted from scientific publications, with each row representing a unique experimental setup or measurement. The dataset supports the exploration of structure–activity relationships and benchmarking of nanozyme performance under varying conditions.
Total number of records: 1135
Number of features (columns): 27
Data type: Mixed
Application: Nanomaterials
Automatic validation: No


Data Scheme

Nanozymes Dataset – Column Descriptions

Column Name Description
formula Chemical formula of the nanozyme (e.g., Fe₃O₄, CeO₂)
activity Catalytic activity type (e.g., peroxidase, oxidase)
syngony Crystal system (e.g., cubic, hexagonal)
length Nanoparticle length in nm
width Width in nm
depth Depth or thickness in nm
surface Surface functionalization or modification (e.g., PEG, PVP)
km_value Michaelis constant (Km)
km_unit Unit for Km value (e.g., mM, µM)
vmax_value Maximum rate of reaction (Vmax)
vmax_unit Unit for Vmax value (e.g., µmol/min, mM/s)
target_source Source of target activity value in publication
reaction_type Substrate and co-substrate used in the reaction (e.g., TMB + H₂O₂)
c_min Minimum substrate concentration (in mM)
c_max Maximum substrate concentration (in mM)
c_const Constant concentration of co-substrate
c_const_unit Unit of co-substrate concentration (e.g., mM, µM)
ccat_value Catalyst (nanozyme) concentration used
ccat_unit Unit of catalyst concentration (e.g., mg/mL)
ph pH at which reaction was carried out
temperature Temperature in Celsius
doi DOI of the source article
pdf PDF filename in the dataset
access Access status (1 = open access, 0 = closed access)
title Title of the source publication
journal Journal name
year Year of publication

Metadata

Column Name Description
journal Name of the journal
title Title of the original article
doi Digital Object Identifier of the article
year Year of publication
access Access status (1 = Open Access, 0 = Closed Access)
pdf Filename of the source article's PDF in the archive

Key Notes

  • Units: All concentration- and activity-related units are embedded directly in column names or represented in separate "unit" columns
  • Surface: If blank, indicates no surface modification or not reported
  • Missing values: Parameters not provided in an article are left blank in the dataset
  • Each row corresponds to a distinct experiment (not just a unique article)
  • Dataset is intended for benchmarking and structure–activity analysis of nanozymes

Dataset Description

The Nanozymes dataset aggregates detailed experimental measurements related to the catalytic properties of synthetic enzyme-mimicking nanoparticles (nanozymes). These materials mimic natural enzymes and are used in biosensing, environmental detection, and therapeutic applications.

For each experiment, the dataset captures:

  • Composition and structure: including chemical formula and crystal system (syngony)
  • Morphology: dimensions of the particles (length, width, depth)
  • Surface chemistry: coating or functional groups, if present
  • Catalytic performance: Michaelis constant (Km), maximum reaction rate (Vmax), type of catalytic reaction
  • Reaction conditions: substrate concentrations, catalyst concentration, pH, temperature

The dataset supports comprehensive benchmarking of nanozyme activity and deeper investigation into structure–function relationships. It also facilitates machine learning and data-driven modeling in nanozyme design.


Validation Results

The Nanozymes dataset underwent 439 corrections across 1,135 rows and 21 columns, comprising 398 pattern-based and 41 isolated issues. The most frequently affected fields were syngony, formula, and temperature. Common errors included inferred values not explicitly stated in the source material—particularly crystal symmetry—along with formatting inconsistencies in chemical formulas and unit reporting. Many corrections followed recurring patterns within individual articles, allowing reliable extrapolation across related entries.