Nanozymes – Catalytic Properties of Nanozyme Materials¶
Original Data¶
Title: Nanozymes dataset – Comprehensive information about nanozyme materials and their catalytic activity
Description: This dataset captures essential experimental information on nanozymes, including their chemical composition, structural and morphological properties, catalytic behavior, and reaction conditions. Data was manually extracted from scientific publications, with each row representing a unique experimental setup or measurement. The dataset supports the exploration of structure–activity relationships and benchmarking of nanozyme performance under varying conditions.
Total number of records: 1135
Number of features (columns): 27
Data type: Mixed
Application: Nanomaterials
Automatic validation: No
Data Scheme¶
Nanozymes Dataset – Column Descriptions¶
Column Name | Description |
---|---|
formula | Chemical formula of the nanozyme (e.g., Fe₃O₄, CeO₂) |
activity | Catalytic activity type (e.g., peroxidase, oxidase) |
syngony | Crystal system (e.g., cubic, hexagonal) |
length | Nanoparticle length in nm |
width | Width in nm |
depth | Depth or thickness in nm |
surface | Surface functionalization or modification (e.g., PEG, PVP) |
km_value | Michaelis constant (Km) |
km_unit | Unit for Km value (e.g., mM, µM) |
vmax_value | Maximum rate of reaction (Vmax) |
vmax_unit | Unit for Vmax value (e.g., µmol/min, mM/s) |
target_source | Source of target activity value in publication |
reaction_type | Substrate and co-substrate used in the reaction (e.g., TMB + H₂O₂) |
c_min | Minimum substrate concentration (in mM) |
c_max | Maximum substrate concentration (in mM) |
c_const | Constant concentration of co-substrate |
c_const_unit | Unit of co-substrate concentration (e.g., mM, µM) |
ccat_value | Catalyst (nanozyme) concentration used |
ccat_unit | Unit of catalyst concentration (e.g., mg/mL) |
ph | pH at which reaction was carried out |
temperature | Temperature in Celsius |
doi | DOI of the source article |
PDF filename in the dataset | |
access | Access status (1 = open access, 0 = closed access) |
title | Title of the source publication |
journal | Journal name |
year | Year of publication |
Metadata¶
Column Name | Description |
---|---|
journal | Name of the journal |
title | Title of the original article |
doi | Digital Object Identifier of the article |
year | Year of publication |
access | Access status (1 = Open Access, 0 = Closed Access) |
Filename of the source article's PDF in the archive |
Key Notes
- Units: All concentration- and activity-related units are embedded directly in column names or represented in separate "unit" columns
- Surface: If blank, indicates no surface modification or not reported
- Missing values: Parameters not provided in an article are left blank in the dataset
- Each row corresponds to a distinct experiment (not just a unique article)
- Dataset is intended for benchmarking and structure–activity analysis of nanozymes
Dataset Description¶
The Nanozymes dataset aggregates detailed experimental measurements related to the catalytic properties of synthetic enzyme-mimicking nanoparticles (nanozymes). These materials mimic natural enzymes and are used in biosensing, environmental detection, and therapeutic applications.
For each experiment, the dataset captures:
- Composition and structure: including chemical formula and crystal system (syngony)
- Morphology: dimensions of the particles (length, width, depth)
- Surface chemistry: coating or functional groups, if present
- Catalytic performance: Michaelis constant (Km), maximum reaction rate (Vmax), type of catalytic reaction
- Reaction conditions: substrate concentrations, catalyst concentration, pH, temperature
The dataset supports comprehensive benchmarking of nanozyme activity and deeper investigation into structure–function relationships. It also facilitates machine learning and data-driven modeling in nanozyme design.
Validation Results¶
The Nanozymes dataset underwent 439 corrections across 1,135 rows and 21 columns, comprising 398 pattern-based and 41 isolated issues.
The most frequently affected fields were syngony
, formula
, and temperature
.
Common errors included inferred values not explicitly stated in the source material—particularly crystal symmetry—along with formatting inconsistencies in chemical formulas and unit reporting.
Many corrections followed recurring patterns within individual articles, allowing reliable extrapolation across related entries.