Methods Documentation
Protocols, procedures, methodology descriptions
Justification
Methods and protocols enable reproducibility. schema.org uniquely recommends measurementTechnique for dataset descriptions. The RDA model rates provenance information as Important (RDA-R1.2-01M). The distinct signal recognizes that methods documentation goes beyond a general abstract.
Practical Guide
Document methods. Near-universal in domain repositories.
Methods and protocols are rarely provided as structured metadata in general-purpose repositories like Zenodo (0.2%), but they're near-universal in domain-specific repositories where compliance is enforced (OpenNeuro BIDS, GEO MIAME). The 0.39x citation ratio on Zenodo reflects low adoption, not low value. Where methods are standard, they dramatically improve reuse.
Why this signal matters despite the numbers
The 0.39x citation ratio on Zenodo reflects the signal's rarity (0.2%), not its value. In domain repositories where methods documentation is enforced (OpenNeuro, GEO), it's near-universal and strongly associated with reuse.
For Repositories
- Add structured methods/protocol fields for domain-specific submissions
- For general repos: encourage methods description in the abstract
- Map to schema.org measurementTechnique for Google Dataset Search
For Depositors
- Describe your methods in the dataset description if no structured field exists
- Link to published protocols (protocols.io, methods papers) via related identifiers
- For domain repos: follow the required methods standard (BIDS, MIAME, etc.)
High value in domain repositories with method standards. General-purpose repos lack structured methods fields.
Standards Sources
Convergence score: 2/4 independent sources —
| Standard | Field / Property | Obligation Level |
|---|---|---|
| schema.org | measurementTechnique | Recommended |
| RDA FAIR | RDA-R1.2-01M | Important |
FAIR Principle Alignment
Primary mapping: Reusable (R1.2)
- R1.2: (Meta)data are associated with detailed provenance
RDA FAIR Data Maturity Model Indicators:
- RDA-R1.2-01M: Metadata includes provenance information according to community-specific standards
How This Signal Is Measured
Presence of methods, protocols, or procedures description in metadata. Binary: substantive methods text present or absent.
Empirical Evidence (Zenodo, n=1.3M)
Per-signal statistics use Zenodo as the primary validation source because it is the largest general-purpose repository with structured DataCite metadata, natural variance across all 25 signals, and available citation/usage data. Domain-specific repositories exhibit ceiling effects or restricted variance that preclude per-signal discrimination. Cross-repository validation is reported separately.
Prevalence
0.2%
of Zenodo datasets
Citation Lift
0.4x
vs. datasets without
Data Source
Zenodo (CERN)
1,328,100 records analyzed
Interpretation: Methods documentation is rarely provided as structured metadata in Zenodo. However, when present in domain-specific repositories (OpenNeuro BIDS, GEO MIAME), it is near-universal and strongly associated with reuse.
Quantitative Evidence
Scoring Formula
methods_documentation ∈ record → 4 pts
Contribution: 4 of 100 points · Harmonization bucket (0–20)
With Signal Present
2,583
datasets (0.2%)
μ = 0.096 citations/dataset
Without Signal
1,325,517
datasets (99.8%)
μ = 0.244 citations/dataset
Rate Ratio
0.39
95% CI: [0.35–0.45]
P-value
< 0.001
z = -14.68
Significance
Method: Poisson rate ratio · Source: Zenodo (n = 1,328,100)
Note: Rarely provided as structured metadata in Zenodo (0.2%). Near-universal in domain repositories: OpenNeuro (BIDS protocols), GEO (MIAME methods).
H — Harmonization Bucket
All signals in this bucket: