Related Publication
Linked journal articles, preprints, conference papers
Justification
Linking datasets to publications creates bidirectional discovery. DataCite Recommends RelatedIdentifier. Dublin Core includes Relation. schema.org includes citation. RDA-I3-01M (Important) requires qualified references. Four sources converge.
Practical Guide
Link to your paper. 16.5x citation lift — the highest-impact engagement signal.
Linking datasets to publications creates bidirectional discovery and is the highest-impact engagement signal. Datasets with publication links receive 16.5x more citations (RR = 16.46, p < 0.001). With 54% prevalence on Zenodo, nearly half of datasets still miss this connection. Four standards converge on this signal — DataCite, Dublin Core, schema.org, and RDA.
For Repositories
- Add a prominent "related publication" field during deposit
- Auto-detect related publications from DOI metadata
- Map to DataCite #12 RelatedIdentifier with IsSupplementTo relation type
For Depositors
- Always link your dataset to its associated publication via DOI
- Add the link at deposit time — don't wait for the paper to publish (use preprint DOI)
- Link to all publications that used or describe the dataset
Highest-impact engagement signal (16.5x lift). Four standards converge. Easy to implement — just add a DOI link.
Standards Sources
Convergence score: 4/4 independent sources —
| Standard | Field / Property | Obligation Level |
|---|---|---|
| DataCite 4.6 | #12 RelatedIdentifier (IsSupplementTo) | Recommended |
| Dublin Core | Relation | Core Element |
| schema.org | citation | Recommended |
FAIR Principle Alignment
Primary mapping: Interoperable (I3)
- I3: (Meta)data include qualified references to other (meta)data
RDA FAIR Data Maturity Model Indicators:
- RDA-I3-01M: Metadata includes references to other metadata
How This Signal Is Measured
Presence of DOI/PMID link to a related publication. Binary: at least one publication linked.
Empirical Evidence (Zenodo, n=1.3M)
Per-signal statistics use Zenodo as the primary validation source because it is the largest general-purpose repository with structured DataCite metadata, natural variance across all 25 signals, and available citation/usage data. Domain-specific repositories exhibit ceiling effects or restricted variance that preclude per-signal discrimination. Cross-repository validation is reported separately.
Prevalence
54.3%
of Zenodo datasets
Citation Lift
16.2x
vs. datasets without
Data Source
Zenodo (CERN)
1,328,100 records analyzed
Interpretation: The highest-impact engagement signal. Datasets linked to publications receive 16.2x more citations. This validates the SHARE framework's emphasis on cross-resource linking — publication links create bidirectional discovery paths.
Quantitative Evidence
Scoring Formula
related_publication_doi ∈ record → 4 pts
Contribution: 4 of 100 points · Engagement bucket (0–20)
With Signal Present
720,512
datasets (54.3%)
μ = 0.428 citations/dataset
Without Signal
607,588
datasets (45.7%)
μ = 0.026 citations/dataset
Rate Ratio
16.46
95% CI: [16.20–16.73]
P-value
< 0.001
z = 343.37
Significance
Method: Poisson rate ratio · Source: Zenodo (n = 1,328,100)
E — Engagement Bucket
All signals in this bucket: