You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -283,7 +283,7 @@ Built-in example projects for testing and demonstration:
283
283
## Privacy and Security
284
284
285
285
-**All computation is local.** No data is uploaded to external services.
286
-
- External lookups (Crossref, OpenAlex, NCBI) only query public identifiers (DOI, PMID) and can be disabled with `--no-external-lookups`.
286
+
- External lookups (Crossref, OpenAlex, PubPeer, NCBI) only query public identifiers (DOI, PMID) and can be disabled with `--no-external-lookups`.
287
287
- Code reruns execute in temporary project copies with timeouts and minimal environment variables. This is not a strong security sandbox — treat unknown code accordingly.
288
288
- SHA-256 provenance ledgers are append-only and never transmit file contents.
The suite covers raw data rules, including digit distribution, high-similarity rows/columns, column relationships, rare categories, and ordinal concentration; summary-stat crosscheck; R scrutiny; R statcheck; R rsprite2; p-value collection checks; reference parsing; external metadata lookup; citation claim extraction; papermill light/network signals; image duplicate/copy-move/metadata review; code scan/rerun; unsupported code recording; data trace crosscheck; provenance record/verify; and local corpus screening.
38
38
39
-
Network coverage uses `inputs/project_external` and expects evidence from Crossref, OpenAlex, and NCBI. Network failures should be interpreted separately from detector regressions because external APIs can be unavailable, rate-limited, or return changed metadata.
39
+
Network coverage uses `inputs/project_external` and expects evidence from Crossref, OpenAlex, PubPeer, and NCBI. Network failures should be interpreted separately from detector regressions because external APIs can be unavailable, rate-limited, require credentials, or return changed metadata.
Copy file name to clipboardExpand all lines: benchmark/BENCHMARK_REPORT.md
+14-14Lines changed: 14 additions & 14 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,7 +16,7 @@ Conclusion: The core detection pipeline is stably covered by automated benchmark
16
16
- Raw data: Covers duplicate/highly similar rows and columns, fixed steps, high-frequency values, missing-concentrated-by-group, terminal digit distribution, inter-column relationships, and non-continuous variable anomalies; clean controls maintain 0 risk signals.
17
17
- Summary statistics: Covers SE/SD/N, CI, percent/count, p/t/df, p-value domain, and R scrutiny/SPRITE feasibility checks.
18
18
- In-text statistics: Covers R statcheck p-value consistency checks on APA/NHST expressions.
19
-
- Literature & network: Covers DOI/PMID parsing, Crossref/OpenAlex/NCBI metadata queries, and citation claim extraction.
19
+
- Literature & network: Covers DOI/PMID parsing, Crossref/OpenAlex/PubPeer/NCBI metadata queries, and citation claim extraction.
20
20
- Images: Covers image discovery, internal duplicates, local copy-move, metadata quality, and Western blot/gel review checklist.
21
21
- Code & project: Covers Python/R script reruns, Stata/SPSS/SAS read-only prompts, cross-material data reconciliation, project manifest, provenance version chain, and local corpus screening.
22
22
@@ -36,18 +36,18 @@ Not executed (--no-network used).
36
36
37
37
| Case | Type | Pass | Seconds | Risk Signals | Info | Missing Tools | Missing Checks |
@@ -93,5 +93,5 @@ Not executed (--no-network used).
93
93
## Interpretation Boundaries
94
94
95
95
The high/medium/low levels in this report are benchmark risk signals, not conclusions of academic misconduct, fabrication, or fraud. `info` records are run statuses, dependency states, skip reasons, or coverage notes; they do not count toward risk conclusions.
96
-
Network test cases depend on real-time availability, certificate chains, and rate limiting of Crossref, OpenAlex, and NCBI. If network cases fail, first check HTTP/SSL/rate-limit information in evidence before concluding it is a detector regression.
96
+
Network test cases depend on real-time availability, certificate chains, credentials, and rate limiting of Crossref, OpenAlex, PubPeer, and NCBI. If network cases fail, first check HTTP/SSL/rate-limit information in evidence before concluding it is a detector regression.
97
97
All weak-signal tools are only for surfacing human review directions. Final review should return to original data, scripts, image source files, literature metadata, and audit logs.
Copy file name to clipboardExpand all lines: benchmark/reports/pcr.benchmark_summary.md
+14-14Lines changed: 14 additions & 14 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,7 +16,7 @@ Conclusion: The core detection pipeline is stably covered by automated benchmark
16
16
- Raw data: Covers duplicate/highly similar rows and columns, fixed steps, high-frequency values, missing-concentrated-by-group, terminal digit distribution, inter-column relationships, and non-continuous variable anomalies; clean controls maintain 0 risk signals.
17
17
- Summary statistics: Covers SE/SD/N, CI, percent/count, p/t/df, p-value domain, and R scrutiny/SPRITE feasibility checks.
18
18
- In-text statistics: Covers R statcheck p-value consistency checks on APA/NHST expressions.
19
-
- Literature & network: Covers DOI/PMID parsing, Crossref/OpenAlex/NCBI metadata queries, and citation claim extraction.
19
+
- Literature & network: Covers DOI/PMID parsing, Crossref/OpenAlex/PubPeer/NCBI metadata queries, and citation claim extraction.
20
20
- Images: Covers image discovery, internal duplicates, local copy-move, metadata quality, and Western blot/gel review checklist.
21
21
- Code & project: Covers Python/R script reruns, Stata/SPSS/SAS read-only prompts, cross-material data reconciliation, project manifest, provenance version chain, and local corpus screening.
22
22
@@ -36,18 +36,18 @@ Not executed (--no-network used).
36
36
37
37
| Case | Type | Pass | Seconds | Risk Signals | Info | Missing Tools | Missing Checks |
@@ -93,5 +93,5 @@ Not executed (--no-network used).
93
93
## Interpretation Boundaries
94
94
95
95
The high/medium/low levels in this report are benchmark risk signals, not conclusions of academic misconduct, fabrication, or fraud. `info` records are run statuses, dependency states, skip reasons, or coverage notes; they do not count toward risk conclusions.
96
-
Network test cases depend on real-time availability, certificate chains, and rate limiting of Crossref, OpenAlex, and NCBI. If network cases fail, first check HTTP/SSL/rate-limit information in evidence before concluding it is a detector regression.
96
+
Network test cases depend on real-time availability, certificate chains, credentials, and rate limiting of Crossref, OpenAlex, PubPeer, and NCBI. If network cases fail, first check HTTP/SSL/rate-limit information in evidence before concluding it is a detector regression.
97
97
All weak-signal tools are only for surfacing human review directions. Final review should return to original data, scripts, image source files, literature metadata, and audit logs.
0 commit comments