Skip to content
This repository was archived by the owner on Feb 11, 2026. It is now read-only.
This repository was archived by the owner on Feb 11, 2026. It is now read-only.

Add BioC full text tool to MCP server #213

@turbomam

Description

@turbomam

Problem

The get_full_text_from_bioc function is available as a CLI command but not available in the MCP server (it's commented out in main.py).

Current State

CLI: ✅ Available

uvx --from artl-mcp artl-cli get-full-text-from-bioc --pmid "12345678"

MCP: ❌ Disabled

# main.py line ~264
# mcp.tool(get_full_text_from_bioc)

Impact

  • Inconsistency between CLI and MCP functionality
  • MCP users cannot access BioC XML format full text
  • Documentation incorrectly lists BioC as available source

BioC Format Benefits

BioC (BioCreative) is a structured XML format that:

  • Preserves document structure (sections, paragraphs)
  • Includes annotations and metadata
  • Provides cleaner text than raw PDF extraction
  • Is available from NCBI for many PMC articles

Proposed Solution

  1. Uncomment mcp.tool(get_full_text_from_bioc) in main.py
  2. Test the tool via MCP
  3. Update documentation to show it's available in both MCP and CLI
  4. Add integration test for MCP usage

Related

References

  • NCBI BioC API: https://www.ncbi.nlm.nih.gov/research/bionlp/RESTful/pmcoa.cgi/BioC_xml/{pmid}/ascii

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions