This skill enables AI agents to help you search NCBI databases using Biopython's Entrez module. It covers keyword searches, database exploration, and query building.
pip install biopythonTell your AI agent what you want to do:
- "Search PubMed for articles about CRISPR from 2024"
- "Find all human BRCA1 mRNA sequences in GenBank"
- "How many records mention 'machine learning' across all NCBI databases?"
- "What fields can I search in the protein database?"
"Search the nucleotide database for human insulin sequences"
"Find PubMed articles about COVID-19 published in the last 30 days"
"Search for mouse OR rat sequences of the TP53 gene longer than 1000 bp"
"List all available NCBI databases and their record counts"
"What search fields are available in the SRA database?"
- Import Bio.Entrez and set up email authentication
- Build the appropriate search query with field tags
- Call esearch(), einfo(), or egquery() as needed
- Parse and return the results
- Explain query translations if the search returns unexpected results
- pubmed - Biomedical literature
- nucleotide - DNA/RNA sequences
- protein - Protein sequences
- gene - Gene records
- sra - Sequencing data archives
- taxonomy - Organism taxonomy
- Be specific about which database to search
- Mention if you need all results or just a count
- Specify date ranges when searching for recent publications
- Ask about available fields if you're unsure how to narrow your search
- For very large result sets, mention you need batch downloading