Polly Co-Scientist
Polly Co-Scientist is an AI-powered research assistant developed to help scientists transform complex biomedical data into actionable scientific insights with ease. It transforms natural language queries into Cypher commands to interact with Polly's Knowledge Graph, allowing users to explore relationships, simulate biological reasoning, and generate hypotheses, without writing a single line of code.
Data Distribution and Sources Used
Polly KG integrates data from multiple open-source and authoritative biomedical databases. Below is a list of sources and their versions:
DB Name | Source | Version |
---|---|---|
MONDO | MONDO Ontology | v2025-02-04 |
HPO | HPO Ontology | v2025-01-16 |
NCBI | NCBI Download | release-113 |
GO | Gene Ontology | v2025-02-06 |
Reactome | Reactome Data | v91 |
ChEMBL | ChEMBL Downloads | ChEMBL35 |
OpenTargets | OpenTargets Platform - associationByDatatypeDirect: target-disease associations - diseaseToPhenotype: disease-phenotype mappings - diseases: disease names and IDs - indication: clinical trial indications - interaction: target interactions - targets: target info and tractability |
v24.9 |
UniProt | UniProtKB | 2021_03 |
Key Capabilities
- Ease of Use: Designed with a user-friendly interface that simplifies KG exploration.
- Natural Language Interface: Query the Knowledge Graph using simple English, no need to learn Cypher or write Python code.
- Automated Query Conversion: Intelligently translates natural language input into Cypher queries and executes them on the Knowledge Graph.
- Dual Output: View results in both text (summary and Cypher) and graphical (node-edge) formats.
How to Query the Knowledge Graph with Natural Language
Polly Co-Scientist enables you to interact with the biomedical Knowledge Graph using simple, natural language — no technical expertise required.
You can choose from two options:
- Use Predefined Templates: Select from templatized queries available directly within the Polly Co-Scientist interface to quickly explore common biomedical relationships.
- Run Custom Queries: Type your own questions in plain English to retrieve insights tailored to your specific research needs.
Running Custom Queries on the Knowledge Graph Using Natural Language
Step 1: Type Your Query in English
Use the chat interface to enter your research question in natural language.
Example:
Which tissue express the gene PDE4B, and what are the associated TPM median and cPKG score?
Step 2: Review the Auto-Generated Cypher Query
Polly Co-Scientist automatically translates your natural language input into a Cypher query and JSON output. Both are displayed for reference and executed without requiring manual input.
Example:
MATCH (t:Tissue)-[:EXPRESSES]->(g:Gene {name: 'PDE4B'})
RETURN t.name AS tissue_name, t.tpm_median AS tpm_median_value
Step 3: View the Results
The output is generated in 2 parts:
- Text Output: A simple readable summary of the query results is displayed.
Example:
The gene PDE4B is expressed in a wide range of tissues including various glands (adrenal, pituitary, prostate), parts of the brain (amygdala, cerebral cortex, hippocampal formation), and major organs such as the heart, liver, and lungs. Additionally, it is found in the thyroid, uterus, bladder, and various regions of the colon and stomach. Unfortunately, the median TPM values for these tissues are not available in the data provided. There is also no cPKG score information available for these tissues.
- Graph Output: A visual representation of the results is displayed on the Knowledge Graph viewer.
User can click the zoom icon on the right to expand the Knowledge Graph view for easier exploration and full-screen display.