GA4GH Session · April 16, 2026

Bring Your Own Disease
Rare Disease Phenotyping with FAIR Data

Pair clinical narratives with structured disease and phenotype terms — using Orphanet, OMIM, ICD, SNOMED, HPO, and more — identify gaps, and capture every contribution as FAIR-compliant open data.

📅 April 16, 2026  ·  2:00 – 4:45 PM 👥 All backgrounds welcome 🌐 Data stays open

How to contribute

You do not need to know Git, RDF, or any programming language to participate. Pick the path that fits you.

🩺
For clinicians · patient advocates · registry managers

Share a disease resource

Fill in a web form — no GitHub account needed, no code. Use any disease ontology you know: Orphanet, OMIM, ICD, SNOMED, GARD, and more. Your submission becomes a GitHub Issue that the team turns into FAIR data.

🦠 Submit a disease case 🔍 Report an ontology gap 📊 Report a data / model gap 💬 Feedback on these forms
🧬
For bioinformaticians · data engineers · ontologists

Contribute structured data

Add or edit Turtle files directly in the repo. Data is validated against ShEx profiles on every pull request. The FAIR Data Point is rebuilt and indexed automatically on merge.

Read CONTRIBUTING.md →

Already have a GitHub account? These links go directly to each form (faster during the live session):

Explore the FAIR Data

All structured contributions are published as a FAIR Data Point — machine-readable RDF available in Turtle and JSON-LD, listed in the StaticFDP Index alongside other FAIR Data Points worldwide.

🗂 This session's FAIR Data Point

Catalog and dataset metadata for all disease cases collected in this session. Served as RDF Turtle and JSON-LD. Updated automatically on every merge.

🌐 StaticFDP Index

A daily-crawled index of all registered FAIR Data Points. This session's FDP will appear here after the first push.

🧬 Per-Disease FAIR Dataset Index

Every disease submitted during this and future sessions becomes its own versioned FAIR dataset — diseases/{slug}/ in this repository. Each folder contains RDF Turtle, JSON-LD, a human-readable README, and linked ontology/data gap reports. The index below is the entry point for other biohackathons to discover and reuse this data.

How to use

# Python
from rdflib import Graph
g = Graph()
g.parse("https://fdp.semscape.org/ga4gh-rare-disease-trajectories/diseases/index.ttl")
# curl
curl https://fdp.semscape.org/ga4gh-rare-disease-trajectories/diseases/index.ttl

Technical contribution guide

The repository uses extensionless Turtle files validated by ShEx. GitHub Actions handles validation, SHACL generation, and FDP index pinging automatically.

  1. Fork or clone the repository github.com/StaticFDP/ga4gh-rare-disease-trajectories

  2. Create a case folder Copy cases/_template/ to cases/your-disease-name/. Fill in narrative.md, timeline.yaml, phenotype-map.yaml.

  3. Add a dataset Turtle file Create an extensionless file (e.g. cases/your-disease/dataset) using the Dataset.shex profile as your guide. ShEx validation runs automatically on your PR.

  4. Open a pull request CI runs ShEx validation (primary gate) and SHACL cross-check. On merge: SHACL + JSON-LD are generated and the FDP index is pinged.

ShEx profiles CONTRIBUTING.md GitHub Actions

Session agenda

April 16, 2026 · 2:00–4:45 PM

Time Item Who
2:00Introduction & framingAndra & Ada
2:10Disease terminologies + ECTO ontologyNicole
2:20HPO — purpose and boundaries HPOJules
2:25Phenopackets — purpose and boundariesJules
2:35Facial image–based phenotyping (GestaltMatcher)Tzung-Chien Hsieh
2:45Structured phenotyping in clinical geneticsOrion
2:55Use case presentations (10 min each)Danielle, Zohreh, Janine, Ada
3:35Group work & mapping hands-onAll
— break —
4:35Wrap-up & next steps FDPAll

Key resources

Standards, tools, and registries referenced in this session.

Live demonstrations

Two interactive demos showing what becomes possible once disease data is FAIR.

🔎
Demo 1 — SPARQL across FDPs

Query RDF metadata from all registered FDPs in your browser — no server needed.

Powered by Comunica — a W3C-compliant SPARQL engine running entirely in the browser. Pre-loaded queries: list all datasets, find by ORDO/OMIM disease term, show distributions, list community submissions.

Open SPARQL demo →
🔄
Demo 2 — Form → RDF pipeline

Watch a web-form submission become machine-readable linked data in real time.

The pipeline runs automatically: submit a disease case → GitHub Issue → GitHub Actions converts it to RDF Turtle → committed to docs/fdp/submissions.ttl → served via GitHub Pages → queryable in Demo 1.

  1. Submit a case via the form above (ORCID login required)
  2. Watch the Actions tab convert it to Turtle (<60 s)
  3. Run "Community submissions" query in Demo 1 to see it appear
View submissions.ttl →