πŸ₯‡ 1st Place β€” CLEF 2026 GutBrainIE (all 4 subtasks)

GutBrainIE 2026 NER Β· Linking Β· RE

πŸ“… 2026 πŸ‘€ Co-First Author & Corresponding πŸ›οΈ CLEF 2026 Working Notes (BioASQ) πŸ“‚ GitHub Repository

Overview

NightSun is a single sequential pipeline that addresses all four GutBrainIE 2026 subtasks over gut-brain axis biomedical abstracts: named entity recognition (T611), entity disambiguation / linking (T612), and mention- and concept-level relation extraction (T621/T622). T611 entities feed T612 disambiguation and T621 relation extraction; T621 mention-level triples are then lifted to canonical concept URIs via T612.

πŸ† Achievement: Ranked 1st in all four official subtasks. The build on our prior 2025 first-place NER system, extended end-to-end to entity linking and relation extraction.

Why this domain is hard

Pipeline

   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
   β”‚   Gut-brain axis PubMed abstract (raw text)   β”‚
   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                           β–Ό
   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
   β”‚  T611 Β· NER                                   β”‚
   β”‚  BiomedBERT-CRF Β· 27 BIO labels (13 types)    β”‚
   β”‚  chunked inference β†’ cross-model voting       β”‚
   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
            entity spans    β”‚
            β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
            β–Ό                               β–Ό
   β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”      β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
   β”‚  T612 Β· Entity        β”‚      β”‚  T621 Β· Mention-level β”‚
   β”‚  Linking (NERD)       β”‚      β”‚  Relation Extraction  β”‚
   β”‚  dict β†’ SapBERT β†’     β”‚      β”‚  typed entity markers β”‚
   β”‚  type-prior rerank    β”‚      β”‚  CE-Dice Β· 18 classes β”‚
   β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜      β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
              β”‚  canonical URIs              β”‚  mention triples
              β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
                             β–Ό
                  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
                  β”‚  T622 Β· Concept-level β”‚
                  β”‚  RE β€” lift triples to β”‚
                  β”‚  canonical concept URIsβ”‚
                  β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

T611 β€” Named Entity Recognition

Token-level sequence labeling with BertCrfForTokenClassification: a BiomedBERT encoder, a linear emission head, and a CRF layer over a 27-label BIO scheme.

T612 β€” Entity Linking (three-stage cascade)

A unified knowledge base of 52,263 concept aliases derived from MeSH, NCBI Gene, GTDB, NCBI Taxonomy, FoodOn, and USDA FNDDS, plus a direct 18,512-entry surface-form lookup table from the training annotations.

  1. Dictionary lookup β€” normalized (surface form, entity type) query against the lookup table; handles common unambiguous mentions with near-perfect precision in under a second.
  2. SapBERT dense retrieval β€” unresolved mentions encoded with SapBERT, top-k (k=20) candidate URIs retrieved from a FAISS index over the 52,263 alias embeddings (95.6% top-1 where a correct KB entry exists).
  3. Type-prior reranking β€” candidates reranked by sim(e,c) + Ξ±Β·log p(c|t), correcting type-ambiguous surface forms (e.g. a chemical that also appears as a drug) without over-relying on frequency priors.

Key finding: retrieval accuracy on in-KB entities is near-ceiling β€” the real bottleneck is KB coverage, not retrieval quality, which explains the dev-to-test gap on this subtask.

T621 / T622 β€” Relation Extraction

Error analysis: the oracle-to-pipeline RE gap is driven mostly by upstream NER errors, not the relation classifier itself β€” identifying NER recall as the highest-leverage direction for further gains.

Tech Stack

Python PyTorch Transformers BiomedBERT BioLinkBERT CRF SapBERT FAISS Relation Extraction Entity Linking BioNLP