🥇 1st Place — CLEF 2025 GutBrainIE

Biomedical NER System

📅 Mar 2025 – May 2025 | 👤 Tech Lead & 1st Co-Author | 📂 GitHub Repository

Overview

Developed a state-of-the-art Named Entity Recognition (NER) system for the CLEF 2025 GutBrainIE shared task, which focuses on extracting biomedical entities from PubMed abstracts about the gut-brain axis. The core philosophy: "Smart Fine-Tuning Is All You Need".

🏆 Achievement: Ranked 1st on the leaderboard (Micro-F1: 0.8408), about 3 points of Micro-F1 above the baseline (0.8117), through strategic fine-tuning and ensemble techniques.
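As an illustration of what an ensemble step can look like for NER, here is a minimal sketch of majority voting over predicted entity spans. The page does not say which ensembling method was used, so this is an assumption chosen for simplicity; the function name `vote_entities`, the `(doc_id, start, end, label)` tuple layout, and the example labels are all hypothetical.

```python
from collections import Counter

def vote_entities(model_predictions, min_votes=None):
    """Keep entity spans predicted by at least `min_votes` models.

    model_predictions: list with one collection per model, each holding
    (doc_id, start, end, label) tuples. This layout is an assumption.
    """
    n_models = len(model_predictions)
    if min_votes is None:
        # Strict majority by default: 2 of 3, 3 of 5, and so on.
        min_votes = n_models // 2 + 1
    counts = Counter()
    for preds in model_predictions:
        # set() ensures each model votes at most once per span.
        counts.update(set(preds))
    return {ent for ent, votes in counts.items() if votes >= min_votes}

# Toy example with three hypothetical models.
a = ("doc1", 0, 12, "bacteria")
b = ("doc1", 20, 28, "chemical")
c = ("doc1", 40, 47, "chemical")
ensemble = vote_entities([[a, b, c], [a, b], [a]])
print(sorted(ensemble))
```

Span-level voting only counts exact matches on boundaries and label; voting on token-level tags or averaging model logits are common alternatives with different trade-offs.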

Key Innovations

Experiment Results

Systematic experiments revealed key insights:

Final Results (Baseline → Best Ensemble): Macro-F1 0.7211 → 0.7773 | Micro-F1 0.8117 → 0.8408
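The Macro-F1 and Micro-F1 figures above weight entity types differently: micro pools all predictions before computing F1, while macro averages the per-label F1 scores, so rare labels count as much as frequent ones. The sketch below shows the distinction with exact-match entity spans. It is not the official GutBrainIE scorer; the helper names, tuple layout, and toy labels are assumptions made for illustration.

```python
from collections import Counter

def f1(tp, fp, fn):
    """F1 from raw counts, returning 0.0 when undefined."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

def micro_macro_f1(gold, pred):
    """gold, pred: sets of (doc_id, start, end, label) tuples (assumed layout)."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for ent in pred:
        # A prediction is correct only if span and label both match exactly.
        (tp if ent in gold else fp)[ent[3]] += 1
    for ent in gold - pred:
        fn[ent[3]] += 1
    labels = sorted({ent[3] for ent in gold | pred})
    # Micro: pool counts over all labels, then compute one F1.
    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    # Macro: compute F1 per label, then take the unweighted mean.
    macro = sum(f1(tp[l], fp[l], fn[l]) for l in labels) / len(labels) if labels else 0.0
    return micro, macro

# Toy data: the frequent label is perfect, the rare label is missed.
gold = {("d1", 0, 5, "bacteria"), ("d1", 10, 15, "bacteria"),
        ("d1", 20, 25, "bacteria"), ("d1", 30, 35, "chemical")}
pred = {("d1", 0, 5, "bacteria"), ("d1", 10, 15, "bacteria"),
        ("d1", 20, 25, "bacteria"), ("d1", 30, 36, "chemical")}
micro, macro = micro_macro_f1(gold, pred)
print(f"micro={micro:.2f} macro={macro:.2f}")  # micro=0.75 macro=0.50
```

In the toy data a single boundary error on the rare label costs far more Macro-F1 than Micro-F1, which is why macro scores tend to sit below micro scores when some entity types are rare.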

Competition Results

The team was the best-performing entry on T61 (the NER task) as measured by Micro-F1:

Technical Pipeline

Tech Stack

Python, PyTorch, Transformers, PubMedBERT, CRF, Hugging Face, Jupyter