Overview

CheckThat AI is built for participation in the CLEF-CheckThat! Lab’s Task 2 (2025), a competitive research challenge focused on advancing automatic fact-checking and claim verification technologies. The CLEF (Conference and Labs of the Evaluation Forum) CheckThat! Lab is an annual evaluation campaign that brings together researchers, practitioners, and developers to tackle pressing challenges in computational fact-checking.

What is CLEF-CheckThat! Lab?

CLEF-CheckThat! is an international shared task organized as part of the CLEF conference series. The lab focuses on developing automated methods for:
  • Check-worthiness detection: Identifying claims that merit fact-checking
  • Claim detection and extraction: Finding factual assertions in text
  • Claim matching: Identifying previously fact-checked claims
  • Claim verification: Assessing the veracity of claims using evidence

Research Goals

The lab addresses critical challenges in the fight against misinformation:
  1. Scale: The volume of information online far exceeds human fact-checking capacity
  2. Speed: False information spreads rapidly and requires quick responses
  3. Complexity: Claims are often embedded in noisy, unstructured social media content
  4. Context: Understanding requires disambiguation and background knowledge

Task 2 (2025): Claim Normalization

Task Description

Given: A noisy, unstructured social media post (text extracted from platforms such as Twitter, Facebook, or Instagram).
Goal: Transform the post into a concise, normalized claim that is:
  • Self-contained: Understandable without external context
  • Verifiable: Can be fact-checked against reliable sources
  • Concise: Typically under 25 words
  • Faithful: Accurately represents the original assertion without hallucination
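Two of these criteria can be checked mechanically. The sketch below (function and threshold names are illustrative, not part of the official task tooling) tests conciseness by word count and applies a crude self-containedness heuristic; verifiability and faithfulness still require model-based or human judgment.

```python
# Vague subjects that suggest a claim is not self-contained.
VAGUE_OPENERS = {"they", "it", "this", "that", "these", "those", "he", "she"}

def check_claim(claim: str, max_words: int = 25) -> dict:
    """Rough automatic checks for the 'concise' and 'self-contained' criteria."""
    words = claim.split()
    return {
        # the task targets claims under ~25 words
        "concise": len(words) < max_words,
        # crude heuristic: the claim should not open with an unresolved pronoun
        "self_contained_hint": bool(words)
        and words[0].lower().strip(",.") not in VAGUE_OPENERS,
    }
```

A claim like "They are hiding the truth" passes the length check but fails the self-containedness heuristic, flagging it for further normalization.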

Why This Matters

Social media posts present unique challenges:
Challenge Example: A post might read: “They’re hiding the truth about what happened last week! Wake up people!!!” This contains vague references (“They”, “the truth”, “what happened”), emotional language, and no concrete specifics, making direct fact-checking impossible without normalization.

Competition Objectives

Participants develop systems that can:
  1. Extract core claims from verbose or fragmented text
  2. Resolve ambiguity in references and context
  3. Remove noise like hashtags, URLs, and emotional language
  4. Preserve meaning while achieving conciseness
  5. Maintain factual accuracy without introducing new information
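Objective 3 (noise removal) is the most mechanical of these steps and can be sketched with plain regular expressions; this is an illustrative pre-processing pass, not the project's actual cleaning code:

```python
import re

def strip_noise(post: str) -> str:
    """Remove common social-media noise before claim extraction:
    URLs, hashtags, @-mentions, and repeated punctuation."""
    post = re.sub(r"https?://\S+", "", post)   # URLs
    post = re.sub(r"[#@]\w+", "", post)        # hashtags and @-mentions
    post = re.sub(r"([!?.])\1+", r"\1", post)  # collapse "!!!" to "!"
    return re.sub(r"\s+", " ", post).strip()   # normalize whitespace
```

For example, `strip_noise("Wake up people!!! #truth https://t.co/x")` yields `"Wake up people!"`. The remaining objectives (ambiguity resolution, meaning preservation, faithfulness) are where the LLM-based components come in.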

Dataset Information

Data Format

The CLEF-CheckThat! Task 2 dataset contains:
  • Development Set (dev.csv): Labeled examples for model training and validation
  • Test Set (test.csv): Unlabeled posts for final evaluation
  • JSONL Format: Alternative format with structured metadata
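Both formats load with the Python standard library alone. In this sketch the column and field names are assumptions; inspect the header of the actual `dev.csv`/`test.csv` before relying on them:

```python
import csv
import json

def load_csv(path: str) -> list[dict]:
    """Read a CSV split into a list of dicts keyed by the header row."""
    with open(path, newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))

def load_jsonl(path: str) -> list[dict]:
    """Read the JSONL variant: one JSON object per line."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]
```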

Data Characteristics

Source Material: Posts are collected from multiple social media platforms and represent diverse topics including:
  • Health and medicine claims
  • Political statements
  • Scientific assertions
  • News and current events
  • Viral misinformation

Evaluation Metrics

Systems are evaluated using:
  • METEOR Score: Primary metric measuring semantic similarity and quality (see METEOR Scoring)
  • Human Evaluation: Manual assessment of claim quality by expert judges
  • Quality Criteria: Verifiability, check-worthiness, clarity, and faithfulness
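METEOR's core is a recall-weighted harmonic mean of unigram precision and recall. The simplified sketch below shows only that core; the full metric (as implemented in, e.g., NLTK) also matches stems and WordNet synonyms and applies a fragmentation penalty, so use a real implementation for actual scoring:

```python
def meteor_like(reference: str, hypothesis: str, alpha: float = 0.9) -> float:
    """Simplified METEOR core: exact unigram matches combined by the
    recall-weighted harmonic mean F = P*R / (alpha*R + (1-alpha)*P)."""
    ref, hyp = reference.lower().split(), hypothesis.lower().split()
    # count each hypothesis word at most as often as it appears in the reference
    matches = sum(min(ref.count(w), hyp.count(w)) for w in set(hyp))
    if matches == 0:
        return 0.0
    precision, recall = matches / len(hyp), matches / len(ref)
    return (precision * recall) / (alpha * recall + (1 - alpha) * precision)
```

An identical reference and hypothesis score 1.0; fully disjoint word sets score 0.0. The heavy recall weighting (alpha = 0.9) penalizes normalized claims that drop content from the reference more than ones that add a little.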

Research Context

Claim normalization builds on several research areas:
  1. Information Extraction: Identifying structured information from unstructured text
  2. Text Simplification: Making complex text more accessible
  3. Summarization: Condensing information while preserving key content
  4. Coreference Resolution: Identifying what pronouns and references point to
  5. Fact-Checking: Verifying the truthfulness of claims

Academic Impact

The CLEF-CheckThat! Lab has produced significant research contributions:
  • Published in proceedings of CLEF conferences
  • Cited in academic papers on fact-checking and misinformation
  • Used as benchmarks for claim detection and verification systems
  • Influenced real-world fact-checking platforms and tools

CheckThat AI’s Approach

Our system leverages advanced techniques:

Model Architecture

  • Multi-Model Support: GPT-4, Claude, Gemini, Llama, and Grok
  • Prompting Strategies: Zero-shot, few-shot, and chain-of-thought reasoning
  • Refinement Loop: Iterative improvement using G-Eval feedback
  • Quality Assessment: Automated evaluation with DeepEval integration
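The few-shot strategy above amounts to prepending worked post-to-claim pairs to the model input. This sketch shows the shape of such a prompt; the instruction wording and the example pair are illustrative, not the exact prompts used by CheckThat AI:

```python
# Hypothetical demonstration pairs; real few-shot examples would come
# from the labeled development set.
FEW_SHOT_EXAMPLES = [
    ("They're hiding the truth about the vaccine!!! #wakeup",
     "Authorities are concealing information about the vaccine."),
]

def build_prompt(post: str) -> str:
    """Assemble a few-shot claim-normalization prompt."""
    lines = ["Rewrite each post as one concise, self-contained, verifiable claim.", ""]
    for src, norm in FEW_SHOT_EXAMPLES:
        lines += [f"Post: {src}", f"Claim: {norm}", ""]
    lines += [f"Post: {post}", "Claim:"]  # the model completes the final claim
    return "\n".join(lines)
```

Zero-shot drops the example pairs; chain-of-thought adds intermediate reasoning before the final "Claim:" line.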

Processing Pipeline

See our complete fact-checking pipeline for details on how we:
  1. Detect claims in noisy posts
  2. Extract verifiable assertions
  3. Normalize and disambiguate
  4. Evaluate and refine outputs
  5. Score quality using METEOR and G-Eval
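The evaluate-and-refine steps form a feedback loop: generate a claim, score it, and regenerate with the scorer's feedback until the score clears a threshold or a round budget runs out. The control flow can be sketched as follows, where `normalize`, `geval_score`, and the 0.8 threshold are stand-ins rather than the project's actual API:

```python
def refine(post, normalize, geval_score, threshold=0.8, max_rounds=3):
    """Iteratively regenerate a claim until its quality score passes
    the threshold or the round budget is exhausted."""
    claim = normalize(post, feedback=None)        # first draft, no feedback yet
    for _ in range(max_rounds):
        score, feedback = geval_score(post, claim)
        if score >= threshold:
            break                                 # good enough: stop refining
        claim = normalize(post, feedback=feedback)  # retry with critique
    return claim
```

Capping the rounds keeps cost bounded when a post has no satisfying normalization.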

References

CLEF-CheckThat! Resources

  • Official Website: CLEF-CheckThat! Lab Task 2 (2025)
  • CLEF Conference: Annual conference on experimental IR and NLP evaluation
  • Previous Tasks: Archives of past competitions and datasets

Academic Citations

If you use CheckThat AI or build upon this work, please cite:
@misc{checkthat-ai-clef2025,
  title={Claim Extraction and Normalization for CLEF-CheckThat! Lab Task 2},
  author={Nikhil Kadapala},
  year={2025},
  howpublished={\url{https://github.com/nikhil-kadapala/clef2025-checkthat-lab-task2}}
}
