SYSTEM DEMONSTRATION

TELL: Traced Evidence for Learned Labels

This is a demo of a GRPO-trained model that detects AI-generated text by marking the specific "tells" that indicate AI or human authorship, where we focus on explainability rather than just a classifier score (read more in How this works).

OUR APPROACH

Imagine you see this text:

Dublin is the capital city of Ireland and one of the most exciting places I have ever visited. The city is full of history, culture, and friendly people.

and you go to check it on an AI detector:

Other detectors

87%

↑ this is telling

TELL

Dublin is the capital city of Ireland and one of the most exciting places I have ever visited. The city is full of history, culture, and friendly people.

↑ this is showing

Current AI detectors only give you a score. But is that enough?

We think that's not how we can build trust. For example, if I want to "accuse" someone of writing with AI, I need evidence, I can't just say "this website told me it's 87% AI".

So if we want proper AI text detection, we need to know why something is AI-generated. That's what TELL does.

TEST IT YOURSELF

Text to analyze

Inference engine: loading

Was this prediction correct?