arXiv cs.CL·19 May 2026

A Pilot Benchmark for NL-to-FOL Translation in Planetary Exploration

Signal

Hype

In three linesPilot benchmark for translating natural language to First-Order Logic (FOL) in planetary exploration. Dataset built from NASA mission documentation (2003-2013), manually annotated with FOL representations capturing temporal structure, agent roles, and operational dependencies. Structured predicate vocabularies provided.

Read source

Your take?

Reasoning Benchmarks Robotics AI Agents

Summary generated by Claude — human-verified

A Pilot Benchmark for NL-to-FOL Translation in Planetary Exploration

Other angles on this story