Back to feed
arXiv cs.CL·

A Pilot Benchmark for NL-to-FOL Translation in Planetary Exploration

Signal
72
Hype
18
In three linesPilot benchmark for translating natural language to First-Order Logic (FOL) in planetary exploration. Dataset built from NASA mission documentation (2003-2013), manually annotated with FOL representations capturing temporal structure, agent roles, and operational dependencies. Structured predicate vocabularies provided.
Read source
Your take?
ReasoningBenchmarksRoboticsAI Agents

Summary generated by Claude — human-verified