Back to feed
arXiv cs.CL·

A Modular Architecture for Typologically Controlled Lexicon Generation

Signal
72
Hype
15
In three linesModular framework for generating pronounceable, typologically plausible artificial lexicons. Samples phoneme inventories from PHOIBLE, applies three phonological grammars (deterministic, OT, MaxEnt), and assigns meanings via Swadesh-Leipzig-Jakarta ontology. Evaluation on character n-gram perplexity and KL divergence: probabilistic grammars outperform baselines on 100-5,000 word forms.
Read source
Your take?
PapersBenchmarks

Summary generated by Claude — human-verified