A Modular Architecture for Typologically Controlled Lexicon Generation
Signal
72
Hype
15
In three linesModular framework for generating pronounceable, typologically plausible artificial lexicons. Samples phoneme inventories from PHOIBLE, applies three phonological grammars (deterministic, OT, MaxEnt), and assigns meanings via Swadesh-Leipzig-Jakarta ontology. Evaluation on character n-gram perplexity and KL divergence: probabilistic grammars outperform baselines on 100-5,000 word forms.Read source
Your take?
Summary generated by Claude — human-verified