دورية أكاديمية
Word segmentation from transcriptions of child-directed speech using lexical and sub-lexical cues.
العنوان: | Word segmentation from transcriptions of child-directed speech using lexical and sub-lexical cues. |
---|---|
المؤلفون: | Goriely, Zébulon, Caines, Andrew, Buttery, Paula |
بيانات النشر: | Cambridge University Press (CUP) Department of Computer Science and Technology Student //dx.doi.org/10.1017/s0305000923000491 J Child Lang |
سنة النشر: | 2023 |
المجموعة: | Apollo - University of Cambridge Repository |
مصطلحات موضوعية: | CHILDES, statistical learning, word segmentation |
الوصف: | We compare two frameworks for the segmentation of words in child-directed speech, PHOCUS and MULTICUE. PHOCUS is driven by lexical recognition, whereas MULTICUE combines sub-lexical properties to make boundary decisions, representing differing views of speech processing. We replicate these frameworks, perform novel benchmarking and confirm that both achieve competitive results. We develop a new framework for segmentation, the DYnamic Programming MULTIple-cue framework (DYMULTI), which combines the strengths of PHOCUS and MULTICUE by considering both sub-lexical and lexical cues when making boundary decisions. DYMULTI achieves state-of-the-art results and outperforms PHOCUS and MULTICUE on 15 of 26 languages in a cross-lingual experiment. As a model built on psycholinguistic principles, this validates DYMULTI as a robust model for speech segmentation and a contribution to the understanding of language acquisition. |
نوع الوثيقة: | article in journal/newspaper |
وصف الملف: | application/pdf |
اللغة: | English |
العلاقة: | https://www.repository.cam.ac.uk/handle/1810/357362Test |
الإتاحة: | https://www.repository.cam.ac.uk/handle/1810/357362Test |
حقوق: | Attribution 4.0 International ; https://creativecommons.org/licenses/by/4.0Test/ |
رقم الانضمام: | edsbas.341C7653 |
قاعدة البيانات: | BASE |
الوصف غير متاح. |