Subword Modeling

Subword Modeling

The goal of this course is to lead students to engage broadly with the existing NLP and computational linguistics research on subword modeling and develop new computational approaches to problems in morphology, orthography, and phonology. In addition to three other miniprojects, students will be expected to produce one piece of research that can be developed into a conference or workshop paper (though submission is not a course requirement). The paper should be suitable for the “Phonology, Morphology, and Word Segmentation” tracks of the *ACL conferences, the SIGMORPHON workshop, Coling, or LREC.

For a complete overview, see the syllabus.

Semester: Spring 2025

Location: DH A302

Day/Time: Mon and Wed, 11:00am—12:20pm

Contents:

Course Schedule Top

Date Lec Topic Assignment
Jan 13 1 Intro Lecture; introduction to Project 1 Project 1
Jan 15 2 Signs, minimal signs, and compositionality
Jan 22 3 Productivity and generalization
Jan 27 4 Inflection, derivation, and compounding
Jan 29 5 Morphotactics, affix ordering, the mirror principles, and the relevance principle
Feb 03 6 Computational approaches to morphological segmentation/tokenization
Feb 05 7 Allomorphy
Feb 10 8 Non-concatenative processes
Feb 12 9 Lexemes and paradigms; introduction to Project 2 Project 2
Feb 17 10 Grammatical properties
Feb 19 11 Word-and-Paradigm morphology
Feb 24 12 Rules of realization and rules of referral
Feb 26 13 Neural approaches to reinflection
Mar 10 15 Descriptive phonetics; introduction to Project 3 Project 3
Mar 12 16 IPA versus orthographies
Mar 17 17 Typology of orthographies
Mar 19 18 G2P and P2G
Mar 24 19 Unicode—logical and visual representations
Mar 26 20 Articulatory features; introduction to Project 4 Project 4
Mar 31 21 Syllabification and syllable segmentation
Apr 07 22 Phonological similarity and cognate detection
Apr 09 23 Sound change
Apr 14 24 Rule ordering and relative chronology
Apr 16 25 Presentations
Apr 21 26 Travel
Apr 23 26 Presentations

In-Class Exercises Top

Instructors Top

Instructor: Prof. David R. Mortensen

Teaching Assistant: TBA

Course Objectives Top

At the end of this course, students will:

Course Principles and Classroom guidelines

This course will be based around discussion of readings, guided by handouts. There will be no slides and no recordings of lectures. There are three groundrules:

Prerequisites Top

The course is open to graduate students and advanced undergraduates who have completed:

Grading Top

Grades will be based on three components:

Required Texts Top

There are two required textbooks (in additional to many freely available readings):

Resources/Support/How to Succeed in This Course Top

Office Hours

Scheduled for 2 hours weekly. One-on-one meetings with the instructor can always be scheduled using Calendly.

Office of Disability and Access Services

Many people have disabilities, including members of our own families. We see disabilities as deficits not in disabled people but in the institutions and societies that are structured such that they are disadvantaged. We wish to do our part to overcome this disparate treatment. If you have a disability (visible or invisible), please let us know as soon as possible (you don’t need to tell us the nature of the disability) and work with Disability Service to develop a set of accommodations which we can then approve. These might, for example, include lecture materials that are usable by people with visual disabilities, sign language interpretation, captioning, flexible due dates, etc.

Support from the Instructor

Finally, if you tell me that you are having trouble, I will not judge you or think less of you. You do not owe me an explanation of your health (physical or mental) or the health of your loved ones; but you are welcome to tell me and I will listen. Even if I can’t help you directly, it is likely that I know someone who can. If you need help or more information, please ask, and I will work with you.

Class and University Policies Top

Diversity, Inclusion, Anti-racism

Throughout human history, some people have been denied the rights and opportunities available to others on the basis of their race, gender, economic class, caste, ancestry, language community, age, religion, beliefs, political affiliation, and abilities (visible and invisible). A single course cannot undo the injustices of history, but we—as a teaching staff—are committed to fighting inequity and promoting inclusion. We encourage you to join us. If you feel that you, or those around you, have been treated unfairly based upon their identity (or perceived identity) by us, by other members of the teaching staff, or by other students in the course, we ask that you bring it to our attention so that we can address the wrongs (as well as pursing the approved University channels).

Policy on Religious Observances

Due dates can be revised for documented religious observances. Please make sure to communicate about religious observances as far in advance as possible (and no later than one week before the observance, per university policy) so that accommodations can be made.

Assignment Completion & Late Work

This course works best when everybody completes their work by the designated deadlines. This prevents cascading tardiness from overwhelming both students and teaching staff. However, sometimes there are situations that call for extensions. Some examples (real examples) include the following:

We care about you and your well being more than we care about deadlines and if something difficult is happening in your life which is making it hard for you to complete an assignment on time please contact me so we can talk. I have found that, often, the students who most need some leeway are those least likely to ask for it. When in doubt, ask. We will work out a plan so you can complete the requirements of the course with your physical and psychological health intact. Do not feel ashamed to reach out to me. I am eager to see you succeed.

Academic Integrity

Any cheating or plagiarism will be dealt with according to the University policies on academic integrity. In general, discussion of tools, concepts, and formalisms is acceptable collaboration and is encouraged. Misrepresenting the work of others as your own, however, is considered cheating.

License

Creative Commons License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License