Subword Modeling

Subword Modeling

The goal of this course is to lead students to engage broadly with the existing NLP and computational linguistics research on subword modeling and develop new computational approaches to problems in morphology, orthography, and phonology. In addition to three other miniprojects, students will be expected to produce one piece of research that can be developed into a conference or workshop paper (though submission is not a course requirement). The paper should be suitable for the “Phonology, Morphology, and Word Segmentation” tracks of the *ACL conferences, the SIGMORPHON workshop, Coling, or LREC.

For a complete overview, see the syllabus.

Semester: Spring 2024

Location: PH 100

Day/Time: Tue and Thu, 11:00am—12:20pm

Contents:

Course Schedule Top

Date Lec Topic Assignment
Jan 16 1 Intro Lecture; introduction to Project 1 Project 1
Jan 18 2 Signs, minimal signs, and compositionality
Jan 23 3 Productivity and generalization
Jan 25 4 Inflection, derivation, and compounding
Jan 30 5 Morphotactics, affix ordering, the mirror principles, and the relevance principle
Feb 01 6 Computational approaches to morphological segmentation/tokenization
Feb 06 7 Allomorphy
Feb 08 8 Non-concatenative processes
Feb 13 9 Lexemes and paradigms; introduction to Project 2 Project 2
Feb 15 10 Grammatical properties
Feb 20 11 Word-and-Paradigm morphology
Feb 22 12 Rules of realization and rules of referral
Feb 27 13 Neural approaches to reinflection
Feb 29 14 Neural approaches to paradigm completion
Mar 12 15 Descriptive phonetics; introduction to Project 3 Project 3
Mar 14 16 IPA versus orthographies
Mar 19 17 Typology of orthographies
Mar 21 18 G2P and P2G
Mar 26 19 Unicode—logical and visual representations
Mar 28 20 Input methods (ML and HCI perspectives)
Apr 02 21 Articulatory features; introduction to Project 4 Project 4
Apr 04 22 Syllabification and syllable segmentation
Apr 09 23 Phonological similarity and cognate detection
Apr 16 24 Sound change
Apr 18 25 Rule ordering and relative chronology
Apr 23 26 Presentations
Apr 25 26 Presentations

In-Class Exercises Top

Instructors Top

Instructor: Prof. David R. Mortensen

Teaching Assistant: Anjali Kantharuban

Course Objectives Top

At the end of this course, students will:

Course Principles and Classroom guidelines

This course will be based around discussion of readings, guided by handouts. There will be no slides and no recordings of lectures. There are three groundrules:

Prerequisites Top

The course is open to graduate students and advanced undergraduates who have completed:

Grading Top

Grades will be based on three components:

Required Texts Top

There are two required textbooks (in additional to many freely available readings):

Resources/Support/How to Succeed in This Course Top

Office Hours

Scheduled for 2 hours weekly. One-on-one meetings with the instructor can always be scheduled using Calendly.

Office of Disability and Access Services

Many people have disabilities, including members of our own families. We see disabilities as deficits not in disabled people but in the institutions and societies that are structured such that they are disadvantaged. We wish to do our part to overcome this disparate treatment. If you have a disability (visible or invisible), please let us know as soon as possible (you don’t need to tell us the nature of the disability) and work with Disability Service to develop a set of accommodations which we can then approve. These might, for example, include lecture materials that are usable by people with visual disabilities, sign language interpretation, captioning, flexible due dates, etc.

Support from the Instructor

Finally, if you tell me that you are having trouble, I will not judge you or think less of you. You do not owe me an explanation of your health (physical or mental) or the health of your loved ones; but you are welcome to tell me and I will listen. Even if I can’t help you directly, it is likely that I know someone who can. If you need help or more information, please ask, and I will work with you.

Class and University Policies Top

Diversity, Inclusion, Anti-racism

Throughout human history, some people have been denied the rights and opportunities available to others on the basis of their race, gender, economic class, caste, ancestry, language community, age, religion, beliefs, political affiliation, and abilities (visible and invisible). A single course cannot undo the injustices of history, but we—as a teaching staff—are committed to fighting inequity and promoting inclusion. We encourage you to join us. If you feel that you, or those around you, have been treated unfairly based upon their identity (or perceived identity) by us, by other members of the teaching staff, or by other students in the course, we ask that you bring it to our attention so that we can address the wrongs (as well as pursing the approved University channels).

Policy on Religious Observances

Due dates can be revised for documented religious observances. Please make sure to communicate about religious observances as far in advance as possible (and no later than one week before the observance, per university policy) so that accommodations can be made.

Assignment Completion & Late Work

This course works best when everybody completes their work by the designated deadlines. This prevents cascading tardiness from overwhelming both students and teaching staff. However, sometimes there are situations that call for extensions. Some examples (real examples) include the following:

We care about you and your well being more than we care about deadlines and if something difficult is happening in your life which is making it hard for you to complete an assignment on time please contact me so we can talk. I have found that, often, the students who most need some leeway are those least likely to ask for it. When in doubt, ask. We will work out a plan so you can complete the requirements of the course with your physical and psychological health intact. Do not feel ashamed to reach out to me. I am eager to see you succeed.

Academic Integrity

Any cheating or plagiarism will be dealt with according to the University policies on academic integrity. In general, discussion of tools, concepts, and formalisms is acceptable collaboration and is encouraged. Misrepresenting the work of others as your own, however, is considered cheating.

License

Creative Commons License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License