Math and Data (MaD) Seminar

Training Dynamics of Multi-Head Softmax Attention for In-Context Learning: Emergence, Convergence, and Optimality

Time and Location:

May 2, 2024 at 2 PM; 60 Fifth Avenue, Room 150

Speaker:

Zhuoran Yang, Yale