Try It!
Focus on Utterance Segmentation
Listen to the audio and transcribe it using the SALT conventions.