Data & Labeling

Sources of training data:
  • PhoneShell voicemail messages (222, of which 50 were explicitly emotional messages recorded by us)
  • Call Home Corpus (88)
  • Oasis Corpus (51)

The first ten seconds of each voicemail message were used for labeling and training. This choice follows studies reporting that listeners decide whether to skip a message or keep listening after hearing "the first few seconds" of it (Whittaker, Hirschberg et al., All Talk and All Action).
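The ten-second cutoff can be sketched as follows. This is a minimal illustration, not the tooling we actually used: it assumes the messages are stored as uncompressed WAV files, and the function name and file names are hypothetical.

```python
import wave


def first_seconds(src_path, dst_path, seconds=10.0):
    """Copy at most the first `seconds` of a WAV file to dst_path.

    Returns the number of audio frames written. Messages shorter than
    the cutoff are copied whole.
    """
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        # Number of frames in the cutoff window, capped at the file length.
        n_frames = min(src.getnframes(), int(seconds * src.getframerate()))
        frames = src.readframes(n_frames)

    with wave.open(dst_path, "wb") as dst:
        # Reuse channel count, sample width and rate from the source;
        # the frame count in the header is corrected when the file closes.
        dst.setparams(params)
        dst.writeframes(frames)

    return n_frames


# Hypothetical usage:
# first_seconds("message_017.wav", "message_017_10s.wav")
```

Counting the cutoff in frames rather than bytes keeps the sketch independent of sample width and channel count.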

Emotionally significant segments were extracted from the Call Home and Oasis corpora. Both corpora were very good sources of examples for the formal vs. informal distinction.

All 361 messages were labeled independently by both of us. The labels were then compared, and only the messages on which both labels agreed were used as training data.
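The agreement filter can be sketched in a few lines. This is an illustrative sketch only: it assumes each annotator's labels are held in a dictionary keyed by message id, and the ids and label names shown are hypothetical.

```python
def keep_agreed(labels_a, labels_b):
    """Return {message_id: label} for messages both annotators labeled identically."""
    return {
        msg_id: label
        for msg_id, label in labels_a.items()
        if msg_id in labels_b and labels_b[msg_id] == label
    }


# Hypothetical message ids and label names, for illustration only.
annotator_1 = {"msg001": "formal", "msg002": "informal", "msg003": "formal"}
annotator_2 = {"msg001": "formal", "msg002": "formal", "msg003": "formal"}

# Only messages with matching labels survive as training data.
training_labels = keep_agreed(annotator_1, annotator_2)

# Fraction of messages on which the two annotators agreed.
agreement_rate = len(training_labels) / len(annotator_1)
```

Discarding disagreements trades training set size for label reliability, so the agreement rate is worth reporting alongside the final message count.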

link to labels