The ÔCrit DayÕ Study

 

Yadid Ayzenberg and Brian D. Mayton

 

 

 

Abstract. We set out to characterize skin conductance changes as a response to public speaking. A group of 11 graduate students participated in the study. These students were presenting their thesis proposals to the faculty and other graduate students on a given day. Each participant was given two wristband sensors.  Each of the wristbands measured electrodermal activity, skin temperature and 3-axis accelerometery.  The participants were required to wear the sensors for 72 hours. In addition, they were required to submit daily surveys at the end of each day. The presentations were videotaped.

 

 

 

1. Introduction

 

Public speaking is often associated with high levels of stress and anxiety. Many individuals fear being in the spotlight, being the center of attention, and the possibility of standing on stage and forgetting what they meant to say. Additional elements that may add to the anxiety are the possibility of being asked questions that they are not prepared to answer. We set out to characterize the skin conductance changes related to public speaking. Would it be possible to find a correlation between speakers perceived stress levels and their physiological response? Were the levels of stress highest before, during or after a talk? Were the physiological responses of individuals that perceived themselves as very stressed higher than the ones of those that perceived themselves as calm?

 

To gain insight into the answers to these questions, we conducted a study and collected a dataset including electrodermal activity (EDA) and survey responses from 11 participants for several days around a stressful public speaking event.  In contrast to most studies concerned with EDA data, we also collected this data from both wrists for the duration of the study in an attempt to better understand what might cause an asymmetric EDA response.

 

This paper is organized as follows. Section 2 reviews previous studies on the subject of this work. Section 3 provides details about the experiment. Section 4 presents the resulting data set. Section 5 provides results and analysis.

 

 

 

2. Previous Work

 

It is possible to measure changes in stress level by using electro dermal activity sensors. Our Òfight-or-flightÓ response that is controlled by the sympathetic nervous system is heightened under stressful situations. The sub-dermal sweat glandsÕ activity, which is controlled by the sympathetic nervous system (SNS), can be measured using biosensors that are placed on the wrist and measure skin conductivity. Boucsein showed that skin conductance is highly correlated to changes in sweat gland activity. Work done by Setz et al. showed that it is possible to distinguish between cognitive load and stress in electrodermal activity. By using EDA peak height and the instantaneous peak rate it is possible to determine the stress level of a person.

Dimberg  et al. found that subjects that possessed a public speaking fear reacted with increased skin conductance activity when exposed to social stimuli compared to low-fear subjects.

 

 

 

3. Experiment

 

Each year, the MIT Media Lab organizes an event in which second year master's students present their thesis proposal to all of faculty, students, and researchers. Each proposal is evaluated in terms of depth, originality, and contribution. The purpose of these presentations is twofold. The first is that the student will receive constructive feedback on the proposed work.  The second is that it compels the students to spend time to explore possible research topics while constraining their schedule. All masterÕs students are required to participate in a thesis preparation seminar that will aid them in solidifying their ideas. At the beginning of this seminar, the students are told of the importance of their ÔCrit DayÕ performance that will be a factor that weighs heavily in their Ph.D. application. Naturally, this adds significant levels of stress to the students who realize that this speaking event may determine the future of their academic career.

 

We decided that ÔCrit DayÕ was a valuable opportunity to measure the physiological effects of public speaking.

 

3.1. Study Design

 

We recruited 11 graduate students, 8 males and 3 females, who were designated to present on ÔCrit DayÕ. The youngest was aged 23, the oldest was aged 27 and the mean age was 24.6 (SD=1.2). The students would receive compensation in return for their participation in the study. We made sure beforehand that none of the participants were taking medication that could potentially have an arousing or calming affect, or ADHD medication.

 

Before the study began, we asked the participants to report their perceived level of stress during the last month. We used the Perceived Stress Scale (PSS) as defined by Cohen et al., which is the most widely used psychological instrument for measuring the perception of stress. 

It is a measure of the degree to which circumstances in oneÕs life are considered as stressful.  Items were designed to determine how unpredictable, uncontrollable, and overloaded subjects find their lives to be.

 

During the three days of the study, we requested the participants to report their daily perceived stress levels at the end of each day.  We used the Daily Stress Inventory (DSI) as defined by Brantley et al., which is a psychometrically sound self-report instrument for the daily assessment of the sources and the individualized effect of relatively minor stressful events. It was designed to evaluate causes of stress that are not typically evaluated by major life-event scales. We add 4 additional questions to the standard DSI survey:

 

1. If something that was unexpected or caused you a great deal of stress, please note the approximate time and duration of the event.

 

2. On a scale of 0=time seemed to take forever, to 4=time seemed to fly by, how would you say your day felt?

 

3. On a scale of 0=felt poorly prepared to 4=felt very prepared, how well prepared do you feel you were for today's events?

 

4. This question mostly applies to ÔCrit DayÕ (Nov 14): On a scale of 0=unfair to 4=fair, in your opinion, how fair were the questions you were asked today?

 

 

The goal of question number 1 was to collect data that would be correlated to changes of the participants skin conductance. We hypothesized that unexpected events would cause a notable increase in skin conductance immediately following the event. We also collected similar data by performing exit interviews upon the completion of the study. However, since some time had passed from the occurrence of the events to the time of the interview, the results of these interviews may be biased.

 

We asked question 2 in order to find whether perceived stress had an impact on the perception of time duration. We hypothesized that the stringent deadline and sense of urgency would shorten the perceived duration of time, and that a day after the presentation, time would be perceived as having a longer duration.

 

Question 3 was asked in order to asses if the participants felt that they were well prepared. We hypothesized that well prepared participant would have a lower level of perceived stress and that this would also be evident in their skin conductance.

 

We asked question 4 to assess whether the participant felt in control during the presentation and whether they thought that they were asked fair questions. We hypothesized that during the question phase, participants who believed that they were asked ÒunfairÓ question would exhibit this physiologically as well.

 

 

 

 

3.2. Data Collection.

 

We collected the following data during the study:

 

1.     Skin conductance, skin temperature, and 3-axis accelerometer – we used the Affectiva QTM wristband sensor. The data was recorded at a sampling rate of 8 Hz, and was recorded using dry Ag-AgCl electrodes. Each participant received a pair of sensors, one for each wrist, so we could collect bi-lateral data. Each sensor had a label that was used to note the participant's id and whether the sensor was to be placed on the left or right wrist. The participants were asked to wear the sensors for 72 hours starting on the morning of the day before their presentation.

 

2.     Self-report measures – each participant was asked to fill in a PSS survey 2 days prior to the start of the study. During the study, participants were required to fill in a DSI survey at the end of each day.

 

3.     In addition, we interviewed each participant a few days after the end of the measurements. During those interviews, we asked the participants to describe their experiences during the 72 hours. We asked them to note any unexpected events or events that caused them a great deal of anxiety or emotional strain.

 

4.     Video – during the presentations we recorded a video of the participants. In some cases, we also recorded some of the participant's post-presentation question sessions.

 

3.2.2 IT Infrastructure and tools

 

For participant registration and conducting the surveys, we built custom Python-based web software that ran on a server in the lab. The participants could access the surveys by using a web browser from any location.  The tool presented the surveys to the participants and collected their responses.

A MySQL server was used to store the survey responses. At the end of the study we exported all the results from the database into an Excel sheet.

The Q sensor data was downloaded from each unit and stored in a dedicated server, along with the survey responses and the video files.

 

For the signal processing we used MATLAB. We built a Python tool to convert all of the Q sensor data into MATLAB file format and filter out noise.

 

We also used a tool written in Processing by Miriam Zisook that enabled us to rapidly detect cases of asymmetry.

 

3.2.1 Privacy Measures

 

In order to protect the privacy of the participants, several measures were put in place. When the participants logged into the web system for the first time, they were randomly allocated a 5-digit PIN number. They were asked to use this number every time they logged into the system. In addition, they were asked to write this number on the labels of the sensors that they were given. This enabled us to correlate between the subjects sensor reading and their self-reports.

 

Results and Discussion

We have collected the following data set:

 

PSS

11 participants (all)

DSI

á      7 recipients filled in all surveys

á      3 participants filled in the first 2 DSI surveys but not the 3rd one

á      1 participant did not fill in any of the DSI surveys

Video and audio during presentation

10 participants

Interviews

11 participants (all)

 

We calculated a PSS score per participant. The score is obtained by reversing the responses of the negatively stated items (e.g., 0 = 4, 1 = 3, 2 = 2, 3 = 1 & 4 = 0) to the four positively stated items (items 4, 5, 7, & 8) and then summing across all scale items. This score represented the perceived level of stress during the last month. The average PSS that was collected for a large population was 12.1 (SD=5.9) for males and 13.7 (SD=6.6) for females.

 

We calculated DSI scores for each participant. Three daily scores are derived for each individual: (1) the number of events that are endorsed as having occurred (Freq) (2) the sum of the total of the impact rating of these events (Sum), and (3) the average impact rating of the events (sum divided by the number of events) (Avg).

 

The table below shows the PSS and DSI scores for each participant.

 

Day 1

Day 2

Day 3

PIN

PSS Sums

Freq

Sums

Avg

Freq

Sum

Avg

Freq

Sum

Avg

54604

25

16

50

3.125

21

45

2.143

75049

22

14

62

4.429

20

67

3.350

17

34

2.000

34600

21

25

80

3.200

29

122

4.207

05972

19

15

46

3.067

9

25

2.778

11

34

3.091

18372

17

13

31

2.385

17

55

3.235

15

50

3.333

57607

17

19

73

3.842

14

48

3.429

41965

10

31

51

1.645

23

47

2.043

21

63

3.000

36237

9

13

63

4.846

17

55

3.235

4

9

2.250

13113

9

16

28

1.750

16

29

1.813

16

23

1.438

20052

8

10

26

2.600

8

16

2.000

0

0

0.000

 

 

 

 

 

 

 

 

 

 

 

Table 1 PSS and DSI scores

The first result is that according to the DSI the mean perceived stress across all participants was highest on the night before ÔCrit DayÕ (3.09) and lowest on the night of the day after ÔCrit DayÕ (2.16). On the night of ÔCrit DayÕ the perceived stress was in between the two (3.09). The average DSI measured for a large population in CohenÕs was 2.36 (SD=0.82) for males and 2.68 for females (SD=0.97)

We assumed that there should be a correlation between the PSS and the first DSI as they were submitted within the same time frame. Initially we found that the correlation between them was 0.3122, which was statistically not significant. After viewing the plot we noticed that a single subject was an outlier. Further investigation reveled that the participant had a argument with his partner just before filling the survey. We suspected this resulted in a biased score. After removing this subject we found that the correlation between the PSS and first DSI was 0.71, which was statistically significant (p<0.05).

 

The DSI also revealed that participants perceived time as passing quickly the day before ÔCrit DayÕ (mean = 2.4, SD = 1.5), and passing slower during ÔCrit DayÕ (mean = 2.33, mean = 1.21), and yet slower the day after (mean = 1.83, SD = 1.17).

 

We also found that most participants felt that the questions they were asked were fair (mean = 3.8, SD = 0.42).

 

Initially, after viewing the EDA signals we found that some of the samples were extremely noisy. We estimate that the reason for this is loose contact between the sensor and the skin. We applied an infinite impulse response low pass filter (exponential smoothing) with different coefficients for rising signals and falling signals (alpha rise = 0.5, alpha fall = 0.95).  This filter is an attempt to reduce artifacts where the signal briefly but sharply drops due to poor contact with the skin, which appeared in several subjects' data. We implemented this filter in Python.

 

The first result that arose from analyzing the EDA signals was that there is a notable difference within each subject between the presentation and the questions phase. In the question phase, the EDA level was higher, and there were more peaks per second.  This is true across all participants. We conclude that either the fear of being asked a difficult question and evaluated by others, or just being asked a question results in a heightened physiological response

 

 

 

Above can be seen the left and right EDA signals for two subjects. The EDA during the presentation is colored in blue, and the EDA during the question phase is colored in red.

 

We analyzed all of the EDA signals for asymmetry. 2 participants out of 11 exhibited temporary asymmetry in their EDA. There was no correlation between perceived stress ratings (on DSI and PSS) and asymmetry in EDA. The snapshot below shows asymmetry that begins just prior to the participantÕs presentation and lasts throughout.

 

Figure 1 EDA Asymmetry (18372) during presentation

 

The second EDA signal also shows asymmetry, as the participant is about to start his presentation.

 

Figure 2 EDA Asymmetry (57607) during presentation

 

Finally, we tried to count peaks and correlate them with the percieved stress levels. However, we could not find any significant results.  We suspect that our facility for counting peaks (the findpeaks function in MATLAB's signal processing toolkit) is confounded by noise in the data from poor contact with the skin, which varies greatly from participant to participant and, to a lesser extent, over the course of the day (perhaps due to participants re-adjusting the fit of the wristband.)  A more careful definition of what constitutes a 'peak' feature in EDA data and is more robust to noise may yield better results. 

 

 

 

 

 

 

 

PIN

13 Nov    Left

13 Nov  Right

14 Nov  Left

14 Nov  Right

15 Nov  Left

15 Nov Right

5972

0.108931

0.042647

0.097395

0.091768

0.105573

0.091005

13113

0.006115

0.016222

0.115815

0.034953

0.112331

0.029933

18372

0.016222

0.00873

0.002861

0.031462

0.014959

0.055563

20052

0.09649

0.120339

0.013636

0.018172

34600

0.038732

0.245188

0.069173

0.106703

0.218192

0.086264

36237

0.036325

0.013221

0.015145

0.00938

0.033767

0.028134

41965

0.239856

0.206557

0.403685

0.236872

0.048534

0.032602

54604

2.868826

0.163433

0.010651

0.008393

0.562887

0.40959

57607

0.140879

0.238506

0.167272

0.438723

0.083897

0.124151

75049

0.119898

0.070873

0.269948

0.485653

0.129117

91405

0.113824

0.110514

0.125519

0.100129

0.148339

0.545966

Mean

0.344190727

0.112384545

0.1021152

0.122409364

0.1814132

0.1532325

 

Table 2 Number of Peaks per Minute using findpeaks and threshold of 0.001

 

 

Conclusion

 

We have collected a large dataset, consisting of bilateral EDA data, initial Perceived Stress Scale (PSS) responses, and daily Daily Stress Inventory (DSI) survey responses.  Preliminary analysis shows some patterns that we expected, such as greater activation during Q&A after the presentation (which cannot be rehearsed) and less during the presentation (which participants likely rehearsed several times and felt prepared.)  The dataset also contains several events where the participants' EDA is asymmetric, some of which we have associated with participants' descriptions of events at those times.

 

The preliminary analysis that we have done merely scratches the surface.  We plan to continue investigating and to make the de-identified data available to other researchers for further study.  

 

References

 

Boucsein, W.: Electrodermal Activity. Plenum Press, New York (1992)

 

Brantley P.J., Waggoner C.D., Jones G.N., Rappaport N.B.: A daily stress inventory: Development, reliability, and validity. J Behav Med;10:61–731, (1987)

 

Cacioppo, J.T., Tassinary, L.G., Berntson, G.G.: Handbook of Psychophysiology.

Cambridge University Press, Cambridge (2000)

 

Cohen, S; Kamarck T, Mermelstein .: "A global measure of perceived stress". Journal of Health and Social Behavior 24 (4): 385–396. PMID 6668417, R (December 1983)

 

Cohen, S. and Williamson, G.:  Perceived Stress in a Probability Sample of the United States, The Social Psychology of Health, Newbury Park, CA:  Sage (1988)

 

Dimberg, U., Fredrikson, M. and Lundquist, O.: Autonomic reactions to social and neutral stimuli in subjects high and low in public speaking fear, Journal Biological Psychology, Volume 23(3), p223 (1986).

 

Lykken, D.T., Venables, P.H.: Direct measurement of skin conductance: A proposal

for standarization. Psychophysiology 8(5), 656–672 (1971)

 

Mundy-Castle, A.C., McKiever, B.L.: The psychophysiological signiÞcance of the

galvanic skin response. Experimental Psychology 46(1), 15–24 (1953)

 

Poh, M., Swenson, N., Picard, R.: A wearable sensor for unobtrusive, long-term

assessment of electrodermal activity. IEEE Trans. Biomed. Eng. 57(5), 1243–1252 (2010)

 

Setz, C., Arnrich, B., Schumm, J., La Marca, R., Troster, G., Ehlert, U.: Discriminating stress from cognitive load using a wearable eda device. IEEE Transactions on Information Technology in Biomedicine 14(2), 410–417 (2010)