Generalization of multisensory perceptual learning

Powers III, Albert R.; Hillock-Dunn, Andrea; Wallace, Mark T.

doi:10.1038/srep23374

Generalization of multisensory perceptual learning

Article
Open access
Published: 22 March 2016

Volume 6, article number 23374, (2016)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Generalization of multisensory perceptual learning

Download PDF

Albert R. Powers III^1,3,4,5,
Andrea Hillock-Dunn^1,2 &
Mark T. Wallace^1,2,3

3376 Accesses
37 Citations
9 Altmetric
1 Mention
Explore all metrics

Abstract

Life in a multisensory world requires the rapid and accurate integration of stimuli across the different senses. In this process, the temporal relationship between stimuli is critical in determining which stimuli share a common origin. Numerous studies have described a multisensory temporal binding window—the time window within which audiovisual stimuli are likely to be perceptually bound. In addition to characterizing this window’s size, recent work has shown it to be malleable, with the capacity for substantial narrowing following perceptual training. However, the generalization of these effects to other measures of perception is not known. This question was examined by characterizing the ability of training on a simultaneity judgment task to influence perception of the temporally-dependent sound-induced flash illusion (SIFI). Results do not demonstrate a change in performance on the SIFI itself following training. However, data do show an improved ability to discriminate rapidly-presented two-flash control conditions following training. Effects were specific to training and scaled with the degree of temporal window narrowing exhibited. Results do not support generalization of multisensory perceptual learning to other multisensory tasks. However, results do show that training results in improvements in visual temporal acuity, suggesting a generalization effect of multisensory training on unisensory abilities.

Multisensory perceptual learning is dependent upon task difficulty

Article 11 July 2016

Perceptual learning shapes multisensory causal inference via two distinct mechanisms

Article Open access 19 April 2016

Multisensory integration is independent of perceived simultaneity

Article 21 November 2016

Introduction

Our ability to perceive the world in an accurate and meaningful way depends critically upon the appropriate integration of stimuli from the different senses. One of the more challenging aspects of this process comes in determining which of a constant stream of stimuli from the different senses were caused by the same environmental event. One statistical feature that the brain employs in accomplishing this task is the temporal structure of the incoming sensory inputs. Thus, those events that occur in close temporal proximity are likely to have been caused by the same environmental event, whereas events that occur at disparate times are unlikely to have a common origin. However, because the environmental energies carrying information from the different senses (i.e., light, sound) propagate at different rates, the temporal relationship between these different inputs must be flexibly specified. For this reason the construct of a temporal window of multisensory binding has been proposed–an interval spanning several hundred milliseconds and within which paired events in two different sensory modalities are likely to produce enhanced neural, perceptual, and behavioral responses^{1,2,3,4,5,6,7,8,9}.

Although these studies have characterized the multisensory temporal binding window in adults, and others have focused on its developmental dynamics^10,11,12, only recently have efforts been made to examine the relative stability/lability of this window. This work illustrated marked malleability in multisensory temporal processes following perceptual training¹³, a plasticity that could be linked to activation changes in a multisensory cortical network centered on the posterior superior temporal sulcus¹⁴. Numerous questions remain about the perceptual changes induced by such training. Foremost among these are questions of generalization - specifically whether the improvements in multisensory temporal acuity brought about by training are capable of transferring to other tasks involving either multisensory or unisensory (i.e., visual alone, auditory alone) processing. Reports of such generalization are rare in the perceptual learning literature^{15,16,17,18,19,20}, with relatively few studies showing effects that transfer to untrained tasks. Although one study has demonstrated improvements in auditory temporal processing following training on a somatosensory temporal discrimination task²⁰, and some studies examining the effect of multisensory adaptation effects demonstrate some degree of cross-modal transfer^21,22,23, few studies have shown transfer of multisensory perceptual learning^24,25,26,27. One notable demonstration of these effects has described generalization of multisensory perceptual learning of a temporal order judgment (TOJ) task onto the sound-induced flash illusion (SIFI)²⁷, although the degree to which these changes are driven by effects on unisensory or multisensory processing remain unclear.

To examine this question, we trained participants on two versions (i.e., a 2-alternative forced choice (2-AFC) and a 2-interval forced choice (2-IFC)) of an audiovisual simultaneity judgment task in which they received trial-by-trial feedback on their perceptual reports (published previously in ref.13; Fig. 1a,b). We then examined these participants’ performance on a different task known to be sensitive to the temporal structure of audiovisual stimuli –the sound-induced flash illusion (Fig. 1c)^28,29.

Results

No change in SIFI illusion is seen after training, but improvements are seen in correctly recognizing two-flash conditions

Participants in both the 2-AFC and 2-IFC training and passive exposure groups completed the SIFI task immediately after the baseline simultaneity judgment assessment (i.e., prior to training) and again after the final simultaneity judgment assessment following the 5 days of training. The results of these experiments are summarized in Fig. 2. Here changes in the proportion of trials in which participants report two flashes are plotted as a function of the number of flashes and beeps actually presented during pre- and post-training assessments. Baseline between-group differences at each condition were not evident (F₅ = 1.304, p = 0.26). Examination of training effects indicate significant main effects of group (F₂ = 9.68; p < 0.001), training status (F₁ = 13.01; p < 0.001), and condition (F₅ = 34.05; p < 0.001) as well as significant group-by-training status (F₂ = 9.68; p = 0.04), group-by-condition (F₁₀ = 3.04; p < 0.001), and training status-by-condition (F₅ = 13.81; p < 0.001) interactions. Post-hoc paired t-tests revealed significant increases in the likelihood of correctly identifying two-flash conditions in the 2-AFC (Fig. 2a; 2 Flash/0 Beep: t₂₁ = 3.0, p = 0.037; 2 Flash/1 Beep: t₂₁ = 2.86, p = 0.038; 2 Flash/2 Beep: t₂₁ = 3.02; p = 0.033; all p values Holm-Sidak corrected for multiple comparisons) and 2-IFC groups (Fig. 2b; 2 Flash/0 Beep: t₁₉ = 4.6, p = 0.001; 2 Flash/1 Beep: t₁₉ = 3.75, p = 0.007; 2 Flash/2 Beep: t₁₉ = 3.09; p = 0.02; all p values Holm-Sidak corrected for multiple comparisons). In contrast, there were no significant changes in likelihood of reporting two flashes when only one was present after training when plotted as a function of the number of beeps present across any conditions; this includes the 1 Flash/2 Beep (or SIFI “fission” illusion condition). Similarly, there was no change in the likelihood of reporting only one flash when two were present and paired with one beep, or the SIFI “fusion” illusion. The Exposure Group (Fig. 2c) did not exhibit any significant changes on any conditions after passive exposure to the stimulus pairings. Taken together, results indicate that while all groups show evidence of both the fission (1 Flash/2 Beep) and fusion (2 Flash/1 Beep) versions of the SIFI illusion as evidenced by comparison with other 1-Flash and 2-Flash conditions, respectively, neither of these illusory conditions were affected by training. In contrast, any increment in 2-Flash reporting exhibited by the training groups was shared among the 2-Flash conditions, supporting a specific effect on recognition of two flashes regardless of the presence, absence, or number of auditory stimuli.

**Figure 2: No change in reports of the SIFI illusion are seen after training, but improvements are seen in correctly reporting the number of flashes in the two-flash conditions.**

Performance improvements are driven by increased hits across all conditions

In each of the participant groups (2-AFC, 2-IFC, and passive exposure), there was no significant change in the proportion of false alarms following training (Fig. 3a). In contrast, significant increases were seen in the proportion of hits in the two training groups (2-AFC: t₂₁ = 2.84, p = 0.0099; 2-IFC: t₁₉ = 3.22, p = 0.0045) but not in the exposure group. Similarly, a significant increase in hits persisted in the 2-AFC participants who returned for follow-up assessment (Fig. 3b; t₁₃ = 2.89, p = 0.0125). This group also demonstrated a decrease in false alarms on follow-up, manifested as a decrease in likelihood of reporting the SIFI (t₁₃ = 2.49, p = 0.027).

Multisensory perceptual training results in changes in sensitivity (d-prime) on the SIFI task, but no changes in response bias

In order to determine if the improvements in two-flash reporting seen with training were the result of a true sensitivity increase or due to a change in response bias, signal detection theory was employed. In this approach, measures of sensitivity (d-prime) can be derived from the correct and incorrect responses in the SIFI task. Individual baseline, post-training/exposure, and follow-up (as applicable) data are plotted in Fig. 4 for d-prime (a–c) as well as for measures of response bias (i.e., beta (d–f)) measures.

**Figure 4: Multisensory perceptual training results in changes in sensitivity on the SIFI task, but no changes in response bias.**

Analysis of baseline measures for each group indicated no group-wise difference in baseline d-prime (F_2,53 = 1.27; p = 0.28) or beta (F_2,53 = 0.21; p = 0.81). A main effect of training was detected on mixed ANOVA with training status (pre/post) as a within-subject factor and group as a between-subjects factor (F_1,2 = 11.49, p < 0.001). In both the 2-AFC and 2-IFC training groups, marked increases in sensitivity were noted following training (2-AFC mean increase in d-prime = 0.45; t₂₁ = 2.38, p = 0.027; 2-IFC mean increase in d-prime = 0.38; t₁₉ = 2.34, p = 0.030) (Fig. 4a,b), and the 2-AFC group continued to demonstrate significant increases on follow-up assessment (main effect of assessment (follow-up cohort, n = 14), F₂ = 11.49, p < 0.001; t₁₃ = 4.33, p < 0.001). In contrast, the passive exposure group showed no difference in sensitivity at the two time points (Fig. 4c). In contrast to these changes, no significant changes in beta were noted among any of the three groups on initial or follow-up assessment (Fig. 4d–f).

Collectively, these results illustrate that while training on a multisensory simultaneity judgment task does not alter the perception of a temporally-dependent multisensory illusion (i.e., SIFI), it does result in an increased ability to correctly detect two flashes presented in close succession when compared with pre-training data. These results support a lack of generalization from the multisensory training for the illusory task, but do provide evidence for generalization from multisensory training for improvements in visual temporal perception.

Increases in d-prime correlate with the degree of temporal window narrowing

To determine whether the degree of change induced in the multisensory temporal binding window brought about by training is able to predict the degree of change in sensitivity on the SIFI task, individual difference scores in multisensory temporal window size and d′ were entered as factors into a linear regression. As seen in Fig. 5a, there is a direct correlation between the decrease in window size and the increase in sensitivity immediately after training (r² = 0.1967; p = 0.0145). In a related analysis, Fig. 5b highlights a significant correlation between participants’ pre-/post-training difference in probability of reporting audiovisual simultaneity on the AFC simultaneity judgment task at the 300ms SOA and the increase in d-prime observed at this SOA that was most significantly altered on the SIFI task (r² = 0.2439; p = 0.0195).

**Figure 5: Increases in d-prime on the SIFI task correlate with measures of improvement in audiovisual simultaneity judgment.**

Discussion

In the current study we show that individuals who have undergone perceptual training on an audiovisual simultaneity judgment task do not demonstrate altered perception of the sound-induced flash illusion (SIFI) on those conditions in which the illusory percept is typically seen. Rather, they demonstrate performance improvements driven by an increase in the correct recognition of two-flash conditions after training. Moreover, we have demonstrated that the magnitude of change in performance on the SIFI task is directly dependent upon the degree of temporal window narrowing. Finally, we have shown that these changes are stable at least one week following the cessation of training.

The use of signal detection theory as an approach to the SIFI is unique, and here it is somewhat unconventionally applied. In contrast to traditional studies meant to determine individuals’ sensitivity to a faintly-presented signal, the SIFI task is meant to determine how susceptible individuals are to a stimulus that is in fact not present; thus, the effects of interest here (i.e., the illusion) would traditionally be described as false alarms (in the case of fission illusion) and misses (in the case of fusion illusion). Similar approaches have been taken to illusory phenomena^30,31 (including the SIFI²⁷) and hallucinations³², and applied here allow a more nuanced understanding of the interplay between unisensory and multisensory processing. In particular, the distinction between sensitivity improvements and response bias will be crucial in interpreting some of the differences observed here, as described below.

One of the more intriguing findings in the current study is the lack of a correlation between window narrowing and d-prime for participants trained on the 2-IFC task. One explanation for this difference is that because a single-presentation 2-AFC task of this type (as opposed to the 2-IFC construct) relies more upon the setting of an internal criterion^33,34, this decrease may be the result of post-perceptual processes. Moreover, such a mechanism would likely result in a change in response bias (i.e., β) in the 2-AFC group, but no such change was found. A more plausible explanation for these differences may rest on subtle differences in attention inherent in the 2-AFC and 2-IFC tasks^35,36,37. Because the 2-AFC task requires direct judgment of audiovisual simultaneity in a single stimulus pair rather than detection of timing differences between two objectively simultaneous and non-simultaneous pairs (as in the 2-IFC task), participants in the 2-AFC group may be more likely to attend to the fine temporal structure of single stimulus pairs, thus leading to both increases in hits and decreases in false alarms, a combination observed only in the 2-AFC group (although only on follow-up assessment). However, further studies with more sensitive measures of attention may be needed to determine whether these mechanisms may be at play in the differences observed here.

The observation that performance improvements after multisensory perceptual learning are seen not in another audiovisual task but rather in an improvement in visual temporal precision implies a complex relationship between multisensory and unisensory temporal processing. Rather than supporting the notion of a direct correspondence between related and temporally-dependent audiovisual tasks, the relationships observed may instead represent a dependence of visually-driven sensitivity improvements upon domain-general or unisensory timing improvements wrought by multisensory perceptual learning. The idea that these two processes may be acting in concert to produce the combination of behavioral effects observed is not novel, and in fact has been demonstrated in other models³⁸ and studies^24,39 of perceptual learning. Indeed, the seemingly contradictory observations that: 1) the performance improvements documented here are driven primarily by improvements in ability to correctly recognize two-flash conditions and 2) that these improvements are differentially expressed based upon the temporal structure of the auditory stimuli together warrant a complex model of interactions between visual, auditory, and multisensory temporal function. Future studies should examine the changes brought about by multisensory perceptual learning upon visual and auditory processing (see Stevenson, et al.²⁵ as an example of visual-to-audiovisual training generalization), but should also approach the questions of generalization across low-level stimulus attributes (e.g. retinotopic location) in order to begin to identify potential loci of change relevant to each task across the information processing hierarchy³⁸.

Intriguingly, the main effects described here are driven primarily by increases in participants’ abilities to discriminate between the presentation of one flash versus two flashes presented in close temporal proximity. Thus, the training effects brought about by audiovisual perceptual training largely represent changes in visual temporal performance. Other investigations into cross-modal generalization of temporally-based perceptual learning have generated mixed results. For example, transfer of learning has been shown from training on a somatosensory timing task to a corresponding auditory task if similar intervals are tested²⁰, and recent work demonstrates transfer of learning from a multisensory temporal order judgment task to a visual temporal order judgment task^25,40. Another study failed to demonstrate transfer of simultaneity learning either within or across different sensory modalities⁴¹. While the results reported here fall short of providing definitive evidence in support of the existence of a single, crossmodal or multisensory clock in its classical formulation as a pacemaker-accumulator⁴², they do join others^43,44,45 in demonstrating the possible existence of shared components for timing perception among the sensory modalities. Indeed, these results fit well with a growing literature in support of interval-specific timing circuits that are dependent upon the time scale in question but independent of stimulus specifics, location, or modality^46,47,48,49. Along these lines, future investigations should focus upon whether, as these results and cue reliability models of multisensory integration may predict^{50,51,52,53,54}, the relationship between the multisensory and visual improvements described here may be causally linked.

Methods

2-AFC Training

Subjects

Twenty-two (22) Vanderbilt undergraduate and graduate students (mean age 20.73; 11 female) took part in the 2-alternative forced-choice (2-AFC) training portion of the study. Data from this cohort of participants were obtained for a separate study (Powers et al.¹³) and results from training are reported in that original paper. By self-report, all participants had normal hearing and vision, and none had any history of neurological or psychiatric disorders. All procedures for all subject groups were approved by the Vanderbilt University Institutional Review Board (IRB). Additionally, all methods were carried out in accordance with the approved guidelines, and informed consent was obtained from all participants.

2-AFC Simultaneity Judgment Assessment

In this task (Fig. 1a,b), participants judged whether the presentation of an auditory stimulus and a visual stimulus was ‘simultaneous’ or ‘non-simultaneous’ by pressing 1 or 2, respectively, on a response box. For details of stimulus characteristics and task structure, please see¹³.

Sound-Induced Flash Illusion (SIFI) Task

Participants completed the SIFI task^28,29,55 once directly after the baseline simultaneity judgment assessment and once again after the completion of the final simultaneity judgment assessment on Day 5. In this task, participants are presented with one or two flashes paired with zero, one, or two beeps. Flashes were 8.5 ms in duration, and consisted of a white circle with an area of 12.6 cm² presented on a black background one centimeter below the fixation cross. Beeps consisted of a temporally-ramped 5000 Hz pure tone of 8 ms duration. In the two flash/one beep condition, the flashes were separated by 50 ms and the beep was presented at the midpoint between the two flashes. Similarly, in the two flash/two beep condition, the timing between flashes remained constant and one beep was always presented at the midpoint between the two flashes, with the other preceding or following it by an SOA ranging from 50 to 300 ms. In the illusion-inducing one flash/two beeps condition, one beep always occurred simultaneously with the flash onset, while one either preceded or followed that onset by SOAs ranging from 50 to 300 ms. This condition typically induces the perception of two flashes although only one appears, and the strength of this illusion varies with SOA²⁹. Each of these conditions occurred an equal number of times so as not to introduce a response bias. After each trial, participants responded by button-press to indicate the number of flashes they had perceived. In the SIFI assessment task there were 300 total trials (10 cycles × 30 trials/cycle) with an equal distribution of each condition.

2-AFC Simultaneity Judgment Training

The training task differed from simultaneity judgment assessments in that after making a response, the subject was presented with either the phrase “Correct!” paired with a happy face, or “Incorrect” paired with a sad face corresponding to the correctness of their choice. These faces (happy = yellow, sad = blue, area = 37.4 cm²) were presented for 500 ms in the center of the screen. The white ring and fixation were of the same size and duration as in assessment trials. Only SOAs between −150 and 150 ms, broken into 50 ms intervals, were used during the training phase. Additionally, in this phase the SOAs were unequally distributed: the veridical simultaneous condition had a 6:1 ratio to any of the other 6 non-simultaneous conditions. In this way there was an equal likelihood of simultaneous/non-simultaneous conditions, minimizing concerns about introducing a response bias. The training phase consisted of 120 trials (20 cycles × 6 trials/cycle). See Fig. 1a,b for illustrations of the temporal relationship between stimuli.

2-AFC Training Protocol

Training occurred over 5 hours (1 hour per day) during which participants took part first in a pre-training simultaneity judgment assessment, next in one SIFI assessment, then in 3 shorter simultaneity judgment training runs, followed by a post-training simultaneity judgment assessment. An additional baseline assessment was performed at the start of the study for each subject, followed by the typical training day; this was designed to detect any practice effects that may have resulted from completion of the assessment itself.

Follow-Up Assessment

After one week without training, a subset of the training cohort described above (n = 14, 6 female; mean age = 21.14) returned to the lab and underwent one simultaneity judgment assessment and one SIFI assessment without any training.

2-AFC Exposure

Subjects

Fourteen (14) Vanderbilt undergraduate and graduate students (mean age 19.5; 4 female) underwent the 2-alternative forced-choice (2-AFC) exposure portion of the study. As with the 2-AFC training group data, data from this cohort of participants represents data obtained for a separate study (Powers et al.¹³). All participants had self-reported normal sight and hearing, and none had any personal or family history of neurological or psychiatric disorders.

Exposure Protocol

The exposure portion of the study differed from the 2-AFC training protocol only in that instead of the training blocks, participants underwent 2-AFC exposure blocks of the same length. Thus, all participants in both cohorts took part in the same number of 2-AFC simultaneity judgment and SIFI assessments. The details of the exposure sessions are outlined below.

2-AFC Exposure

In the interest of maintaining attention, the 2-AFC exposure blocks were designed as an oddball task wherein participants were exposed to the same audiovisual pairs used in the simultaneity judgment training sessions but were instructed to press a button when they saw a red ring. As in the simultaneity judgment training sessions, the veridical simultaneous condition had a 6:1 ratio to any of the other 6 non-simultaneous conditions. Oddballs occurred with the same probability across all conditions, and were 1/10 as likely to appear as the standard. The rings and fixation were of the same dimensions and duration as in the assessment trial; the tone was identical to that presented during the simultaneity judgment assessment and training sessions. A range of SOAs between −150 and 150 ms, in steps of 50-ms intervals, were used for this task.

2-IFC Training

Subjects

Twenty (20) Vanderbilt undergraduate and graduate students (mean age 20.3; 14 female) underwent the 2-interval forced-choice (2-IFC) training portion of the study. As with data from the other cohorts, data from this cohort of participants represents a subset of that obtained for a separate study (Powers et al.¹³). All participants had normal hearing and vision by self-report, and none had any personal or close family history of neurological or psychiatric disorders.

2-IFC Simultaneity Judgment Assessment

The 2-IFC simultaneity judgment assessment employed precisely the same stimuli as those used in the 2-AFC task. In this task, however, participants were presented with two audiovisual pairs, one with an SOA of zero (simultaneously-presented) and one with a non-zero SOA (non-simultaneously presented). Presentations were separated by 1 second, during which a fixation cross alone was presented. Participants were asked to indicate as quickly as possible by button-press which interval (first or second presentation) contained the flash and beep that happened at the same time. Simultaneous pairings were as likely to be presented in the first interval as in the second, and a simultaneous-simultaneous catch trial condition was present in equal representation to other SOAs.

2-IFC Simultaneity Judgment Training

The training phase of the 2-IFC portion of the study was identical to that of the assessment phase with two exceptions: 1) in the same manner described in the 2-AFC training, participants were given feedback as to the accuracy of their responses after each trial; 2) as in the 2-AFC simultaneity judgment training protocol, the range of SOAs presented during training (−150 ms to 150 ms by 50-ms increments) was restricted in training as compared to assessment (−300 ms to 300 ms). However, unlike the 2-AFC version of this training, the ratio of simultaneous to non-simultaneous presentation was always 1:1.

2-IFC Training Protocol

Participants underwent training in five 1-hour blocks (one hour per day) on the 2-IFC version of the simultaneity judgment task. Each day’s 2-IFC training began with a simultaneity judgment assessment followed by three shorter blocks of training, and ended with a post-training simultaneity judgment assessment.

Follow-Up Assessment

A subset of the 2-IFC training cohort described above (n = 10, 7 female; mean age = 20.9) returned to the lab one week after cessation of training and underwent one simultaneity judgment assessment and one SIFI assessment without any training.

Data Analysis

All data were imported from E-Prime 2.0 text files into MatLab 7.7.0.471 R2008b (The Mathworks, Inc., Natick, MA) via a custom-made script for this purpose. Individual subject raw data were used to calculate the mean probability of simultaneity judgment (2-AFC), accuracy (2-IFC), and proportion of trials at which two flashes were reported (SIFI) at each SOA for all assessments. These means were then analyzed in multiple ways as summarized in the Results section.

Estimation of Window Size

Mean data from each individual were fit with two sigmoid curves generated using the MatLab glmfit function, splitting the data into left (auditory presented first) and right (visual presented first) sides and fitting them separately. For the 2-AFC tasks, the criterion at which to measure the breadth of the temporal window was equal to 75% of the maximum data point at baseline assessment. For the 2-IFC task, this criterion was set at half the distance between individuals’ lowest accuracy point at baseline assessment and 1 (also ~75% accuracy). These criteria were then used to assess the breadth of the distributions produced by each individual’s assessment data throughout the duration of the training period. Distribution breadth was then assessed for both the left side (from zero to the left-most point at which the sigmoid curve crossed the criterion line) and the right side (from zero to right intersection point) and then combined to get an estimation of total distribution width. This measure was then used as a proxy for the size of each individual’s window at each assessment.

Signal Detection Analysis

In order to determine whether any changes in SIFI performance were the result of a true increase in perceptual sensitivity (d′) or a shift in response bias (β), a signal detection analysis was performed. Perceptual sensitivity (d′) was defined as the ability to discriminate between one flash and multiple flashes^35,55. These parameters were calculated per individual in the following manner:

where z(p) indicates the inverse of the cumulative normal distribution corresponding to the response proportion p. H (hit) denotes correct detection of multiple flashes, and F (false alarm) indicates an incorrect report of multiple flashes.

Additional Information

How to cite this article: Powers, A. R. III et al. Generalization of multisensory perceptual learning. Sci. Rep. 6, 23374; doi: 10.1038/srep23374 (2016).

References

Diederich, A. & Colonius, H. Bimodal and trimodal multisensory enhancement: effects of stimulus onset and intensity on reaction time. Perception & psychophysics 66, 1388–1404 (2004).
Article Google Scholar
Diederich, A. & Colonius, H. Crossmodal interaction in saccadic reaction time: separating multisensory from warning effects in the time window of integration model. Experimental brain research. Experimentelle Hirnforschung 186, 1–22 (2008).
Article PubMed Google Scholar
Diederich, A. & Colonius, H. Crossmodal interaction in speeded responses: time window of integration model. Progress in brain research 174, 119–135 (2009).
Article PubMed Google Scholar
Colonius, H. & Diederich, A. Multisensory interaction in saccadic reaction time: a time-window-of-integration model. Journal of cognitive neuroscience 16, 1000–1009 (2004).
Article PubMed Google Scholar
Colonius, H., Diederich, A. & Steenken, R. Time-window-of-integration (TWIN) model for saccadic reaction time: effect of auditory masker level on visual-auditory spatial interaction in elevation. Brain topography 21, 177–184 (2009).
Article PubMed Google Scholar
Dixon, N. F. & Spitz, L. The detection of auditory visual desynchrony. Perception 9, 719–721 (1980).
Article CAS PubMed Google Scholar
Pandey, P. C., Kunov, H. & Abel, S. M. Disruptive effects of auditory signal delay on speech perception with lipreading. The Journal of auditory research 26, 27–41 (1986).
CAS PubMed Google Scholar
McGrath, M. & Summerfield, Q. Intermodal timing relations and audio-visual speech recognition by normal-hearing adults. The Journal of the Acoustical Society of America 77, 678–685 (1985).
Article CAS ADS PubMed Google Scholar
Meredith, M. A., Nemitz, J. W. & Stein, B. E. Determinants of multisensory integration in superior colliculus neurons. I. Temporal factors. J Neurosci 7, 3215–3229 (1987).
Article CAS PubMed PubMed Central Google Scholar
Lewkowicz, D. J. Perception of auditory-visual temporal synchrony in human infants. J Exp Psychol Hum Percept Perform 22, 1094–1106 (1996).
Article CAS PubMed Google Scholar
Hillock, A. R., Powers, A. R. & Wallace, M. T. Binding of sights and sounds: age-related changes in multisensory temporal processing. Neuropsychologia 49, 461–467 (2011).
Article PubMed Google Scholar
Hillock-Dunn, A. & Wallace, M. T. Developmental changes in the multisensory temporal binding window persist into adolescence. Developmental science 15, 688–696 (2012).
Article PubMed PubMed Central Google Scholar
Powers, A. R. 3rd, Hillock, A. R. & Wallace, M. T. Perceptual training narrows the temporal window of multisensory binding. The Journal of neuroscience: the official journal of the Society for Neuroscience 29, 12265–12274 (2009).
Article CAS Google Scholar
Powers, A. R. 3rd, Hevey, M. A. & Wallace, M. T. Neural correlates of multisensory perceptual learning. The Journal of neuroscience: the official journal of the Society for Neuroscience 32, 6263–6274 (2012).
Article CAS Google Scholar
Jeter, P. E., Dosher, B. A., Petrov, A. & Lu, Z. L. Task precision at transfer determines specificity of perceptual learning. Journal of vision 9, 11–13 (2009).
Article Google Scholar
Lapid, E., Ulrich, R. & Rammsayer, T. Perceptual learning in auditory temporal discrimination: no evidence for a cross-modal transfer to the visual modality. Psychonomic bulletin & review 16, 382–389 (2009).
Article Google Scholar
Polat, U. Making perceptual learning practical to improve visual functions. Vision research 49, 2566–2573 (2009).
Article PubMed Google Scholar
Roth, D. A., Appelbaum, M., Milo, C. & Kishon-Rabin, L. Generalization to untrained conditions following training with identical stimuli. Journal of basic and clinical physiology and pharmacology 19, 223–236 (2008).
Article PubMed Google Scholar
Dosher, B. A. & Lu, Z. L. The functional form of performance improvements in perceptual learning: learning rates and transfer. Psychol Sci 18, 531–539 (2007).
Article PubMed Google Scholar
Nagarajan, S. S., Blake, D. T., Wright, B. A., Byl, N. & Merzenich, M. M. Practice-related improvements in somatosensory interval discrimination are temporally specific but generalize across skin location, hemisphere, and modality. J Neurosci 18, 1559–1570 (1998).
Article CAS PubMed PubMed Central Google Scholar
Fujisaki, W., Shimojo, S., Kashino, M. & Nishida, S. Recalibration of audiovisual simultaneity. Nature neuroscience 7, 773–778, doi: 10.1038/nn1268 (2004).
Article CAS PubMed Google Scholar
Van der Burg, E., Orchard-Mills, E. & Alais, D. Rapid temporal recalibration is unique to audiovisual stimuli. Exp Brain Res 233, 53–59, doi: 10.1007/s00221-014-4085-8 (2015).
Article PubMed Google Scholar
Harvey, C., Van der Burg, E. & Alais, D. Rapid temporal recalibration occurs crossmodally without stimulus specificity but is absent unimodally. Brain Res 1585, 120–130, doi: 10.1016/j.brainres.2014.08.028 (2014).
Article CAS PubMed Google Scholar
Seitz, A. R., Kim, R. & Shams, L. Sound facilitates visual learning. Curr Biol 16, 1422–1427, doi: 10.1016/j.cub.2006.05.048 (2006).
Article CAS PubMed Google Scholar
Stevenson, R. A., Wilson, M. M., Powers, A. R. & Wallace, M. T. The effects of visual training on multisensory temporal processing. Exp Brain Res 225, 479–489, doi: 10.1007/s00221-012-3387-y (2013).
Article PubMed PubMed Central Google Scholar
Stevenson, R. A., Zemtsov, R. K. & Wallace, M. T. Individual differences in the multisensory temporal binding window predict susceptibility to audiovisual illusions. J Exp Psychol Hum Percept Perform 38, 1517–1529, doi: 10.1037/a0027339 (2012).
Article PubMed PubMed Central Google Scholar
Setti, A. et al. Improving the efficiency of multisensory integration in older adults: audio-visual temporal discrimination training reduces susceptibility to the sound-induced flash illusion. Neuropsychologia 61, 259–268, doi: 10.1016/j.neuropsychologia.2014.06.027 (2014).
Article ADS PubMed Google Scholar
Shams, L., Kamitani, Y. & Shimojo, S. Illusions. What you see is what you hear. Nature 408, 788 (2000).
Article CAS ADS PubMed Google Scholar
Shams, L., Kamitani, Y. & Shimojo, S. Visual illusion induced by sound. Brain research 14, 147–152 (2002).
PubMed Google Scholar
Brosvic, G. M. et al. Signal-detection analysis of the Muller-Lyer and the Horizontal-Vertical illusions. Perceptual and motor skills 79, 1299–1304, doi: 10.2466/pms.1994.79.3.1299 (1994).
Article CAS PubMed Google Scholar
Lown, B. A. Quantification of the Muller-Lyer illusion using signal detection theory. Perceptual and motor skills 67, 101–102, doi: 10.2466/pms.1988.67.1.101 (1988).
Article CAS PubMed Google Scholar
Ishigaki, T. & Tanno, Y. The signal detection ability of patients with auditory hallucination: analysis using the continuous performance test. Psychiatry and clinical neurosciences 53, 471–476, doi: 10.1046/j.1440-1819.1999.00586.x (1999).
Article CAS PubMed Google Scholar
Nachmias, J. On the psychometric function for contrast detection. Vision research 21, 215–223 (1981).
Article CAS PubMed Google Scholar
Pelli, D. G. Uncertainty explains many aspects of visual contrast detection and discrimination. J Opt Soc Am A 2, 1508–1532 (1985).
Article CAS ADS PubMed Google Scholar
Green, D. M. & Swets, J. A. Signal detection theory and psychophysics. (Wiley, 1966).
Gescheider, G. A. Psychophysics: method, theory, and application. 2nd edn, (L. Erlbaum Associates, 1985).
Jang, Y., Wixted, J. T. & Huber, D. E. Testing signal-detection models of yes/no and two-alternative forced-choice recognition memory. Journal of experimental psychology. General 138, 291–306, doi: 10.1037/a0015525 (2009).
Article PubMed PubMed Central Google Scholar
Shams, L. & Seitz, A. R. Benefits of multisensory learning. Trends in cognitive sciences 12, 411–417, doi: 10.1016/j.tics.2008.07.006 (2008).
Article PubMed Google Scholar
Mossbridge, J. A., Fitzgerald, M. B., O’Connor, E. S. & Wright, B. A. Perceptual-learning evidence for separate processing of asynchrony and order tasks. J Neurosci 26, 12708–12716, doi: 10.1523/JNEUROSCI.2254-06.2006 (2006).
Article CAS PubMed PubMed Central Google Scholar
Alais, D. & Cass, J. Multisensory perceptual learning of temporal order: audiovisual learning transfers to vision but not audition. PLoS One 5, e11283, doi: 10.1371/journal.pone.0011283 (2010).
Article CAS ADS PubMed PubMed Central Google Scholar
Virsu, V., Oksanen-Hennah, H., Vedenpaa, A., Jaatinen, P. & Lahti-Nuuttila, P. Simultaneity learning in vision, audition, tactile sense and their cross-modal combinations. Experimental brain research. Experimentelle Hirnforschung 186, 525–537 (2008).
Article PubMed Google Scholar
Treisman, M. Temporal discrimination and the indifference interval. Implications for a model of the “internal clock”. Psychological monographs 77, 1–31 (1963).
Article CAS PubMed Google Scholar
Burr, D. & Morrone, C. Time perception: space-time in the brain. Curr Biol 16, R171–173 (2006).
Article CAS PubMed Google Scholar
Burr, D., Silva, O., Cicchini, G. M., Banks, M. S. & Morrone, M. C. Temporal mechanisms of multimodal binding. Proceedings 276, 1761–1769 (2009).
Google Scholar
Alais, D. & Burr, D. The “Flash-Lag” effect occurs in audition and cross-modally. Curr Biol 13, 59–63 (2003).
Article CAS PubMed Google Scholar
Ivry, R. B. & Spencer, R. M. The neural representation of time. Current opinion in neurobiology 14, 225–232 (2004).
Article CAS PubMed Google Scholar
Buhusi, C. V. & Meck, W. H. What makes us tick ? Functional and neural mechanisms of interval timing. Nature reviews 6, 755–765 (2005).
Article CAS PubMed Google Scholar
Johnston, A., Arnold, D. H. & Nishida, S. Spatially localized distortions of event time. Curr Biol 16, 472–479 (2006).
Article CAS PubMed Google Scholar
Barakat, B., Seitz, A. R. & Shams, L. Visual rhythm perception improves through auditory but not visual training. Current biology: CB 25, R60–61, doi: 10.1016/j.cub.2014.12.011 (2015).
Article CAS PubMed Google Scholar
Ronsse, R., Miall, R. C. & Swinnen, S. P. Multisensory integration in dynamical behaviors: maximum likelihood estimation across bimanual skill learning. J Neurosci 29, 8419–8428 (2009).
Article CAS PubMed PubMed Central Google Scholar
Andersen, T. S., Tiippana, K. & Sams, M. Maximum Likelihood Integration of rapid flashes and beeps. Neuroscience letters 380, 155–160 (2005).
Article CAS PubMed Google Scholar
Angelaki, D. E., Gu, Y. & DeAngelis, G. C. Multisensory integration: psychophysics, neurophysiology, and computation. Current opinion in neurobiology 19, 452–458 (2009).
Article CAS PubMed PubMed Central Google Scholar
Deneve, S. & Pouget, A. Bayesian multisensory integration and cross-modal spatial links. Journal of physiology, Paris 98, 249–258 (2004).
Article PubMed Google Scholar
Roach, N. W., Heron, J. & McGraw, P. V. Resolving multisensory conflict: a strategy for balancing the costs and benefits of audio-visual integration. Proceedings 273, 2159–2168 (2006).
Google Scholar
Rosenthal, O., Shimojo, S. & Shams, L. Sound-induced flash illusion is resistant to feedback training. Brain topography 21, 185–192 (2009).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This research was supported by the Vanderbilt Kennedy Center for Research on Human Development, the National Institute for Child Health and Development (Grant HD050860), and the National Institute for Deafness and Communication Disorders (Grant F30 DC009759). Data analysis and revisions were supported by the Integrated Mentored Patient-Oriented Research Training (IMPORT) in Psychiatry grant (5R25MH071584-07) as well as the Clinical Neuroscience Research Training in Psychiatry grant (5T32MH19961-14) from the NIMH. Additional support was provided by the Yale Detre Fellowship for Translational Neuroscience. We thank Drs Calum Avison, Randolph Blake, Maureen Gannon, and Daniel Polley, as well as Leslie Dowell, Matthew Hevey, and Aaron Nidiffer for their technical, conceptual, and editorial assistance.

Author information

Authors and Affiliations

Kennedy Center, Vanderbilt University, Nashville, Tennessee, USA
Albert R. Powers III, Andrea Hillock-Dunn & Mark T. Wallace
Department of Hearing and Speech Sciences, Vanderbilt University, Nashville, Tennessee, USA
Andrea Hillock-Dunn & Mark T. Wallace
Neuroscience Graduate Program, Vanderbilt University, Nashville, Tennessee, USA
Albert R. Powers III & Mark T. Wallace
Medical Scientist Training Program, Vanderbilt University School of Medicine, Nashville, Tennessee, USA
Albert R. Powers III
Department of Psychiatry, Yale University, New Haven, Connecticut, USA
Albert R. Powers III

Authors

Albert R. Powers III
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Hillock-Dunn
View author publications
You can also search for this author in PubMed Google Scholar
Mark T. Wallace
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.P. and A.D. designed the experiments conducted and gathered all data. A.P. performed all data analysis and A.P. and M.W. prepared the main manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Albert R. Powers III.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Powers III, A., Hillock-Dunn, A. & Wallace, M. Generalization of multisensory perceptual learning. Sci Rep 6, 23374 (2016). https://doi.org/10.1038/srep23374

Download citation

Received: 04 September 2015
Accepted: 01 March 2016
Published: 22 March 2016
DOI: https://doi.org/10.1038/srep23374
Springer Nature Limited

This article is cited by

A Randomized Controlled Trial for Audiovisual Multisensory Perception in Autistic Youth
- Jacob I. Feldman
- Kacie Dunham
- Tiffany G. Woynaroski
Journal of Autism and Developmental Disorders (2023)
Lower multisensory temporal acuity in individuals with high schizotypal traits: a web-based study
- Gianluca Marsicano
- Filippo Cerpelloni
- Luca Ronconi
Scientific Reports (2022)
Within- and Cross-Modal Integration and Attention in the Autism Spectrum
- Geneviève Charbonneau
- Armando Bertone
- Olivier Collignon
Journal of Autism and Developmental Disorders (2020)
Audiovisual Temporal Processing in Postlingually Deafened Adults with Cochlear Implants
- Iliza M. Butera
- Ryan A. Stevenson
- Mark T. Wallace
Scientific Reports (2018)
Multisensory perceptual learning is dependent upon task difficulty
- Matthew A. De Niear
- Bonhwang Koo
- Mark T. Wallace
Experimental Brain Research (2016)

Generalization of multisensory perceptual learning

Abstract

Similar content being viewed by others

Introduction

Results

No change in SIFI illusion is seen after training, but improvements are seen in correctly recognizing two-flash conditions

Performance improvements are driven by increased hits across all conditions

Multisensory perceptual training results in changes in sensitivity (d-prime) on the SIFI task, but no changes in response bias

Increases in d-prime correlate with the degree of temporal window narrowing

Discussion

Methods

2-AFC Training

Subjects

2-AFC Simultaneity Judgment Assessment

Sound-Induced Flash Illusion (SIFI) Task

2-AFC Simultaneity Judgment Training

2-AFC Training Protocol

Follow-Up Assessment

2-AFC Exposure

Subjects

Exposure Protocol

2-AFC Exposure

2-IFC Training

Subjects

2-IFC Simultaneity Judgment Assessment

2-IFC Simultaneity Judgment Training

2-IFC Training Protocol

Follow-Up Assessment

Data Analysis

Estimation of Window Size

Signal Detection Analysis

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation