Abstract
While several methods have been proposed to assess the influence of continuous visual cues in parallel numerosity estimation, the impact of temporal magnitudes on sequential numerosity judgments has been largely ignored. To overcome this issue, we extend a recently proposed framework that makes it possible to separate the contribution of numerical and non-numerical information in numerosity comparison by introducing a novel stimulus space designed for sequential tasks. Our method systematically varies the temporal magnitudes embedded into event sequences through the orthogonal manipulation of numerosity and two latent factors, which we designate as “duration” and “temporal spacing”. This allows us to measure the contribution of finer-grained temporal features on numerosity judgments in several sensory modalities. We validate the proposed method on two different experiments in both visual and auditory modalities: results show that adult participants discriminated sequences primarily by relying on numerosity, with similar acuity in the visual and auditory modality. However, participants were similarly influenced by non-numerical cues, such as the total duration of the stimuli, suggesting that temporal cues can significantly bias numerical processing. Our findings highlight the need to carefully consider the continuous properties of numerical stimuli in a sequential mode of presentation as well, with particular relevance in multimodal and cross-modal investigations. We provide the complete code for creating sequential stimuli and analyzing participants’ responses.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Humans can rapidly estimate the number of elements in a collection of items without counting, although in an imprecise manner. Sensitivity to numerical information has been observed even in young infants (Xu & Spelke, 2000) and several nonhuman species (e.g., Agrillo et al., 2008; Bortot et al., 2019; Cantlon & Brannon, 2007), and can emerge from statistical learning mechanisms in deep neural networks (Stoianov & Zorzi, 2012; Zorzi & Testolin, 2018; Testolin, Zou & McClelland, 2020b). Nonsymbolic number processing is thought to be independent of mode of presentation (e.g., arrays of items vs. sequences of flashes) or sensory modality (e.g., visual vs. auditory), since individuals show similar precision in discriminating numerosity from visual collections of elements and sequences of visual, auditory, or tactile events (Anobile et al., 2018; Barth et al., 2003; Tokita & Ishiguchi, 2016). Moreover, the ability to integrate numerical information across sensory modalities seems to appear early in life (Izard et al., 2009; Jordan & Brannon, 2006), and a supra-modal encoding of numerical information is further supported by adaptation studies (Arrighi et al., 2014) and neuroscientific findings (Eger et al., 2003; Nieder, 2012). However, inconsistencies between sensory modalities and mode of presentation have been reported (Droit-Volet et al., 2008; Tokita et al., 2013), including at the neural level (Cavdaroglu & Knops, 2019), calling for the design of more precise experimental paradigms to control for potential confounds.
A long-debated methodological issue in the study of numerosity perception is the possible influence of continuous magnitudes on numerosity judgments. Indeed, converging evidence shows that visual numerosity judgments are influenced by non-numerical quantities such as total surface area, convex hull, or density (Clayton et al., 2015; Dakin et al., 2011; Gebuis & Reynvoet, 2012). According to one hypothesis, these interactions can be explained by a partially overlapping representation of numerical and non-numerical quantities (Walsh, 2003), subserved by shared neural resources (Sokolowski et al., 2017, for a meta-analysis). Others postulate an interference between competing representations at the response selection level after initial parallel processing (Gilmore et al., 2013; Nys & Content, 2012), as also suggested by the variability in interference effects across different contexts (Dramkin et al., 2022). An alternative view instead argues that the influence of non-numerical visual cues suggests that numerical information might be indirectly extracted by weighting non-numerical quantitative information, thus calling into question the very concept of “number sense” (Gebuis et al., 2016; Gevers et al., 2016). Despite the lack of consensus on the underlying mechanism, the tight interplay between continuous visual magnitudes and numerosity is well documented, and several methods have been suggested to assess and estimate the presence of non-numerical biases in parallel numerosity comparison tasks (DeWind et al., 2015; Salti et al., 2017) and to practically generate numerical stimuli with a precise manipulation of their visual features (De Marco & Cutini, 2020).
Nevertheless, the possible effect of continuous temporal cues on sequential numerosity perception is still largely unexplored. A few studies have reported significant interference from temporal duration during parallel numerosity comparison, with an overall overestimation of arrays that are displayed for a longer duration (Javadi & Aichelburg, 2012). However, the latter finding cannot be disentangled from the possible effect of exposure duration (Inglis & Gilmore, 2013). Investigations based on a sequential or dynamic presentation of events offer a contradictory picture. Some studies indicate that duration can influence sequential numerosity judgments, but with divergent results regarding the direction of its effect (Lambrechts et al., 2013; Martin et al., 2017; Philippi et al., 2008; Togoli, Fornaciai, et al., 2021b; Tokita & Ishiguchi, 2011). On the other hand, a large portion of studies on developmental or adult populations failed to find any influence of the duration of the stimuli on numerical judgments of sequences of visual or auditory events (Agrillo et al., 2010; Dormal et al., 2006; Dormal & Pesenti, 2013; Droit-Volet et al., 2003). Such inconsistencies can be partially explained by the lack of a common methodology to investigate temporal biases, and by the fact that most studies limited their investigations to the effect of either the duration of the overall sequence/stimulus or the intervals between events, ignoring potential interferences related to other covarying temporal dimensions such as the duration of individual events.
To overcome this issue and enable more precise investigations on the nature of numerical representation, considering its relationship to temporal magnitude processing, here we extend to the temporal domain a recent framework used to model responses in parallel numerosity comparison tasks (DeWind et al., 2015; Park, 2022). Our goal is to provide a common ground for the investigation of biases generated by continuous magnitudes on parallel and sequential numerosity perception, at the same time promoting the investigation of the interplay between numerical and non-numerical magnitude processing in different sensory modalities. We first describe the method for generating sequences of events that vary in number, duration, and distance in time, as well as the statistical method for response modeling (MATLAB code is made available onlineFootnote 1). To demonstrate its applicability in numerosity perception studies, we then validate the proposed method in two psychophysical experiments involving both visual and auditory modalities, where participants performed a numerosity comparison task with sequences of either flashing dots or tones.
Proposed method
The framework originally developed in the seminal work of DeWind and colleagues (2015; also see Park, 2022, for discussion) consists of the systematic construction of nonsymbolic numerical stimuli by varying several continuous magnitudes alongside numerosity, taking into account the mathematical relationships between numerical and non-numerical features. We extended the original idea to sequences of events, which are characterized not only by their number but also by temporal features.
In particular, we distinguish between intrinsic temporal features, which pertain to the individual events, and extrinsic temporal features, which are related to the entire sequence (see Fig. 1). Following DeWind et al. (2015), we define mean event duration (MED) as the average duration of the individual events and total event duration (TED) as the sum of the length of all pulses (similar to average item size and total surface area, respectively, in spatial arrays of elements). In the case of regular sequences with fixed durations of events, mean event duration corresponds to individual event duration (IED) (equal to individual item size). Similarly, we refer to total stimulus duration (TSD) as the time from the beginning of the first pulse to the end of the last pulse, intervals included, and mean event period (MEP) as the total stimulus duration divided by the number of events (similar to convex hull and sparsity).
Based on the relationship between intrinsic and extrinsic features, we can derive two dimensions orthogonal to numerosity, namely duration and temporal spacing (see Fig. 2a and b). In analogy with the visual features introduced by DeWind and colleagues (2015), these two dimensions can be mathematically defined as
or, in logarithmic scale:
In this sense, duration is the dimension that varies simultaneously with TED and MED, keeping numerosity constant, while temporal spacing is the dimension that varies simultaneously with TSD and MEP, keeping numerosity constant. For a fixed number of events, a change in duration is associated with a change in the average temporal length of the events spread in a fixed interval, while a change in temporal spacing can be imagined as a change in their temporal distance, keeping a fixed average duration.
Conversely, we can mathematically describe the individual temporal features in terms of the three cardinal dimensions numerosity (n), duration (Dur), and temporal spacing (TmSp) as
In addition to the features described above, we can consider coverage (Cov) as an alternative measure of density, as total event duration per stimulus duration:
Note that log scaling the axes produces a linear relationship between the cardinal dimensions and the other temporal features (see Table 1). Moreover, in log space, the distance between stimulus points becomes proportional to the ratios of their features.
In our online repository, we provide a working example of code that can be used to create a stimulus set taking into consideration the features described above. Stimuli are first defined as sequences of timestamps in frames, in order to easily instantiate either visual or auditory sequences with identical temporal properties through different stimulus presentation software. Users can generate regular (with fixed IED and fixed inter-event intervals) or irregular sequences. From the script, users can also easily visualize the relationship between numerosity and the mentioned non-numerical features in the resulting sequential numerical stimuli. A thorough description of the code is provided in the Supplementary Material.
Stimuli generated by orthogonally varying numerosity, duration, and temporal spacing can then be used to estimate the impact of non-numerical magnitudes on the dependent variable using a regression-based approach. Approximate numerical representations have been traditionally modeled as a logarithmically compressed number line, where numerosities are represented by partially overlapping Gaussian distributions in accordance with Weber’s law (Dehaene, 2003; Piazza et al., 2004), so that the discriminability of a change in numerosity is dependent on the difference in log numerosity. Similarly to DeWind et al. (2015), we can extend the same logic to non-numerical magnitudes and estimate how performance in behavioral tasks is affected by log differences in temporal cues. For example, in a sequential numerosity comparison task, trial-by-trial responses can be modeled with a generalized linear model using as regressors the log-ratios of numerosity, duration, and temporal spacing between the two sequences. The combination of the estimated coefficients of the regressors (βNum, βDur, and βTempSp) is informative regarding the influence of numerical and non-numerical quantities on participant’s decisions. If the response is based entirely on numerical ratios and is unaffected by non-numerical temporal features, it will lead to a positive coefficient for numerosity and coefficients for duration and temporal spacing close to zero. In this context, βNum can also be considered an indication of numerical acuity, with larger values of the numerosity coefficients corresponding to better performance in discriminating more difficult ratios (De Wind et al., 2015). Nonzero values for the βDur and βTempSp coefficients instead quantify the influence of temporal features on the participant responses.
The three coefficients can also be thought of as defining a discrimination vector in the parameter space: the vector norm depends on the overall discrimination acuity, while its orientation is informative about the relevant features determining performance (DeWind et al., 2015). In the case of a response unbiased by temporal magnitudes, the discrimination vector is perfectly aligned with the numerosity axis. Significant non-numerical biases, instead, cause the vector to deviate from the numerosity dimension; in such a case, its angle from the numerosity axis can be used to quantify non-numerical bias irrespective of the orientation of the discrimination vector. Moreover, as extensively described by Park (2022), the linear relation (see the equations in Table 1) between the cardinal dimensions of this stimulus space and individual features allows one to derive the impact of individual features on participants’ response from the combination of parameters estimated for the cardinal axes, while avoiding multicollinearity between regressors. For example, a participant consistently selecting the sequence with a longer total duration of the events would be characterized by a positive and equal coefficient for numerosity and duration.
The proposed extension of the framework introduced by DeWind and colleagues (2015) to the temporal domain thus enables a precise quantification of the contribution of temporal features, allowing us to the estimate the overall non-numerical bias on task performance as well as the test of specific hypotheses regarding individual temporal features potentially influencing or driving task response.
Validation: Psychophysical experiments
To evaluate the proposed method and sequential stimuli, we conducted two online experiments where participants performed a numerosity comparison task in either visual or auditory modalities. The objectives of the study were twofold. On one hand, we aimed to empirically evaluate the methodology and test its versatility. At the same time, we sought to better characterize, in the healthy adult population, the effect of temporal magnitudes on numerosity judgments and identify potential differences between sensory modalities in strategy, presence of non-numerical bias, and numerical acuity.
Method and materials
Participants
One hundred and forty individuals took part in the study, with 69 participants completing the comparison task in the visual modality and 71 in the auditory modality. Due to the internet-based data collection procedure (see below), we excluded 13 participants for technical issues related to the presentation of the stimuli (e.g., incorrect screen refresh rate) and 28 participants for showing poor comprehension of task instructions (accuracy below 50% in practice trials) or low attention during the task (low accuracy in the easiest trials or a high number of outlier response times). Finally, 13 additional participants were excluded during the analyses due to a non-satisfactory fit of the statistical model (see Data Analysis section for details on all exclusion criteria). The final sample was composed of 41 participants (28 females) with a mean age of 24.12 (range: 20 – 36) for the visual modality and 45 participants (34 females) with a mean age of 22.47 (range: 20 – 31) for the auditory modality. A power analysis conducted using Monte Carlo simulations (see Supplementary Material) indicated a minimum sample size of 20 participants to detect a non-numerical bias with power above .90, a minimum sample of 20 participants per group to detect a group difference in non-numerical bias with power above .90, and a minimum sample of 40 participants per group to detect a group difference in numerical acuity of d = 1 with power above .70. Participants were volunteers recruited through social media and students from the University of Padova who received course credits for their participation. All participants gave their written informed consent. Research procedures were approved by the Psychological Science Ethics Committee of the University of Padova.
Procedure
The experiment was conducted online through Pavlovia, a hosting platform for running online PsychoPy tasks (www.pavlovia.org). Participants were instructed to perform the task in a single session, on a laptop or computer, in a quiet environment without distractions, sitting approximately one arm from the screen. In the case of the auditory comparison task, participants were free to use the speakers or headphones connected via cable to the computer; 18 participants performed the task using a headset or earphones, while the remaining ones used the computer speakers.
Stimuli
We varied independently numerosity, duration, and temporal spacing across 13 levels evenly spaced on a logarithmic scale. Numerosity varied between 7 and 28, and a similar maximum range of 1:4 was used for duration and temporal spacing. Sequences were not homogeneous, so the individual duration of an event and the single interval between one pulse and the next could vary within the same stimulus, with events lasting between 2 and 16 frames (33–270 ms at 60 Hz) and empty intervals lasting between 3 and 30 frames (50–500 ms). Such minimum durations are considered to be above the visual and auditory temporal resolution thresholds (Kanabus et al., 2002). An initial dataset of 500 pairs was created; to build each pair, we selected a ratio between 1.1:1 and 2:1 independently for numerosity, duration, and temporal spacing. Then sequences were created from the combination of appropriate pairs of numerosity, duration, and temporal spacing randomly selected from the 13 levels. For each participant, at the beginning of the experiment, a random subsample of 120 stimuli was drawn from the initial dataset to obtain an equal range for the three orthogonal dimensions and a balanced distribution of the ratios in the entire dataset and presented in a randomized order.
Stimuli were created in MATLAB R2020a (www.mathworks.com) as sequences of timestamps and instantiated as sequences of visual or auditory events directly in PsychoPy/PsychoJS (Peirce et al., 2019), which enables online experiments to be run with accurate temporal precision (Bridges et al., 2020). Visual stimuli were presented as sequences of flashes (white discs on a gray background) placed centrally on the screen. The dimension of the dot scaled with the resolution of the participants’ screen. Auditory stimuli were presented as sequences of sounds (pure tones at 400 Hz). At the beginning of the experiment, participants were allowed to adjust the volume as they preferred. Independently from modality, all durations were controlled in frames.
Task
All participants performed a computerized numerosity comparison task with either visual or auditory sequences of events. They were presented with pairs of sequences of rapid flashes or tones and were instructed to indicate which one contained more events by pressing the left or right arrow keys to select the first or the second sequence, respectively. Each trial began with a fixation cross lasting 1 s, after which the two sequences were presented, one after the other, with a fixed interval of 2 s between the first and the second one. The duration of the sequences changed depending on the stimulus, ranging from 1.70 s to 6.83 s. To prevent participants from consistently overestimating the duration of the first stimulus, during the interval between the two sequences, a green fixation cross appeared in the center of the screen to indicate the end of the first sequence. After the second sequence, a blank screen was presented until the participant response, followed by a pseudorandom blank inter-trial interval between 500 and 1500 ms. Participants did not receive any feedback based on their responses. Both tasks consisted of 120 test trials (around 40 minutes), divided into four blocks; participants could take a short break between blocks. Before the test phase, each participant completed five easy practice trials (randomly selected from a pool of 30 pairs) with a numerical ratio equal to 1:4 (i.e., 7 vs. 28) and the ratios of non-numerical dimensions varying between 1.1:1 and 2:1. In the practice phase, we selected pairs with an easy numerical ratio and variable temporal features to let participants familiarize themselves with the task, making sure they understood the instructions and to prepare them for the variability in the temporal dimensions that they would experience in the test phase. During the practice phase, the trial structure was identical to that of the test phase, with the only difference that participants received feedback indicating whether their response was correct or not.
Data analysis
Differences in screen refresh rate between participants were examined a posteriori, and all participants with a refresh rate different from 60 Hz were discarded, independently from stimulus presentation modality. Participants who were not appropriately engaged in the task were excluded from the analysis. One participant in the visual task and two participants in the auditory task were discarded due to inaccurate performance in practice trials (below 50%). Furthermore, we considered the largest numerical ratio (1:2) as catch trials, discarding participants with an accuracy below 75% in this easiest condition. This led to the exclusion of 12 participants in each sensory modality. To remove unattended trials, in both modalities we also discarded trials where a response was recorded before 200 ms (anticipations) or later than 4 s (Halberda et al., 2012), planning to exclude participants if more than 20% of their total trials were discarded. Based on this cut-off, we excluded one additional participant in the auditory task.
We then modeled individual responses (selection of the second sequence) with a generalized linear model (GLM) with binomial distribution and probit link function, using as regressors the log-ratios (second/first sequence) of numerosity, duration, and temporal spacing. We excluded from subsequent analyses eight participants for the visual task and five participants for the auditory task whose performance was not well captured by the GLM, indicated by an R2adj below 0.2 (Piazza et al., 2010). We tested the significance of coefficients at the group level with one-sample Student t-tests against zero. A positive coefficient for duration is associated with an overestimation of long events, while a negative coefficient indicates that shorter events are perceived as more numerous. Similarly, a positive coefficient for temporal spacing indicates that events separated by longer intervals are perceived as more numerous, while a negative coefficient is related to an overestimation of higher rates of events. The estimated coefficients were also used to calculate the nondirectional angle between individual discrimination vectors and the numerosity axis. To assess the proximity of the discrimination vector to the different feature dimensions, we computed the vector projections onto the dimensions of the temporal features, and we determined the closest to the discrimination vector at the group level with a series of paired t-tests. Nonparametric tests (Wilcoxon signed rank) were performed in case of violation of the normality assumption, and Bonferroni's method was used to correct multiple comparisons whenever necessary. We also assessed differences in acuity and bias between modalities with frequentist and Bayesian t-tests. In the latter case, we report the Bayes factor BF10, expressing the probability of current data under the alternative hypothesis (H1) relative to the null hypothesis (H0) (Kass & Raftery, 1995). Values larger than 1 are in favor of H1, and values smaller than 1 are in support of H0. A BF between 1 and 3 (or between 1 and 0.33) can be considered anecdotal evidence, a BF between 3 and 10 (0.33–0.10) can be considered moderate evidence, and a BF larger than 10 (< 0.03) can be considered as strong evidence (van Doorn et al., 2020). Bayesian analyses were conducted using JASP ver. 0.12.1 2020 (www.jasp-stats.org), with default priors. Analyses were otherwise performed with MATLAB.
Results
The proportion of correct responses over the total number of trials was above chance in both the visual (mean 0.80, range 0.68–0.88) and auditory modality (mean 0.81, range 0.65–0.93). Mean response times ranged between 457 and 1498 ms (mean 878 ms) in the visual modality and between 454 and 1506 ms (mean 1010 ms) in the auditory modality.
The GLM fit was significantly better than a constant model for all participants in the final sample of both the visual (mean R2adj = 0.44, all χ2 > 29.82, all ps < .001) and the auditory (mean R2adj = 0.46, all χ2 > 27.71, all ps < .001) tasks. The individual coefficient estimates are presented in Fig. 3, with additional lines representing non-orthogonal temporal features. At the group level, the model coefficients were significantly different from zero for numerosity (M = 1.89, SD = 0.53, t(40) = 22.96, p < .001, d = 3.59), duration (M = −0.16, SD = 0.32, t(40) = −3.21, p = .002, d = −0.50), and temporal spacing (M = 0.24, SD = 0.40, t(40) = 3.85, p < .001, d = 0.60) in the visual modality. For the auditory task, βNum (M = 1.95, SD = 0.77, t(44) = 16.90, p < .001, d = 2.52) and βTmSp (M = 0.31, SD = 0.51, t(44) = 4.06, p < .001, d = 0.61) were significantly different from zero at group level, while the coefficient magnitude of duration was not (M = −0.05, SD = 0.31, t(44) = −1.16, p = .25, d = −0.17) (see Fig. 4a).
For a better visualization of group results, we fit mixed-effect models separately for the two modalities, with similar fixed effects but including by-subject random intercepts and slopes of numerosity, duration, and temporal spacing. Psychometric curves obtained by the estimated fixed effect from these two models are presented in Fig. 4b. Both the individual results and the group model (further described in the Supplementary Material) indicate that our participants discriminated sequences primarily on the basis of the number of flashes and tones, but that they were also marginally affected by non-numerical temporal cues of the stimulus sequences.
The combination of coefficient weights suggest that participants’ responses were primarily based on a numerical strategy. This conclusion is further supported by the analysis of the discrimination vectors: we computed the vector projections onto the non-orthogonal dimensions at the individual level, and tested whether any other magnitude projection was higher than the numerosity coefficient. Differences between the numerosity coefficient and the magnitude projections are shown in Fig. 5. βNum was higher than all the other projections on the total event duration, mean event duration, total stimulus duration, mean event period, and coverage dimensions in both the visual modality (paired t-tests with Bonferroni correction: all ts(40) > 8.16, ps < .01, ds > 1.28) and the auditory modality (Wilcoxon signed-rank tests with Bonferroni correction: all W > 930.00, ps < .01, r > 0.79).
The magnitude of the numerosity coefficient was not significantly different between the visual and auditory tasks (t(84) = 0.38, p = .70, d = 0.08, BF10 = 0.24 (0.03), moderate evidence), suggesting a similar numerical acuity across sensory modalities. As for non-numerical biases, we estimated the angle between the numerosity dimension and individual discrimination vectors as a nondirectional measure of non-numerical bias (see Fig. 6). No difference emerged in the vector line angle estimated in visual (M = 15.15°, SD = 6.59) and auditory (M = 16.50°, SD = 9.40) modality (U = 941, p = .88, r = 0.02, BF10 = 0.29 (0.03), moderate evidence). We directly compared the coefficients of duration between tasks, and we did not find a significant difference between the weights of duration in the two modalities (t(84) = 1.58, p = .12, d = 0.34, BF10 = 0.67), despite the significant result from the one-sample t-test in the visual modality.
Discussion
The present work aimed to extend the methodology proposed by DeWind and colleagues (2015) to enable the assessment of temporal biases in sequential numerosity perception. Our main goal was to create a common framework for the study of both visual cues and temporal biases, facilitating investigations across multiple sensory modalities and modes of presentation, at the same time improving the comparability of measures used in the investigation of non-symbolic number processing. Besides describing the theoretical bases for the creation of the proposed stimulus space, we also provide computer code to generate sequences of timestamps where numerosity, duration and temporal spacing are orthogonally varied. We validated our method in an empirical study, demonstrating that it can be effectively used to quantify how much participants rely on numerical and temporal information during sequential numerosity discrimination, and to identify relevant cues impacting numerosity judgments in different sensory modalities.
Our findings show that adult individuals discriminated sequences primarily on the basis of the number of flashes and tones and were only marginally affected by non-numerical temporal cues embedded in the stimulus sequences. The prominent numerical strategy adopted by our participants adds to the growing body of evidence suggesting that performance in numerosity estimation tasks cannot be explained by the unique processing of non-numerical magnitudes (Abalo-Rodríguez et al., 2022; Cicchini et al., 2019; DeWind et al., 2015). However, although our analyses rule out the possibility that participants relied consistently on any single time cue considered, our behavioral investigation cannot exclude a more complex and dynamic integration of non-numerical magnitudes (Gebuis & Reynvoet, 2012). To address this issue, the stimulus space introduced in this work could be exploited in electrophysiological and neuroimaging studies, as done with the original space from which we took inspiration (DeWind et al., 2018; Fornaciai et al., 2017; but see Park, 2018). Furthermore, it would be interesting to adopt our sequential stimulus space to investigate whether numerosity information would still play a significant role even when the task explicitly requires comparing non-numerical magnitudes (e.g., duration or frequency), as the original space has proven useful to identify numerosity biases on area judgements in 5-year-old children (Tomlinson et al., 2020). Indeed, a significant influence of numerosity has been reported not only on spatial judgments of density and area involving parallel presentation of visual stimuli (Cicchini et al., 2016), but also on judgments on temporal features such as the duration of sequences of flashing dots (Dormal et al., 2006).
Our results also revealed that numerosity comparison was affected, to some extent, by task-irrelevant characteristics of the stimuli, mostly related to the temporal spread of the events. These results are in line with previous reports of temporal biases in numerosity discrimination, although the direction of the influence is inconsistent across studies (Lambrechts et al., 2013; Martin et al., 2017; Philippi et al., 2008; Tokita & Ishiguchi, 2011). Some reported an overestimation of the numerosity of dynamically appearing arrays presented in shorter time intervals, suggesting an effect of the rate of evidence accumulation on the noisiness of magnitude representations (Lambrechts et al., 2013; Martin et al., 2017). In contrast to this interpretation, we found a tendency to overestimate the number of events in longer sequences with longer blank intervals, in both visual and auditory modalities. At least one study reports an underestimation of sequences when events were separated by shorter intervals in visual, auditory, and tactile modality, with a larger underestimation of visual events, compared to other modalities, for shorter intervals (Philippi et al., 2008). However, the authors suggest that this interaction could have emerged from a flicker-fusion illusion for visual sequences, due to the high presentation rates of their stimuli (up to roughly 33 Hz) (Levinson, 1968). To avoid a similar effect, in our stimuli we kept the minimum interval between pulses above 50 ms and we used irregular durations of events and intervals. Moreover, although we cannot completely exclude that underestimation of faster sequences could have emerged from a perceptual fusion of extremely close pulses in the visual modality, the parallel result in the auditory modality, where the temporal resolution is largely below the average frequency of presentation, is a strong indicator that our results cannot be explained by a possible fusion of close events. It should also be noted that our stimulus space was designed in terms of physical, not perceptual, units. Although some studies highlighted similarities in the discrimination precision of numerical and temporal magnitudes (Feigenson, 2007; Droit-Volet et al., 2008), some studies report the opposite (Odic, 2018). Therefore, future studies could better investigate whether the intensity range in each dimension allows for a similar discriminability across all axes of the stimulus space and additionally calibrate the ranges in terms of “just noticeable difference” units.
The current design does not allow us to determine whether the non-numerical bias originated from a shared neural representation of magnitude (Bueti & Walsh, 2009) or a non-perceptual competition at the response selection level. To address this question, a similar manipulation of features could also be used to assess the spontaneous saliency of different numerical and temporal magnitudes during development (Roitman et al., 2007) or test the existence of cross-dimensional adaptation or serial dependency effects on temporal numerosity (Togoli, Fedele, et al., 2021a; Tsouli et al., 2019). Furthermore, a promising avenue for future research would be to use this type of stimuli to probe computational models of numerosity perception, which often exhibit the same type of non-numerical biases observed in human participants (Testolin et al., 2020a; Zorzi & Testolin, 2018).
The discrimination accuracy and the overall contribution of temporal magnitudes were similar in the visual and auditory sensory modalities. Although the moderate statistical evidence does not allow us to draw strong conclusions, this finding is in line with previous reports of correlations in the estimation precision of sequences of flashes and sounds, and similar numerical acuity across auditory or tactile modalities (Anobile et al., 2016; Tokita & Ishiguchi, 2016). Collectively, these results suggest that basic numerosity processing in different senses could rely on a common mechanism. It could be noted that the duration coefficient was significantly different from zero in the visual but not in the auditory modality. However, the direct contrast between the two duration coefficients was not significant. Nevertheless, we believe that a within-subject design would be more appropriate for a fine-grained comparison of coefficient values across sensory modalities.
Overall, these results illustrate how the proposed approach can be used to study numerosity perception and temporal biases in different modalities such as visual, auditory, or tactile. Crucially, this modality-nonspecific manipulation paves the way to the investigation of the impact of temporal cues in cross-modal or multimodal experimental designs as well. For this reason, we believe that it can represent a useful method to further clarify the interplay between number and time and lead to a more comprehensive understanding of human quantitative reasoning.
Data availability
The datasets analyzed in the current study and the code to perform the analysis are available in the GitHub repository: https://github.com/CCNL-UniPD/temporal-bias-numseq
Code availability
The code to generate stimuli with the procedure illustrated in this study is available in the GitHub repository: https://github.com/CCNL-UniPD/temporal-bias-numseq
References
Abalo-Rodríguez, I., De Marco, D., & Cutini, S. (2022). An undeniable interplay: Both numerosity and visual features affect estimation of non-symbolic stimuli. Cognition, 222, 104944. https://doi.org/10.1016/j.cognition.2021.104944
Agrillo, C., Dadda, M., Serena, G., & Bisazza, A. (2008). Do fish count? Spontaneous discrimination of quantity in female mosquitofish. Animal Cognition, 11(3), 495–503. https://doi.org/10.1007/s10071-008-0140-9
Agrillo, C., Ranpura, A., & Butterworth, B. (2010). Time and numerosity estimation are independent: Behavioral evidence for two different systems using a conflict paradigm. Cognitive Neuroscience, 1(2), 96–101. https://doi.org/10.1080/17588921003632537
Anobile, G., Arrighi, R., Castaldi, E., Grassi, E., Pedonese, L., Moscoso, P. A. M., & Burr, D. C. (2018). Spatial but not temporal numerosity thresholds correlate with formal math skills in children. Developmental Psychology, 54(3), 458–473. https://doi.org/10.1037/dev0000448
Anobile, G., Cicchini, G. M., & Burr, D. C. (2016). Number As a Primary Perceptual Attribute: A Review. Perception (Vol. 45, Issues 1–2). https://doi.org/10.1177/0301006615602599
Arrighi, R., Togoli, I., & Burr, D. C. (2014). A generalized sense of number. Proceedings of the Royal Society B: Biological Sciences, 281(1797), 1–7. https://doi.org/10.1098/rspb.2014.1791
Barth, H., Kanwisher, N., & Spelke, E. (2003). The construction of large number representations in adults. Cognition, 86(3), 201–221. https://doi.org/10.1016/S0010-0277(02)00178-6
Bortot, M., Agrillo, C., Avarguès-Weber, A., Bisazza, A., Petrazzini, M. E. M., & Giurfa, M. (2019). Honeybees use absolute rather than relative numerosity in number discrimination. Biology Letters, 15(6), 5–9. https://doi.org/10.1098/rsbl.2019.0138
Bridges, D., Pitiot, A., MacAskill, M. R., & Peirce, J. W. (2020). The timing mega-study: comparing a range of experiment generators, both lab-based and online. PeerJ, 8, e9414. https://doi.org/10.7717/peerj.9414
Bueti, D., & Walsh, V. (2009). The parietal cortex and the representation of time, space, number and other magnitudes. Philosophical Transactions of the Royal Society B: Biological Sciences, 364(1525), 1831–1840. https://doi.org/10.1098/rstb.2009.0028
Cantlon, J. F., & Brannon, E. M. (2007). Basic math in monkeys and college students. PLoS Biology, 5(12), 2912–2919. https://doi.org/10.1371/journal.pbio.0050328
Cavdaroglu, S., & Knops, A. (2019). Evidence for a Posterior Parietal Cortex Contribution to Spatial but not Temporal Numerosity Perception. Cerebral Cortex, 29(7), 2965–2977. https://doi.org/10.1093/cercor/bhy163
Cicchini, G. M., Anobile, G., & Burr, D. C. (2016). Spontaneous perception of numerosity in humans. Nature Communications, 7(1), 12536.
Cicchini, G. M., Anobile, G., & Burr, D. C. (2019). Spontaneous representation of numerosity in typical and dyscalculic development. Cortex, 114, 151–163. https://doi.org/10.1016/j.cortex.2018.11.019
Clayton, S., Gilmore, C., & Inglis, M. (2015). Dot comparison stimuli are not all alike: The effect of different visual controls on ANS measurement. Acta Psychologica, 161, 177–184. https://doi.org/10.1016/j.actpsy.2015.09.007
Dakin, S. C., Tibber, M. S., Greenwood, J. A., Kingdom, F. A. A., & Morgan, M. J. (2011). A common visual metric for approximate number and density. Proceedings of the National Academy of Sciences of the United States of America, 108(49), 19552–19557. https://doi.org/10.1073/pnas.1113195108
Dehaene, S. (2003). The neural basis of the Weber-Fechner law: A logarithmic mental number line. Trends in Cognitive Sciences, 7(4), 145–147. https://doi.org/10.1016/S1364-6613(03)00055-X
De Marco, D., & Cutini, S. (2020). Introducing CUSTOM: A customized, ultraprecise, standardization-oriented, multipurpose algorithm for generating nonsymbolic number stimuli. Behavior Research Methods, 52, 1528–1537.
DeWind, N. K., Adams, G. K., Platt, M. L., & Brannon, E. M. (2015). Modeling the approximate number system to quantify the contribution of visual stimulus features. Cognition, 142, 247–265. https://doi.org/10.1016/j.cognition.2015.05.016
DeWind, N. K., Park, J., Woldorff, M. G., & Brannon, E. M. (2018). Numerical encoding in early visual cortex. Cortex, 1–14. https://doi.org/10.1016/j.cortex.2018.03.027
Dormal, V., & Pesenti, M. (2013). Processing numerosity, length and duration in a three-dimensional Stroop-like task: Towards a gradient of processing automaticity? Psychological Research, 77(2), 116–127. https://doi.org/10.1007/s00426-012-0414-3
Dormal, V., Seron, X., & Pesenti, M. (2006). Numerosity-duration interference: A Stroop experiment. Acta Psychologica, 121(2), 109–124. https://doi.org/10.1016/j.actpsy.2005.06.003
Dramkin, D., Bonn, C. D., Baer, C., & Odic, D. (2022). The malleable impact of non-numeric features in visual number perception. Acta Psychologica, 230, 103737. https://doi.org/10.1016/j.actpsy.2022.103737
Droit-Volet, S., Clement, A., & Fayol, M. (2008). Time, number and length: Similarities and differences in discrimination in adults and children. Quarterly Journal of Experimental Psychology, 61(12), 1827–1846. https://doi.org/10.1080/17470210701743643
Droit-Volet, S., Clément, A., & Fayol, M. (2003). Time and number discrimination in a bisection task with a sequence of stimuli: A developmental approach. Journal of Experimental Child Psychology, 84(1), 63–76. https://doi.org/10.1016/S0022-0965(02)00180-7
Eger, E., Sterzer, P., Russ, M. O., Giraud, A. L., & Kleinschmidt, A. (2003). A supramodal number representation in human intraparietal cortex. Neuron, 37(4), 719–726. https://doi.org/10.1016/S0896-6273(03)00036-9
Feigenson, L., Dehaene, S., & Spelke, E. (2004). Core systems of number. Trends in Cognitive Sciences, 8(7), 307–314. https://doi.org/10.1016/j.tics.2004.05.002
Feigenson, L. (2007). The equality of quantity. Trends in Cognitive Sciences, 11(5), 185–187. https://doi.org/10.1016/j.tics.2007.01.006
Fornaciai, M., Brannon, E. M., Woldorff, M. G., & Park, J. (2017). Numerosity processing in early visual cortex. NeuroImage, 157, 429–438. https://doi.org/10.1016/j.neuroimage.2017.05.069
Gebuis, T., Cohen Kadosh, R., & Gevers, W. (2016). Sensory-integration system rather than approximate number system underlies numerosity processing: A critical review. Acta Psychologica, 171, 17–35. https://doi.org/10.1016/j.actpsy.2016.09.003
Gebuis, T., & Reynvoet, B. (2012). The interplay between nonsymbolic number and its continuous visual properties. Journal of Experimental Psychology: General, 141(4), 642–648. https://doi.org/10.1037/a0026218
Gevers, W., Cohen Kadosh, R., & Gebuis, T. (2016). Sensory Integration Theory: An Alternative to the Approximate Number System. Continuous Issues in Numerical Cognition: How Many or How Much, 405–418. https://doi.org/10.1016/B978-0-12-801637-4.00018-4
Gilmore, C., Attridge, N., Clayton, S., Cragg, L., Johnson, S., Simms, V., & Inglis, M. (2013). Individual Differences in Inhibitory Control, Not Non-Verbal Number Acuity. Correlate with Mathematics Achievement. PLoS ONE, 8(6), e67374. https://doi.org/10.1371/journal.pone.0067374
Halberda, J., Ly, R., Wilmer, J. B., Naiman, D. Q., & Germine, L. (2012). Number sense across the lifespan as revealed by a massive Internet-based sample. Proceedings of the National Academy of Sciences of the United States of America, 109(28), 11116–11120. https://doi.org/10.1073/pnas.1200196109
Inglis, M., & Gilmore, C. (2013). Sampling from the mental number line: How are approximate number system representations formed? Cognition, 129(1), 63–69. https://doi.org/10.1016/j.cognition.2013.06.003
Izard, V., Sann, C., Spelke, E. S., & Streri, A. (2009). Newborn infants perceive abstract numbers. Proceedings of the National Academy of Sciences, 106(25), 10382–10385. https://doi.org/10.1073/pnas.0812142106
Javadi, A. H., & Aichelburg, C. (2012). When time and numerosity interfere: The longer the more, and the more the longer. PLoS ONE, 7(7), 1–9. https://doi.org/10.1371/journal.pone.0041496
Jordan, K. E., & Brannon, E. M. (2006). The multisensory representation of number in infancy. Proceedings of the National Academy of Sciences, 103(9), 3486–3489. https://doi.org/10.1073/pnas.0508107103
Kanabus, M., & Szela̧g, E., Rojek, E., & Pöppel, E. (2002). Temporal order judgement for auditory and visual stimuli. Acta Neurobiologiae Experimentalis, 62(4), 263–270.
Kass, R. E., & Raftery, A. E. (1995). Bayes factors. Journal of the American Statistical Association, 90(430), 773–795.
Lambrechts, A., Walsh, V., & Van Wassenhove, V. (2013). Evidence accumulation in the magnitude system. PLoS ONE, 8(12). https://doi.org/10.1371/journal.pone.0082122
Levinson, J. Z. (1968). Flicker Fusion Phenomena: Rapid flicker is attenuated many times over by repeated temporal summation before it is “seen.” Science, 160(3823), 21–28.
Martin, B., Wiener, M., & Van Wassenhove, V. (2017). A Bayesian Perspective on Accumulation in the Magnitude System. Scientific Reports, 7(1), 1–14. https://doi.org/10.1038/s41598-017-00680-0
Nieder, A. (2012). Supramodal numerosity selectivity of neurons in primate prefrontal and posterior parietal cortices. Proceedings of the National Academy of Sciences of the United States of America, 109(29), 11860–11865. https://doi.org/10.1073/pnas.1204580109
Nys, J., & Content, A. (2012). Judgement of discrete and continuous quantity in adults: Number counts! Quarterly Journal of Experimental Psychology, 65(4), 675–690. https://doi.org/10.1080/17470218.2011.619661
Odic, D. (2018). Children’s intuitive sense of number develops independently of their perception of area, density, length, and time. Developmental Science, 21(2), 1–15. https://doi.org/10.1111/desc.12533
Park, J. (2018). A neural basis for the visual sense of number and its development: A steady-state visual evoked potential study in children and adults. Developmental Cognitive Neuroscience, 30, 333–343. https://doi.org/10.1016/j.dcn.2017.02.011
Park, J. (2022). Flawed stimulus design in additive-area heuristic studies. Cognition, 229, 104919. https://doi.org/10.1016/j.cognition.2021.104919
Peirce, J., Gray, J. R., Simpson, S., MacAskill, M., Höchenberger, R., Sogo, H., Kastman, E., & Lindeløv, J. K. (2019). PsychoPy2: Experiments in behavior made easy. Behavior Research Methods, 51(1), 195–203. https://doi.org/10.3758/s13428-018-01193-y
Philippi, T. G., van Erp, J. B. F., & Werkhoven, P. J. (2008). Multisensory temporal numerosity judgment. Brain Research, 1242, 116–125. https://doi.org/10.1016/j.brainres.2008.05.056
Piazza, M., Izard, V., Pinel, P., Le Bihan, D., & Dehaene, S. (2004). Tuning curves for approximate numerosity in the human intraparietal sulcus. Neuron, 44(3), 547–555. https://doi.org/10.1016/j.neuron.2004.10.014
Piazza, M., Facoetti, A., Trussardi, A. N., Berteletti, I., Conte, S., Lucangeli, D., Dehaene, S., & Zorzi, M. (2010). Developmental trajectory of number acuity reveals a severe impairment in developmental dyscalculia. Cognition, 116(1), 33–41. https://doi.org/10.1016/j.cognition.2010.03.012
Roitman, J. D., Brannon, E. M., Andrews, J. R., & Platt, M. L. (2007). Nonverbal representation of time and number in adults. Acta Psychologica, 124(3), 296–318. https://doi.org/10.1016/j.actpsy.2006.03.008
Salti, M., Katzin, N., Katzin, D., Leibovich, T., & Henik, A. (2017). One tamed at a time: A new approach for controlling continuous magnitudes in numerical comparison tasks. Behavior Research Methods, 49(3), 1120–1127. https://doi.org/10.3758/s13428-016-0772-7
Sokolowski, H. M., Fias, W., BosahOnonye, C., & Ansari, D. (2017). Are numbers grounded in a general magnitude processing system? A functional neuroimaging meta-analysis. Neuropsychologia, 105, 50–69. https://doi.org/10.1016/j.neuropsychologia.2017.01.019
Stoianov, I., & Zorzi, M. (2012). Emergence of a “visual number sense” in hierarchical generative models. Nature Neuroscience, 15(2), 194–196. https://doi.org/10.1038/nn.2996
Testolin, A., Dolfi, S., Rochus, M., & Zorzi, M. (2020a). Visual sense of number vs. sense of magnitude in humans and machines. Scientific Reports, 10(1), 1–13. https://doi.org/10.1038/s41598-020-66838-5
Testolin, A., Zou, W. Y., & McClelland, J. L. (2020b). Numerosity discrimination in deep neural networks: Initial competence, developmental refinement and experience statistics. Developmental Science, 23(5), e12940.
Togoli, I., Fedele, M., Fornaciai, M., & Bueti, D. (2021a). Serial dependence in time and numerosity perception is dimension-specific. Journal of Vision, 21(5), 1–15. https://doi.org/10.1167/jov.21.5.6
Togoli, I., Fornaciai, M., & Bueti, D. (2021b). The specious interaction of time and numerosity perception. Proceedings of the Royal Society B: Biological Sciences, 288(1959). https://doi.org/10.1098/rspb.2021.1577
Tokita, M., Ashitani, Y., & Ishiguchi, A. (2013). Is approximate numerical judgment truly modality-independent? Visual, auditory, and cross-modal comparisons. Attention, Perception, and Psychophysics, 75(8), 1852–1861. https://doi.org/10.3758/s13414-013-0526-x
Tokita, M., & Ishiguchi, A. (2011). Temporal information affects the performance of numerosity discrimination: Behavioral evidence for a shared system for numerosity and temporal processing. Psychonomic Bulletin and Review, 18(3), 550–556. https://doi.org/10.3758/s13423-011-0072-2
Tokita, M., & Ishiguchi, A. (2016). Precision and Bias in Approximate Numerical Judgment in Auditory, Tactile, and Cross-modal Presentation. Perception, 45(1–2), 56–70. https://doi.org/10.1177/0301006615596888
Tomlinson, R. C., DeWind, N. K., & Brannon, E. M. (2020). Number sense biases children’s area judgments. Cognition, 204, 104352. https://doi.org/10.1016/j.cognition.2020.104352
Tsouli, A., Dumoulin, S. O., te Pas, S. F., & van der Smagt, M. J. (2019). Adaptation reveals unbalanced interaction between numerosity and time. Cortex, 114, 5–16. https://doi.org/10.1016/j.cortex.2018.02.013
van Doorn, J., van den Bergh, D., Böhm, U., Dablander, F., Derks, K., Draws, T., Etz, A., Evans, N. J., Gronau, Q. F., Haaf, J. M., Hinne, M., Kucharský, Š., Ly, A., Marsman, M., Matzke, D., Gupta, A. R. K. N., Sarafoglou, A., Stefan, A., Voelkel, J. G., & Wagenmakers, E. J. (2020). The JASP guidelines for conducting and reporting a Bayesian analysis. Psychonomic Bulletin and Review, 813–826. https://doi.org/10.3758/s13423-020-01798-5
Walsh, V. (2003). A theory of magnitude: Common cortical metrics of time, space and quantity. Trends in Cognitive Sciences, 7(11), 483–488. https://doi.org/10.1016/j.tics.2003.09.002
Xu, F., & Spelke, E. (2000). Large number discrimination in 6-month-old infants: Evidence for the accumulator model. Cognition, 74, B1–B11. https://doi.org/10.1016/s0163-6383(98)91983-x
Zorzi, M., & Testolin, A. (2018). An emergentist perspective on the origin of number sense. Philosophical Transactions of the Royal Society B: Biological Sciences, 373(1740). https://doi.org/10.1098/rstb.2017.0043
Funding
Open access funding provided by Università degli Studi di Padova within the CRUI-CARE Agreement. This work was supported by the Cariparo Foundation (Italy) [Excellence Grant “NUMSENSE” to M.Z.].
Author information
Authors and Affiliations
Contributions
All authors conceptualized the methodology and the study. S.D. implemented the methodology and performed the investigation. S.D. and A.T. wrote the manuscript. S.C. and M.Z. supervised the project. All authors revised and approved the final version of the manuscript.
Corresponding author
Ethics declarations
Ethics approval
The research procedure and methodology for this study was approved by the Psychological Science Ethics Committee of the University of Padova.
Consent to participate
Informed consent was obtained from all individual participants included in the study.
Competing interest
The authors have no competing interests to declare that are relevant to the content of this article.
Open practices statement
The data and materials for all experiments are available at https://github.com/CCNL-UniPD/temporal-bias-numseq and none of the experiments was preregistered.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Dolfi, S., Testolin, A., Cutini, S. et al. Measuring temporal bias in sequential numerosity comparison. Behav Res 56, 7561–7573 (2024). https://doi.org/10.3758/s13428-024-02436-x
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.3758/s13428-024-02436-x