Europe PMC
Nothing Special   »   [go: up one dir, main page]

Europe PMC requires Javascript to function effectively.

Either your web browser doesn't support Javascript or it is currently turned off. In the latter case, please turn on Javascript support in your web browser and reload this page.

This website requires cookies, and the limited processing of your personal data in order to function. By using the site you are agreeing to this as outlined in our privacy notice and cookie policy.

Abstract 


A growing number of studies have shown a connection between rhythmic processing and language skill. It has been proposed that domain-general rhythm abilities might help children to tap into the rhythm of speech (prosody), cueing them to prosodic markers of grammatical (syntactic) information during language acquisition, thus underlying the observed correlations between rhythm and language. Working memory processes common to task demands for musical rhythm discrimination and spoken language paradigms are another possible source of individual variance observed in musical rhythm and language abilities. To investigate the nature of the relationship between musical rhythm and expressive grammar skills, we adopted an individual differences approach in N = 132 elementary school-aged children ages 5-7, with typical language development, and investigated prosodic perception and working memory skills as possible mediators. Aligning with the literature, musical rhythm was correlated with expressive grammar performance (r = 0.41, p < 0.001). Moreover, musical rhythm predicted mastery of complex syntax items (r = 0.26, p = 0.003), suggesting a privileged role of hierarchical processing shared between musical rhythm processing and children's acquisition of complex syntactic structures. These relationships between rhythm and grammatical skills were not mediated by prosodic perception, working memory, or non-verbal IQ; instead, we uncovered a robust direct effect of musical rhythm perception on grammatical task performance. Future work should focus on possible biological endophenotypes and genetic influences underlying this relationship.

Free full text 


Logo of scirepAboutEditorial BoardFor AuthorsScientific Reports
Sci Rep. 2023; 13: 2201.
Published online 2023 Feb 7. https://doi.org/10.1038/s41598-022-21902-0
PMCID: PMC9905575
PMID: 36750727

Exploring individual differences in musical rhythm and grammar skills in school-aged children with typically developing language

Associated Data

Supplementary Materials
Data Availability Statement

Abstract

A growing number of studies have shown a connection between rhythmic processing and language skill. It has been proposed that domain-general rhythm abilities might help children to tap into the rhythm of speech (prosody), cueing them to prosodic markers of grammatical (syntactic) information during language acquisition, thus underlying the observed correlations between rhythm and language. Working memory processes common to task demands for musical rhythm discrimination and spoken language paradigms are another possible source of individual variance observed in musical rhythm and language abilities. To investigate the nature of the relationship between musical rhythm and expressive grammar skills, we adopted an individual differences approach in N = 132 elementary school-aged children ages 5–7, with typical language development, and investigated prosodic perception and working memory skills as possible mediators. Aligning with the literature, musical rhythm was correlated with expressive grammar performance (r = 0.41, p < 0.001). Moreover, musical rhythm predicted mastery of complex syntax items (r = 0.26, p = 0.003), suggesting a privileged role of hierarchical processing shared between musical rhythm processing and children’s acquisition of complex syntactic structures. These relationships between rhythm and grammatical skills were not mediated by prosodic perception, working memory, or non-verbal IQ; instead, we uncovered a robust direct effect of musical rhythm perception on grammatical task performance. Future work should focus on possible biological endophenotypes and genetic influences underlying this relationship.

Subject terms: Human behaviour, Language, Learning and memory

Introduction

There are a growing number of studies showing correlations between musical rhythm perception and speech/language skills, including grammar14. Gordon et al.3 demonstrated a positive association in children (age 6), such that those who were more accurate at distinguishing between musical rhythm sequences, also tended to have stronger performance on a morpho-syntactic language production task, despite obvious differences in task demands and musical and language stimuli. This type of finding was extended to other grammatical tasks by several groups, including Swaminathan and Schellenberg4 who showed a strong association between musical rhythm perception and receptive grammar (with a larger sample size of ~ 100 school-aged children) and Politimou et al.1, who showed that rhythm perception was associated with performance on a sentence imitation task (which encompasses both receptive and expressive dimensions of grammatical skill) in pre-schoolers. Moreover, Lee et al.2 also found a correlation between rhythm discrimination and receptive grammar in a wider age range of participants (7–17-year-olds), while controlling for auditory working memory and musical training. The goal of this paper was to test the robustness of the relationship between musical rhythm perception and grammatical performance, and then to look at the processes that might mediate this interaction.

Further behavioural investigations into the relationship between rhythm and grammatical skills are important because such studies could provide insight into mediating mechanisms, which in turn can inform us about overlap between how music and language are processed and decoded in the developing brain. An understanding of this relationship is also important in the context of atypical language development. Disorders that affect the use of language and hamper communication have long-term negative impacts on quality of life, economic outcomes, and mental health5,6. Children with developmental disorders of reading and spoken language (e.g., developmental dyslexia, developmental language disorder, stuttering) are known to struggle with rhythmic tasks (like rhythm production, discrimination, and synchronisation)79. A recent synthesis of over 300 studies suggested that atypical (impaired) rhythm is a risk factor for atypical language development10; moreover, there is still much to learn about how developmental cascades affected by rhythm processing unfold during development (see also Lense et al.11). Therefore, advancing the understanding of the role of rhythm in acquisition of grammatical skills might be instrumental in providing new perspectives into studying language disorders. It is interesting to note that rhythmic priming experiments, in which participants actively listen to musical sequences with a strong and predictable rhythm (prime) prior to language trials (target), have shown that listening to regular rhythms enhances the performance of the listener in a sentence imitation task that measures phonological accuracy12, and in grammaticality judgement tasks1315. These effects have been shown in both typically developing participants and in those with reading, hearing, and language disorders12,14,16. Taken together, the convergence of evidence of associations between rhythm and grammatical processing at both the trait (stable correlations between over time) and state (transient improvement in task performance) levels provides further corroboration for overlapping processing of musical rhythm and linguistic grammar.

Such groundwork demonstrating an association between rhythm and grammar task performance, in typical and atypical populations, has led to the question of what underlying cognitive and biological mechanisms might account for such a relationship. In order to dissect the relationship between rhythm and grammar, the current study had two goals: first, we evaluated the associations between music rhythm perception and expressive grammar skills; and second, we examined the role of working memory and prosodic perception as potential mediators of the association between rhythm and grammar. We made use of structural equation modelling so that we could study the direct effect of musical rhythm perception on expressive grammar, the indirect mediation either through prosodic perception or working memory, and because this approach would also help us identify which predictors would have the strongest effect on grammatical ability. We also considered the role of IQ in the structural equation model along with our mediators. In addition to an overall measure of expressive language, we used a complex syntax sub-score from the grammar assessment, given prior associations between rhythm and this subscore17. Complex syntax refers to sentences constructed using multiple clauses, that can either be linked with connectors or be embedded18. When parsing such sentences, children appear to make use of prosodic cues to facilitate processing of multiple clauses1921. Since prosodic perception is measured in our study, using complex syntax as an additional outcome would be another way to understand its specific role in mediating the rhythm-grammar link.

The first mediator we considered is prosodic perception. In music, rhythm is represented by temporal patterns of sounds organised around the “beat” or pulse of a piece of music, while in language, slow temporal cues are captured by speech prosody, which encompasses intonational and stress patterns of syllables, the lengthening and pausing during and of phrases, and the loudness and pitch of speech2224. These patterns are perceived as the rhythmic components of speech, though prosody may not necessarily adhere to the isochronous meter we tend to experience in musical rhythm25,26. Prosody plays a crucial role in early language acquisition and lexical development: for example, prosodic boundaries, which in sentences generally reflect the sentences’ syntactic structure, were shown to be leveraged by toddlers to learn the syntactic function of novel words27,28. Five-month-old children can distinguish between their native language and a language from a different rhythmic class, but not between two languages that follow similar rhythmic classes29. Studies have also shown that the prosody of sentences affects grammatical processing30 and facilitates word learning in toddlers28. Sensitivity to prosody is thus important for syntactic parsing and lexical access, in addition to its well-documented roles in decoding nuanced emotions, tone, context, emphasis and semantics23,27,31,32. Individual differences in prosodic sensitivity have also been shown to be related to musical rhythm perception as well as with musical training4,3335. Torppa et al.36 found that children who are musically engaged have improved performance on tasks that measure stress pattern awareness in speech. The sensitivity to prosodic cues of grammatical structures could be affected by a domain-general rhythm perception ability, since speech and music both require the listener to process hierarchically organised temporal sequences37. Such an attunement to rhythm might give some children an advantage in accessing the syntactical nature of grammar, thus allowing prosody to play a mediatory role in how rhythm and grammar are processed and acquired.

The other mediator that we explored is working memory. Working memory is a multi-component system that is primarily employed during task/goal-oriented activities and involves storage of relevant information (verbal or non-verbal), and cognitive manipulation and processing of the stored information to complete said task38. When recruited, working memory can also transfer information received during the task to long-term storage, and thus plays an important role in learning39. Working memory ability is correlated with performance on rhythmic synchronisation tasks40, prosodic perception41, ability to distinguish between prosodically deviant sentences42, and also with sensitivity to morphosyntactic and grammatical violations43,44. Further, just as for prosody, musical ability and training have been shown to correlate with higher working memory spans45,46. Another process that is closely related to working memory is cognitive flexibility, which is the ability to effectively switch mental representation or task sets as new information becomes available. Cognitive flexibility might play a role in parsing multi-clause sentences, and has been shown to be positively correlated with beat perception47 and as being predictive of developmental dyslexia48. In summary, given that working memory (and related processes) are associated with rhythm, grammar, and prosodic abilities, we hypothesised that it could play a mediatory role between rhythm and grammar processing and is thus included in our analysis, implemented with structural equation modelling (path model) in a relatively sizable developmental sample (N = 132 children with typical language development, ages 5–7 years old).

Results

Sample characteristics

We screened 150 participants for this study, from which N = 132 (aged 5.1–8.2, mean = 6.5 years, SD = 10 months, 76 females) met the complete eligibility criteria (based on their assessment scores and holistic review of screening measures by Speech-Language Pathologists [SLPs]) and constitute the final dataset (see Methods and Supplementary Information for details). Eligibility for the current study was dependent on participants having normal hearing, typical non-verbal cognitive ability, typical language development, an absence of major neurodevelopmental disorders or conditions (e.g., autism spectrum disorder, brain injury, cerebral palsy), and English being the primary language spoken at home. During the screening visit, parents were also asked to fill in a demographic questionnaire, from which we used mother’s highest level of education as a proxy for socioeconomic status (SES). We assessed children’s musical experience and calculated a musical experience score (MES) using the same parent questionnaire as Gordon et al.3 The descriptive statistics of all the eligible participants (N = 132) including age, SES, and MES is tabulated in Table Table1.1. For details of the screening and eligibility determination please see Supplementary Information.

Table 1

Demographics and Assessment Scores of the Study Cohort.

Sample characteristicsNMeanSDSERangeSkewKurtosis
Age (years)1326.530.840.075.07–8.170.21 1.20
PTONI standard score124120.5519.361.7480–150 0.36 1.02
Picture vocabulary (scaled score)13212.892.180.195–17 0.941.25
Sentence Imitation (scaled score)13112.652.120.198–17 0.03 0.74
Morphological completion (scaled score)13212.861.840.166–17 0.630.57
Socioeconomic status (SES)1327.501.000.095–9 0.02 0.72
Musical experience (MES)1311.091.010.090–40.790.10

This table summarises the demographic characteristics and scores of the screening measures employed in the study. The picture vocabulary, sentence imitation and morphological completion scores provide age-normed scaled scores for the cohort. A questionnaire was used to assess the musical engagement and calculate a musical experience score, and the highest level of maternal education was used as a proxy for socioeconomic status. The age-normed standard scores of the PTONI are reported for the non-verbal IQ measure. PTONI = Primary Test of Non-Verbal IQ.

Individual differences analysis

We used assessments of expressive grammar, musical rhythm perception, prosodic perception, and verbal working memory as the behavioural variables for the individual differences analyses. Expressive grammar was measured with the Structured Photographic Expressive Language Test-3 (SPELT-3)49 and musical rhythm perception was measured using the children’s version of the BBA (Beat-based Advantage) test3,50. The BBA is a computer-based task, in which participants are asked to identify which rhythms are same and which are different from a pair of rhythms presented over speakers. The computer-based Prosody-Matching task, which is a forced choice ABX sentence prosody matching test, was used to probe prosodic perception. In this task participants had to match a low-pass filtered speech sentence to one of two non-filtered sentences (see Methods and Supplementary Information for details). Finally, verbal working memory was assessed with the forward digit span subtest of the Kaufman Assessment Battery for Children second edition (KABC-II51).

The descriptive statistics of the outcome measures are summarised in Table Table2.2. From the 132 participants that completed all study visits and assessments, n = 103 participants had valid and complete data for all the tasks and assessments. For the computer-based assessments (BBA and Prosody Matching), n = 23 participants (individual test missingness is: n = 11 for BBA, n = 14 for Prosody Matching) did not have useable data for one or more of the tests either due to technical errors, program malfunctions, or participant’s inability to understand or attend to the behavioural task. The path models exclude missing data in a list-wise manner and use only those observations with complete data; the n for path models without controlling for IQ is 109, and is 103, when controlling for IQ. Correlation analyses used all available data and excluded missing data in a pair-wise fashion; thus, n differs for each pair of variables.

Table 2

Descriptive statistics of behavioural outcome measures.

Variable nameNMeanSDSERangeSkewKurtosis
SPELT-3 standard score132113.136.660.5890–126 0.781.06
SPELT-3 raw score13246.063.800.3335–53 0.770.22
SPELT-3 complex syntax sub-score1320.790.130.010.42–1 0.63 0.004
BBA d'1210.990.850.08 1.20–2.730.12 0.64
Prosody Matching d'1181.790.780.070.13–3.28 0.40 0.39
KABC-II number recall raw score1329.362.020.185–160.060.37

Age-normed standard scores and raw test scores for the expressive language test (SPELT-3) are shown. The complex syntax sub-score is a non-scaled raw score. The d' dcalculated using signal detection is reported for the BBA and Prosody Matching tests. The non-scaled raw score of the forward digit span subtest of the KABC-II is used. SPELT-3 = Structured Photographic Expressive Language Test-3; BBA = Beat-Based Advantage; KABC = Kaufman Assessment Battery for Children.

Correlations

Correlations between all measures (controlling for age) are displayed in Table Table3.3. For the correlation we first z-scored the raw scores for SPELT-3, BBA, Prosody Matching, KABC-II, MES, and SES, then regressed age from all the variables, and z-scored the residuals obtained from the regression. These z-scored residuals from the age-controlled regression were used for pairwise correlation. For IQ we used z-scored standard scores of the PTONI and used these in the correlation. Since MES and SES did not correlate significantly with prosodic perception, grammar, or musical rhythm perception, they were not included in further analyses. Individual BBA (r = 0.41, p < 0.001) and Prosody Matching (r = 0.25, p = 0.006) scores were both significantly and positively correlated with expressive grammar (SPELT-3) scores. Prosody Matching scores were also positively correlated with BBA scores (r = 0.32, p < 0.001). Further, we also computed a complex syntax sub-score from the SPELT-3 using the item groupings as described in Gordon et al., 201517. Since complex syntactical ability has been demonstrated to be strongly correlated with musical rhythm perception17, and processing complex sentences might recruit overlapping neural resources with domain-general rhythm perception abilities, this metric was explored in the associations. In our sample, BBA scores were positively correlated with complex syntax sub-scores (r = 0.26, p = 0.003). Interestingly, we also found that the correlation between musical rhythm perception and simple grammar is small and non-significant (see Supplementary Information section ‘Analysis with simple grammar scores’ for details). Scatter plots for the relevant individual correlations are shown in Fig. 1A–D.

Table 3

Pearson correlation matrix of all variables used in the correlation and path model analysis.

Variable12345678
1. SPELT-3 raw score
2. SPELT-3 complex syntax sub-score0.72**
3. BBA0.41**0.26*
4. Prosody Matching0.25*0.150.32**
5. PTONI standard score0.27*0.21*0.33**0.24*
6. KABC-II number recall raw score0.21*0.100.33**0.27*0.30**
7. SES0.08 0.010.030.110.100.21*
8. MES0.01 0.19* 0.060.07 0.100.050.18*

Except for the PTONI age-normed standard scores (which are the z-scored standard scores), the raw scores of all other variables were z-scored, regressed on age, and the resulting residuals were z-scored once again, and these z-scored, age-regressed residuals were used in the correlation matrix. Values are correlation r values for each pair of variables. Missing values were excluded in a pair-wise manner. SPELT-3 = Structured Photographic Expressive Language Test; BBA = Beat-Based Advantage; PTONI = Primary Measure of Non-Verbal IQ; KABC = Kaufman Assessment Battery for Children; SES = Socioeconomic Status; MES = Musical Experience Score.

*indicates p < 0.05, and **indicates p < 0.001.

An external file that holds a picture, illustration, etc.
Object name is 41598_2022_21902_Fig1_HTML.jpg

Scatterplots showing the correlations between variables. Scatter plots for correlations between (A) expressive grammar (SPELT-3) scores and musical rhythm perception (n = 121); (B) expressive grammar and prosodic perception (n = 118); (C) complex syntax sub-scores and musical rhythm perception (n = 121); and (D) musical rhythm perception and prosody perception (n = 109). Age is controlled for in all plots, and Pearson’s r and p values are displayed for each correlation.

Mediation analysis

We used a fully identified path model to investigate the possible mediation of prosodic perception or working memory in the rhythm-grammar correlation, displayed in Fig. 2 (n = 109). In this model, we controlled for age by using residualised variables for all measures (i.e., regressing all z-scored raw measure scores on age and computing residualised scores prior to analysis, as described for the simple correlations above). As shown in Fig. 3, the direct path from musical rhythm perception to expressive grammar task performance was significant (Path A; β = 0.41, p < 0.001), even when accounting for prosodic perception and working memory. The indirect effects of musical rhythm perception on grammar through prosodic perception (Path BC; β = 0.02) and working memory (Path DE; β = 0.01) were small and non-significant. Thus, musical rhythm perception was correlated significantly with expressive grammar skills, and this correlation was not mediated by either prosodic perception or working memory. The residual correlation between prosodic perception and working memory was also significant (Path X, β = 0.20, p = 0.037).

An external file that holds a picture, illustration, etc.
Object name is 41598_2022_21902_Fig2_HTML.jpg

Path model for mediation analysis of musical rhythm perception and expressive grammar performance. This path model examines the relationship between musical rhythm perception (predictor) and grammar performance (outcome), and the possible mediators (working memory capacity and prosody perception) that might play a role in this correlation. Path A is the direct effect from rhythm to grammar, while paths BC and DE constitute the mediation effects through prosodic perception and working memory respectively. Path X depicts the residual correlation between prosodic perception and working memory.

An external file that holds a picture, illustration, etc.
Object name is 41598_2022_21902_Fig3_HTML.jpg

Path analysis model of associations between musical rhythm perception and expressive grammar. The path analysis model shows the associations between musical rhythm perception, expressive grammar task performance, prosodic perception and working memory, while controlling for age (n = 109). Z-scored age-regressed residuals of the z-scored raw scores were used for all measures. We found that though the relationship between musical rhythm perception and expressive grammar task performance (path A) is positive and significant, it is not explained by prosodic perception (path BC) or working memory (path DE). The model uses list-wise exclusion for missing values. The β for each pair of variables is indicated on the path. Solid lines indicate a significant relationship. *signifies p < 0.05, while **signifies p < 0.001.

We conducted the same path analysis by covarying for non-verbal IQ from all measures in the model (n = 103). We used the z-scored standard scores from the PTONI as the measure for IQ, while all other variables were the z-scored, age-residualised raw scores. As shown in Fig. 4, although IQ was significantly correlated with musical rhythm perception and working memory, the primary results of the model remain unchanged. That is, the direct path (path A) between musical rhythm perception and expressive grammar remained significant (β = 0.38, p < 0.001), while the indirect paths through prosodic perception (path BC) and working memory (path DE) were non-significant. Finally, we also conducted a multiple regression with musical rhythm perception, working memory, and prosodic perception (See Table S1 in Supplementary Information), which also demonstrated that rhythm perception was the only significant predictor of grammar controlling for all other variables, including SES, MES, and nonverbal IQ.

An external file that holds a picture, illustration, etc.
Object name is 41598_2022_21902_Fig4_HTML.jpg

Path analysis model for musical rhythm perception and expressive grammar, controlling for non-verbal IQ. This figure depicts the path model for music rhythm perception and expressive grammar, prosodic perception and working memory as mediators, with IQ as a covariate (n = 103). Age was partialled out from all variables (except IQ), by creating z-scored residualised scores of age against the z-scored, raw scores of variables. Z-scored standard PTONI scores were used as the measure of IQ. Missing values were excluded in a list-wise fashion. The β for each pair of variables is indicated on the path. Solid lines indicate a significant relationship.*signifies p < 0.05, while **signifies p < 0.001.

We performed similar path analyses (controlling for age, as previously described), to investigate the association between musical rhythm perception as the predictor and complex syntax task performance as the outcome, displayed in Fig. 5. We hypothesised that since complex syntax and musical rhythm follow an inherent hierarchy that unfolds over time, processing both complex syntax and rhythm might employ shared processing pathways evident in their behavioural output. Our analysis showed that the direct association, path A between musical rhythm perception and complex syntax was positive and significant (β = 0.27, p = 0.007) in our path model (Fig. 5A). Further, as seen in Fig. 5A, the direct association between rhythm perception and complex syntax, remained significant, and the indirect mediating pathways through prosodic perception (path BC, β = 0.02) and working memory (path DE, β = 0.06) were non-significant. Similar results were observed when we included non-verbal IQ as a covariate in the path model (Fig. 5B), showing that complex syntax, much like expressive grammar in the previous path model (Fig. 4), was positively and significantly correlated with performance on the musical rhythm perception task (the BBA) (path A, β = 0.25, p = 0.018), and this correlation was not mediated either by prosody perception or working memory, and neither explained by non-verbal IQ. Further, path analyses conducted with simple grammar as an additional covariate, demonstrated that there is a unique correlation between musical rhythm perception and complex grammar (See supplemental information Fig. S5).

An external file that holds a picture, illustration, etc.
Object name is 41598_2022_21902_Fig5_HTML.jpg

Path analysis model for musical rhythm perception and complex syntax, with and without nonverbal IQ as a covariate. (A) Panel A (n = 109) depicts the path model for musical rhythm perception and complex syntax, with prosodic perception and working memory as mediators. Residualised scores were created by partialling age from z-scored raw scores of all variables. The age-residualised scores were then z-scored and used in the path model. We find that the relationship between musical rhythm perception and complex syntax task performance is non-significant. Solid lines indicate a significant relationship. *signifies p < 0.05, while **signifies p < 0.001. (B). Panel B shows the path analysis model for musical rhythm perception and complex syntax, controlling for IQ (n = 103). Residualised scores were created by partialling age from the z-scores of all variables, except for IQ. For IQ we used z-scored standard PTONI scores. The z-scored, age-regressed residual scores, and z-scored PTONI standard scores were used in the path model. The β for each pair of variables is indicated on the path. Even by controlling for IQ, the relationship between musical rhythm perception and complex syntax remains non-significant. Solid lines indicate a significant relationship. *signifies p < 0.05, while **signifies p < 0.001.

Power analysis

Post-hoc power calculations for testing the mediation effect (e.g., through prosody or working memory) were conducted using the powerMediation package in R52. Achieving 80% power for an indirect effect of β = 0.10 (alpha = 0.05; two-tailed test; path from rhythm to mediation β = 0.32, path from mediator to grammar β = 0.32) requires N = 72 participants, less than our achieved N = 109. Based on the actual indirect effects described in Fig. 3, which were quite small (β = 0.022 for prosody, β = 0.014 for working memory), we would need N = 140 or N = 282 individuals, respectively, to detect a significant indirect path. Thus, our study was powered to detect an indirect effect of small-to-medium size, but not the very small indirect effects observed here.

Discussion

In alignment with prior work3,4,1315, here we demonstrated a positive correlation between musical rhythm perception and expressive grammar skills in a sizeable sample of 132 school-aged (5–7-year-old) children. We also found that prosodic perception is positively correlated with musical rhythm perception (r = 0.32, p < 0.001), and with expressive grammar (r = 0.25, p = 0.006). The importance of prosodic perception in the context of language has been elaborated in studies which showed that prosodic cues play an important role in early language acquisition53 and account for variance in literacy skills in children54. Imposing metrical regularities in spoken sentences also affects adult listeners’ neural processing of grammatical structure30, indicating that prosodic sensitivity continues to be important for language processing later in the lifespan20,55. Previous studies have explored the correlation between prosodic sensitivity and reading and literacy skills56 as well as prosodic sensitivity and musical rhythm skill35, showing that weak performance on prosodic tasks tends to co-occur with weak reading comprehension and musical rhythm perception skills. Based on these observation, we explored the possibility that prosodic perception would be an important factor mediating the associations between musical rhythm and grammatical skills.

To study the nature of this relationship, we used a path model that factored in prosodic perception as a mediator, and tested working memory as an additional potential mediator, since working memory also tends to play a role in performance on musical and grammar tasks40,43,57. Contrary to our predictions about these potential mediators, the associations between musical rhythm and grammar scores remained significant and were not accounted for by prosodic perception ability, nor by working memory, evidenced by the results of the path model. Importantly, covarying non-verbal IQ and age did not impact these associations, thus indicating that the dynamics of the relationship between musical rhythm and spoken grammar skills are not driven by prosodic perception, verbal working memory, general cognition, or non-specific developmental effects. These findings replicate and extend other recent work that has shown a relationship between rhythm-related skills and spoken grammar task performance, above and beyond general cognitive effects or working memory24.

Furthermore, in our study, the path model with complex syntax sub-scores demonstrated similar results as for overall grammatical skill as the outcome: there was a significant direct effect of musical rhythm perception on complex syntax, and this relationship was not mediated by prosodic perception nor working memory. In line with previous findings of musical rhythm variables predicting children’s individual differences in complex syntax skill17,58 and adults’ years of musical experience predicting attainment of long-distance syntactic dependencies in an artificial language learning paradigm59, we thus found converging evidence for an association between mastery of complex multiclausal syntactic structure and musicality variables.

Overall, our cross-sectional results should not be interpreted as a causal role of musical rhythm skill on children’s grammatical development. While the present study upholds previous observations that rhythm and grammar performance are indeed correlated, we did not delve into the neural, biological, or genetic mechanisms that are responsible for this association. Recent twin studies, for example, have revealed that shared genetic influences account for much of the association between verbal ability and measures of musical aptitude and music engagement60,61. Such findings are more consistent with a model in which musical and language traits are influenced by a similar set of genetic (and environmental) factors, rather than causal associations of music on language development. Moving forward, other study designs will be necessary to unpack any potential longitudinal or (bi)directional associations between rhythm and grammatical skills. Further, while we did not observe mediation between musical rhythm perception and grammar via working memory, it is possible that we might have observed this mediation had we used a working memory related task that engaged cognitive manipulation as well. We used the forward digit span as a measure of working memory, a task which relies primarily on the storage capacity aspect. Working memory consists of both—the ability to store information and the ability to manipulate it, and using a more involved task like the backward digit span, in addition to the forward digit span, might have provided a more nuanced point of view.

A particular limitation is that in our application of the structural model to our data, due to not having multiple measures of each construct in the full sample, we could not construct latent variables for our measures. Inclusion of latent variables would have reduced the measurement error present in the study measures and allowed us to create variables that capture individual differences with reduced error margins. Future studies should include multiple correlated measures for each construct (i.e. rhythm production, perception, and imitation for a “rhythm” construct62) while also maintaining a large enough sample size to warrant path modelling and other types of structural equation models. Another caveat is that our study only included monolingual speakers of American English. Studies looking at musicality and language skills in bilingual participants2,63,64 suggest that a fruitful strategy for future work would be to employ more inclusive inclusion criteria, with bilingualism coded as a covariate, as successfully demonstrated in Swaminathan and Schellenberg4.

It is also important to keep in mind that here we quantified prosodic perception with a Prosody Matching task (employed through discrimination of filtered sentences) that focused on sensitivity to prosodic cues signifying syntactic structure, i.e., phrasal lengthening to indicate pauses or intonational markings that signify when a question is being asked. This assessment thus may not capture other rhythmic aspects of speech (i.e., lexical stress65,66), but rather capture prosodically-directed sentence-level syntactic processing of the phrase constituents. Future incorporation of speech-rhythm (e.g., tasks concerning identification of stressed syllables, or repetition of nonsense syllable rhythms) will likely help disentangle the effects of prosodic perception on the associations between rhythm and grammar task performance. Similarly, for the outcome measure, we have used expressive grammar abilities, complementing prior research that has shown similar results with receptive grammar tasks2,4. Grammatical syntactic processing is a complex, multi-layered neurobiological function, and studying its various facets will help specify the nature of rhythmic skills and how they might impact other individual differences in morpho-syntactic processing/skills.

From the literature and from our study we observe that musical rhythm sensitivity has significant implications for development of grammatical skills, such that children who are more sensitive to musical rhythm seem to have an advantage during language learning, though the processes underlying this relationship are not well-understood. Children’s early-emerging sensitivity to speech prosody is leveraged by infants during language and especially syntactical learning28,67. The association between rhythm perception and language becomes more directly measurable by pre-school age; Politimou et al.1 demonstrated that rhythm perception and production skills predicted sentence imitation task (which taps into both receptive and expressive grammar and syntax), converging with findings by Woodruff Carr et al.,68 that rhythm discrimination and beat synchronisation predict sentence imitation. There is a relatively large body of studies linking musical rhythm to literacy skills; for example rhythmic sequence reproduction is positively associated with children’s reading and spelling performance in school in school-aged children69. Taken together, these observations suggest that (a) early language and music development might be entangled, (b) language development to some extent utilises rhythmic perception, and (c) the domain-specificity and nuanced processing of these stimuli develops over time70.

Both musical rhythm and linguistic syntax are ordered in a temporally progressing hierarchy7174. Considering these structural parallels, there could be shared mechanisms at play that are responsible for processing both music and grammar75,76. One cognitive pathway that might feature in the relationship between musical rhythm and grammar processing is hierarchical processing, which helps to break down a complex, stratified system into its individual components77. With some similarities to rhythm, the grammatical structure of language, especially complex syntax, follows a hierarchical structure that unfolds over time73,77,78. Our finding of a direct relationship between children’s rhythm discrimination and complex syntax skills suggests a possible role of hierarchical processing across domains. There is also functional support for this possibility from a recent meta-analysis which demonstrated some overlap between brain regions involved in processing musical rhythm and syntactic structure in language (i.e., inferior frontal gyrus, supplementary motor area, and bilateral insula), with the common cognitive mechanism attributed to hierarchical processing79. Evolutionary accounts of a common origin of language and music have pointed to a preference for tree-like hierarchical structures in early humans’ communication traits that may have subsequently given way to rhythm and language in modern language32,77,80.

The underlying biology of the cross-trait relationship between rhythmic perception and grammatical skills, especially in light of proposed shared evolution, could be explained by genetics—both through shared genetic architecture60,81 between human rhythm and grammatical traits, and through evolution of hierarchical processing as a common mechanism to process syntax in both music and grammar82. Developmental observations in musicality and language skill further support the case for common genetic influences on these traits, perhaps that are ontogenetic. Although as adults we might consider musical rhythm and speech highly dissimilar83, evidence collected from studies with infants looking at language discrimination, speech tracking and rhythmicity84,85, melodic complexity of crying and language acquisition8688, and familiarity with musical rhythms and words89 might indicate that development of rhythm and grammatical skills is concomitant rather than sequential.

In conclusion, our study in school-aged children adds to the growing literature in music cognition that demonstrates the association between individual differences in musical rhythm perception and expressive grammatical skill. Our results demonstrated that the ability to parse complex syntax is also correlated with musical rhythm perception, indicating that there might be a common mode of processing syntactical hierarchy in both rhythm and grammar. This study replicates the finding that there is a positive correlation between rhythm and grammar performance, but interestingly this relationship is independent of working memory and IQ, is not simply driven by sensitivity to prosody but rather is truly a cross-domain phenomenon. These results open the door to future investigations of underlying common biological and genetic mechanisms that drive rhythm-grammar links with a focus on the dynamic underlying causes of this relationship.

Methods

The study protocol was reviewed and approved by the Vanderbilt University Institutional Review Board (IRB #120,062). All experiments were performed in accordance with Vanderbilt IRB regulations and all relevant ethics guidelines from the Belmont Report. For the study, all participants, and their parents gave verbal assent and written informed consent respectively, in accordance with established protocols.

Participant recruitment

Children (N = 150) between the ages of 5–7 were recruited from several sources in the Middle Tennessee community such as libraries, museums, research mailing lists through Vanderbilt University, and schools, and were screened for the study. Children were invited for one in-depth, on-campus screening visit, and those who were eligible were invited back for one or two additional study visits on-campus. Note that all the children were between the ages of 5 and 7 when their language was characterised, and they were enrolled during the screening visit. However, one child turned 8 by the time the additional study visits were completed. This is reflected in the age range reported for the expressive language outcome measure (SPELT-3). Study data were collected and managed using REDCap electronic data capture tools hosted at Vanderbilt University90,91. Families were compensated with a gift card and small toy per visit. Details of screening measures and language characterisations used to determine eligibility for the study are detailed below; N = 132 were included in the final dataset. Note that data from the same cohort, including the language screening characterisations, non-verbal IQ and the expressive grammar (SPELT-3) variables, are also utilised as the sample with typical language development in another study focused on motor timing in developmental language disorder92.

Screening and language characterisation

Hearing screening

Using headphones, each child listened to a series of 20 dB pure-tones presented 3 times at 1000, 2000 and 4000 Hz in each ear respectively (18 tones in total; 9 in each ear; 3 at each frequency). To pass, participants needed to correctly identify two out of three presentations at each frequency, in both ears. If a child failed a hearing screening due to allergies/infections, they were invited back for a second screening, which if they passed, allowed them to continue in the study.

Nonverbal IQ

To rule out cognitive disabilities and obtain a covariate of non-verbal cognition, the PTONI (Primary Test of Non-Verbal IQ)93 was administered to assess age-normed nonverbal IQ; details are provided in the Supplementary Information.

Language characterisation

The TOLD-P:4 (Test of Language Development—Primary, 4th edition; Newcomer & Hammill, 2008)94 and the TEGI (Test of Early Grammatical Impairment; Rice & Wexler, 2001)95 were used to assess language eligibility and rule out language disorders. To assess typical language development, we used three subtests from the TOLD-P:4, namely Sentence Imitation (SI), Picture Vocabulary (PV) and Morphological Completion (MC). Since these subtests assess expressive and receptive syntax, receptive vocabulary, and expressive morphology respectively, they effectively capture language impairments, if present. We also used the phonological probe from the TEGI to rule out speech and/or phonological impairments that may interfere with accurate assessment of morphosyntax.

Children who failed the phonological probe of the TEGI (raw score of 12 or below) were excluded from study participation. Children with signs of language impairment as judged by SLP clinicians were not eligible to continue and were referred to another study for children with language impairment. Evidence of potential language impairment was indicated by below average scaled scores on any one of the specific TOLD subtests (scaled score < 8 for either SI, PV, or MC). One participant scored below criterion on the SI subtest for the TOLD P-4 due to non-compliance, and four children scored below average on one out of three subtests. However, their average performance on the PTONI, SLP expert opinion based on holistic review of other tests, and observations during the assessments verified normal language development, which allowed them to be included in the study. All standardised language testing and decisions to include or exclude participants based on scores were overseen by Speech-Language Pathologist research staff. Language characterisation was performed for eligibility, whereas expressive grammar, the phenotype of interest for the individual differences analysis, was measured with the SPELT-3 (described below).

It is important to note that our inclusion criteria focussed on typical language development, and hence the TOLD-P:4 and the phonological probe from the TEGI, were used for language characterisation. This phonological probe tests the child’s ability to produce certain phonemes and is, as such, a test of developmental impairments of articulation, and not of phonological awareness, and cannot detect presence of reading impairments. There is therefore, a possibility that children with undetected reading difficulties were included in our sample, even though their spoken language was found to be typically developing. For more details, see Supplementary Information.

Behavioural assessments for individual differences analyses

Grammar

Expressive grammar was measured with the Structured Photographic Expressive Language Test-3 (SPELT-3)49. Participants were shown a series of pictures and asked questions that elicit responses requiring specific grammatical construction (e.g., relative clauses, possessive pronouns). The z-scored raw scores from this assessment were used as the outcome measure of expressive grammar skills for the phenotypic correlations, while controlling for age. We also generated complex syntax subscores from the SPELT-3 using the 12 items designated to have complex syntax form17.

Rhythm assessments

Musical rhythm perception (BBA)

Musical rhythm perception was measured using and the children’s version of BBA (Beat-based Advantage) test3,50, which has been adapted from the original adult version of the task implemented by Grahn and Brett96, and use auditory stimuli originally developed by Povel and Essens97. The children’s BBA is a computer-based test developed and presented via E-Prime v2.0 Professional (Psychology Software Tools, Inc.) on a laptop. The sounds were presented over external speakers connected to the laptop and calibrated to 70 dB. The BBA was presented to children as the “Drummer” game (see Supplementary Information for visuals and details). During the task, participants were asked to listen to three standard rhythms, played sequentially. The first two rhythms were identical and played by a character called Randy Drummer, while the third and final rhythm could be the same as the first two (thus played by Sandy Same who copies Randy Drummer), or different (played by Doggy Different). Children were asked to determine whether the third rhythm was played by Sandy Same or by Doggy Different. This task employed simple and complex rhythms (half the trials each). The same stimulus corpus and visual representation was used here as in Gordon et al.3 such that rhythms were 25% faster than in the other work in adults, due to young children’s faster internal tempo98.

The BBA had 4 practice trials (2 simple and 2 complex—each with a same/different variation) followed by 28 test trials, divided into 14 simple and 14 complex rhythms, with each rhythm type further subdivided into 7 “same” and 7 “different” trials. The rhythms heard in the practice trials were excluded from the test trials. Correct/incorrect feedback was provided only in the practice stage. For analyses, we calculated the d’ separately for simple and complex rhythms, with children generally performing better on trials with simple rhythms (higher d’ values) than with complex rhythms. The simple and complex d’ were then averaged for an overall BBA d’ score. This BBA raw score was further z-scored, controlled for age, and used as the measure of musical rhythm perception in our analyses.

Prosodic perception

Prosodic perception was measured with a Prosody-Matching task (adapted from Soman99), which tests sensitivity to intonational fluctuations in speech. The task was presented through the “Astronaut” game, via E-Prime using the same computer and speaker setup as for the BBA. The task was a forced-choice ABX discrimination task that required children to match the prosody of a low-pass speech filtered sentence to one of two non-filtered, normal speech sentences (see Supplementary Information for visuals and details of the task). The unfiltered sentences were spoken by either the red astronaut or the blue astronaut in the visual, while the low-pass filtered speech was attributed to their green alien friend. Children were told that the astronauts and their green alien friend were playing a copycat game, and that the alien was trying to copy what one of the astronauts was saying, but it's hard to hear in outer space, and it was the child’s job to determine which astronaut the alien was mimicking. The task had a total of 24 trials (12 trials mimicking the red astronaut, and 12 trials the blue) which were presented to the participants in random order. Similar signal detection theory analysis was used to calculate the d’, which was then z-scored, controlled for age, and the residuals were further z-scored. These z-scored, age-controlled residuals were used as a measure of prosodic perception. For experimental details of the BBA and Prosody Matching task, please see the Supplementary Information.

Verbal working memory

The forward digit span subtest of the KABC-II (Kaufman Assessment Battery for Children, second edition)51, was used to measure verbal working memory, given that working memory has been positively correlated with rhythm perception and synchronisation and with grammatical task performance in earlier work43,45. During the task, children heard a recording through the computer speakers of a number-series that they were asked to repeat back to the examiner verbatim. The raw scores from the test were z-scored, controlled for age, and the resulting residuals were z-scored and used for analysis.

Analysis plan

All data were analysed using R software environment100, the tidyverse R package101, and the lavaan package102. Using lavaan, a structured equation model (SEM)/path model was used to examine the possible mediation of the music rhythm perception—grammatical skill relationship through prosodic perception or working memory, with missing values excluded in a list-wise manner. Prosodic perception and working memory served as possible mediators, with IQ and age as covariates. As shown in Fig. 2, we evaluated two possible mechanisms of mediation—one through prosodic perception (path BC), and the other through working memory (path DE). This path model was fully identified, hence model fit indices were not considered.

For both the path model and correlation analyses, we used raw scores for all our measures (SPELT-3, KABC-II, Prosody Matching, BBA), except for non-verbal IQ, where we used the age-normed PTONI scores. For the behavioural measures which used raw scores (i.e., SPELT-3, KABC-II, Prosody Matching, and BBA), we first z-scored, i.e., standardised the raw scores. Then age was partialled from the behavioural measures by generating residuals from the correlation between the z-score of age and the z-score of the behavioural measures. Finally, these residuals were z-scored again, and these residual z-scores were used as the final input values of the behavioural measures in the analyses. For the PTONI standard scores, we z-scored the available PTONI scores, and then used these z-scores in our analyses. Though the SPELT-3 is a standardised test, with age normed scores available, since we used the SPELT-3 scores as an outcome measure of expressive grammatical ability for individual participants as a representative of the population, rather than as a measure of ranking/standing within the general age group of the participants, we used raw scores which were then controlled for age (as described above) in our analysis.

Supplementary Information

Acknowledgements

This project was supported by funding from the National Institutes of Health (NIH): National Institute on Deafness and Other Communication Disorders, the Office of Behavioral and Social Sciences Research, and the Office of the Director of the NIH under Award Numbers R03DC014802, K18DC017383, DP2HD098859, and R01DC016977, and funding from the National Science Foundation (1926794). S.M.C. was also supported by funding from the Scottish Rite Foundation of Nashville. Use of REDCap was made possible by Grant UL1 TR000445 from the National Center for Advancing Translational Sciences (NCATS)/NIH. The content is solely the responsibility of the authors and does not necessarily represent the official views of the funders. The project was also supported by funding from Vanderbilt Trans-Institutional Programs. We thank the families for participating, and we thank the following individuals for assistance with data collection and data management: Ashley Hirsch, Karli Oxford-Jordan, Katherine Margulis, Alaina Baird, Mari McCarville, Rebecca Embalabala, Alexander Chern, Courtney Walters Jr., Leah Boyd, Kristin Gummersheimer, Cara Petrucci, Aysu Erdemir, Alyssa Scartozzi, Youjia Wang, Sabrina Halavi, and Elise Titiner. The authors appreciate feedback from two anonymous reviewers.

Author contributions

R.L.G., J.D.M., S.M.C., and D.Z.H. conceived the project, and conceptualised and designed the study; R.L.G. supervised the study. R.L.G., U.S., S.D.B., N.W., A.A., and C.V. designed and piloted the rhythm and language paradigms. A.A., N.W., C.V., S.D.B., O.A.B., R.N., and V.P. collected participant data. O.A.B., C.T.B., R.N., and V.P. were responsible for data management and review, with input from S.M.C regarding language characterisation. R.N., V.P., O.A.B. and D.E.G. conducted statistical and mediation analyses. R.N., D.E.G, and R.L.G interpreted results and findings. R.N. and R.L.G. drafted the manuscript text and figures. All authors reviewed the manuscript.

Data availability

Data analysis code and de-identified variables reported in this manuscript are made available on Open Science Framework at 10.17605/OSF.IO/KG6JX.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Rachana Nitin, gro.cmuv@1.nitin.anahcar.

Reyna L. Gordon, ude.tlibrednav@nodrog.anyer.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-022-21902-0.

References

1. Politimou N, Dalla Bella S, Farrugia N, Franco F. Born to speak and sing: Musical predictors of language development in pre-schoolers. Front. Psychol. 2019;10:1–18. [Europe PMC free article] [Abstract] [Google Scholar]
2. Lee YS, Ahn S, Holt RF, Schellenberg EG. Rhythm and syntax processing in school-age children. Dev. Psychol. 2020;56:1632–1641. [Abstract] [Google Scholar]
3. Gordon RL, et al. Musical rhythm discrimination explains individual differences in grammar skills in children. Dev. Sci. 2015;18:635–644. [Abstract] [Google Scholar]
4. Swaminathan S, Schellenberg EG. Musical ability, music training, and language ability in childhood. J. Exp. Psychol. Learn. Mem. Cogn. 2020;46:2340–2348. [Abstract] [Google Scholar]
5. Conti-Ramsden G, Durkin K, Toseeb U, Botting N, Pickles A. Education and employment outcomes of young adults with a history of developmental language disorder. Int. J. Lang. Commun. Disord. 2018;53:237–255. [Europe PMC free article] [Abstract] [Google Scholar]
6. Eadie P, et al. Quality of life in children with developmental language disorder. Int. J. Lang. Commun. Disord. 2018;53:799–810. [Abstract] [Google Scholar]
7. Sallat, S. & Jentschke, S. Music perception influences language acquisition: Melodic and rhythmic-melodic perception in children with specific language impairment. Behav. Neurol.2015, 606470. 10.1155/2015/606470 (2015). [Europe PMC free article] [Abstract]
8. Cumming R, Wilson A, Leong V, Colling LJ, Goswami U. Awareness of rhythm patterns in speech and music in children with specific language impairments. Front. Hum. Neurosci. 2015;9:672. 10.3389/fnhum.2015.00672. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
9. Corriveau KH, Goswami U. Rhythmic motor entrainment in children with speech and language impairments: Tapping to the beat. Cortex. 2009;45:119–130. [Abstract] [Google Scholar]
10. Ladányi E, Persici V, Fiveash A, Tillmann B, Gordon RL. Is atypical rhythm a risk factor for developmental speech and language disorders? WIREs Cogn. Sci. 2020;11:e1528. [Europe PMC free article] [Abstract] [Google Scholar]
11. Lense, M. D., Ladányi, E., Rabinowitch, T. C., Trainor, L. & Gordon, R. L. Rhythm and timing as vulnerabilities in neurodevelopmental disorders. Philos. Trans. R. Soc. B Biol. Sci.376, 20200327. 10.1098/rstb.2020.0327 (2021). [Europe PMC free article] [Abstract]
12. Cason N, Hidalgo C, Isoard F, Roman S, Schön D. Rhythmic priming enhances speech production abilities: Evidence from prelingually deaf children. Neuropsychology. 2015;29:102–107. [Abstract] [Google Scholar]
13. Chern A, Tillmann B, Vaughan C, Gordon RL. New evidence of a rhythmic priming effect that enhances grammaticality judgments in children. J. Exp. Child Psychol. 2018;173:371–379. [Europe PMC free article] [Abstract] [Google Scholar]
14. Ladányi E, Lukács Á, Gervain J. Does rhythmic priming improve grammatical processing in Hungarian-speaking children with and without developmental language disorder? Dev. Sci. 2021;24:e13112. [Europe PMC free article] [Abstract] [Google Scholar]
15. Fiveash A, Bedoin N, Lalitte P, Tillmann B. Rhythmic priming of grammaticality judgments in children: Duration matters. J. Exp. Child Psychol. 2020;197:104885. [Abstract] [Google Scholar]
16. Przybylski L, et al. Rhythmic auditory stimulation influences syntactic processing in children with developmental language disorders. Neuropsychology. 2013;27:121–131. [Abstract] [Google Scholar]
17. Gordon RL, Jacobs MS, Schuele CM, Mcauley JD. Perspectives on the rhythm-grammar link and its implications for typical and atypical language development. Ann. N. Y. Acad. Sci. 2015;1337:16–25. [Europe PMC free article] [Abstract] [Google Scholar]
18. Frizelle P, Thompson PA, Duta M, Bishop DVM. The understanding of complex syntax in children with Down syndrome. Wellcome Open Res. 2018;3:140. [Europe PMC free article] [Abstract] [Google Scholar]
19. Hawthorne K, Gerken L. From pauses to clauses: Prosody facilitates learning of syntactic constituency. Cognition. 2014;133:420–428. [Europe PMC free article] [Abstract] [Google Scholar]
20. Frazier L, Carlson K, Clifton C. Prosodic phrasing is central to language comprehension. Trends Cogn. Sci. 2006;10:244–249. [Abstract] [Google Scholar]
21. Choi Y, Mazuka R. Young children’s use of prosody in sentence parsing. J. Psycholinguist. Res. 2003;32:197–217. [Abstract] [Google Scholar]
22. Bryant GA. Prosodic contrasts in ironic speech. Discourse Process. 2010;47:545–566. [Google Scholar]
23. Hellbernd N, Sammler D. Prosody conveys speaker’s intentions: Acoustic cues for speech act perception. J. Mem. Lang. 2016;88:70–86. [Google Scholar]
24. Boutsen F. Prosody: The music of language and speech. ASHA Lead. 2003;8:6–8. [Google Scholar]
25. Brown S, Pfordresher PQ, Chow I. A musical model of speech rhythm. Psychomusicol. Music Mind Brain. 2017;27:95–112. [Google Scholar]
26. Nolan, F. & Jeon, H. S. Speech rhythm: A metaphor? Philos. Trans. R. Soc. B Biol. Sci.369(1658), 20130396. 10.1098/rstb.2013.0396 (2014). [Europe PMC free article] [Abstract]
27. de Carvalho A, Dautriche I, Lin I, Christophe A. Phrasal prosody constrains syntactic analysis in toddlers. Cognition. 2017;163:67–79. [Abstract] [Google Scholar]
28. de Carvalho A, He AX, Lidz J, Christophe A. Prosody and function words cue the acquisition of word meanings in 18-month-old infants. Psychol. Sci. 2019;30:319–332. [Abstract] [Google Scholar]
29. Nazzi T, Jusczyk PW, Johnson EK. Language discrimination by english-learning 5-month-olds: Effects of rhythm and familiarity. J. Mem. Lang. 2000;43:1–19. [Google Scholar]
30. Schmidt-Kassow M, Kotz SA. Attention and perceptual regularity in speech. NeuroReport. 2009;20:1643–1647. [Abstract] [Google Scholar]
31. Coutinho E, Dibben N. Psychoacoustic cues to emotion in speech prosody and music. Cogn. Emot. 2013;27:658–684. [Abstract] [Google Scholar]
32. Kotz SA, Ravignani A, Fitch WT. The Evolution of Rhythm Processing. Trends Cogn. Sci. 2018;22:896–910. [Abstract] [Google Scholar]
33. Moreno S. Can music influence language and cognition? Contemp. Music Rev. 2009;28:329–345. [Google Scholar]
34. Zioga I, Di Bernardi Luft C, Bhattacharya J. Musical training shapes neural responses to melodic and prosodic expectation. Brain Res. 2016;1650:267–282. [Europe PMC free article] [Abstract] [Google Scholar]
35. Hausen M, Torppa R, Salmela VR, Vainio M, Särkämö T. Music and speech prosody: A common rhythm. Front. Psychol. 2013;4:1–16. [Europe PMC free article] [Abstract] [Google Scholar]
36. Torppa R, Faulkner A, Laasonen M, Lipsanen J, Sammler D. Links of prosodic stress perception and musical activities to language skills of children with cochlear implants and normal hearing. Ear Hear. 2019;41:395–410. [Abstract] [Google Scholar]
37. Heffner CC, Slevc LR. Prosodic structure as a parallel to musical structure. Front. Psychol. 2015;6:1–14. [Europe PMC free article] [Abstract] [Google Scholar]
38. Miyake A, et al. The unity and diversity of executive functions and their contributions to complex ‘Frontal Lobe’ Tasks: A latent variable analysis. Cogn. Psychol. 2000;41:49–100. [Abstract] [Google Scholar]
39. Chai WJ, Abd Hamid AI, Abdullah JM. Working memory from the psychological and neurosciences perspectives: A review. Front. Psychol. 2018;9:1–16. [Europe PMC free article] [Abstract] [Google Scholar]
40. Ireland K, Iyer TA, Penhune VB. Contributions of age of start, cognitive abilities and practice to musical task performance in childhood. PLoS ONE. 2019;14:1–14. [Europe PMC free article] [Abstract] [Google Scholar]
41. Torppa R, et al. The perception of prosody and associated auditory cues in early-implanted children: The role of auditory working memory and musical activities. Int. J. Audiol. 2014;53:182–191. [Abstract] [Google Scholar]
42. Stepanov A, Kodrič KB, Stateva P. The role of working memory in children’s ability for prosodic discrimination. PLoS ONE. 2020;15:1–16. [Europe PMC free article] [Abstract] [Google Scholar]
43. Zhou H, Rossi S, Chen B. Effects of working memory capacity and tasks in processing L2 complex sentence: Evidence from Chinese-english bilinguals. Front. Psychol. 2017;8:1–10. [Europe PMC free article] [Abstract] [Google Scholar]
44. Marton K, Campanelli L, Farkas L. Grammatical sensitivity and working memory in children with language impairment. Acta Linguist. Hung. 2011;58:448–466. [Europe PMC free article] [Abstract] [Google Scholar]
45. Bailey JA, Penhune VB. Rhythm synchronization performance and auditory working memory in early- and late-trained musicians. Exp. Brain Res. 2010;204:91–101. [Abstract] [Google Scholar]
46. Hansen M, Wallentin M, Vuust P. Working memory and musical competence of musicians and non-musicians. Psychol. Music. 2013;41:779–793. [Google Scholar]
47. Puyjarinet F, Bégel V, Lopez R, Dellacherie D, Dalla Bella S. Children and adults with Attention-Deficit/hyperactivity disorder cannot move to the beat. Sci. Rep. 2017;7:11550. [Europe PMC free article] [Abstract] [Google Scholar]
48. Bégel V, et al. Rhythm as an independent determinant of developmental dyslexia. Dev. Psychol. 2022;58:339–358. [Abstract] [Google Scholar]
49. Dawson, J. I, Stout, C. E & Eyer, J. A. SPELT-3: Structured Photographic Expressive Language Test-Third Edition (Janelle Publications, 2003).
50. Wieland EA, McAuley JD, Dilley LC, Chang S-E. Evidence for a rhythm perception deficit in children who stutter. Brain Lang. 2015;144:26–34. [Europe PMC free article] [Abstract] [Google Scholar]
51. Singer JK, Lichtenberger EO, Kaufman JC, Kaufman AS, Kaufman NL. Contemporary Intellectual Assessment Theories, Tests, and Issues. New York: The Guilford Press; 2012. The Kaufman assessment battery for children— second edition and the Kaufman test of educational achievement second edition; pp. 269–296. [Google Scholar]
52. Sobel ME. Asymptotic confidence intervals for indirect effects in structural equation models. Sociol. Methodol. 1982;13:290. [Google Scholar]
53. Langus A, Mehler J, Nespor M. Rhythm in language acquisition. Neurosci. Biobehav. Rev. 2017;81:158–166. [Abstract] [Google Scholar]
54. Holliman AJ, Wood C, Sheehy K. Does speech rhythm sensitivity predict children’s reading ability 1 year later? J. Educ. Psychol. 2010;102:356–366. [Google Scholar]
55. Hoyte KJ, Brownell H, Wingfield A. Components of speech prosody and their use in detection of syntactic structure by older adults. Exp. Aging Res. 2009;35:129–151. [Europe PMC free article] [Abstract] [Google Scholar]
56. Groen MA, Veenendaal NJ, Verhoeven L. The role of prosody in reading comprehension: evidence from poor comprehenders. J. Res. Read. 2019;42:37–57. [Google Scholar]
57. D’Souza AA, Moradzadeh L, Wiseheart M. Musical training, bilingualism, and executive function: Working memory and inhibitory control. Cogn. Res. Princ. Implic. 2019;3(1):11. 10.1186/s41235-018-0095-6. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]
58. Persici V, Blain SD, Iversen JR, Key AP, Kotz SA, McAuley JD, Gordon RL. Individual differences in neural markers of beat processing predict spoken grammar skills in six-year-old children. PsyArXiv. 2019 10.31234/osf.io/9ebvc. [CrossRef] [Google Scholar]
59. Brod G, Opitz B. Does it really matter? Separating the effects of musical training on syntax acquisition. Front. Psychol. 2012;3:1–8. [Europe PMC free article] [Abstract] [Google Scholar]
60. Wesseldijk LW, Gordon RL, Mosing MA, Ullén F. Music and verbal ability—A twin study of genetic and environmental associations. Psychol. Aesthet. Creat. Arts. 2021 10.1037/aca0000401. [CrossRef] [Google Scholar]
61. Gustavson DE, et al. Musical instrument engagement in adolescence predicts verbal ability 4 years later: A twin and adoption study. Dev. Psychol. 2021;57:1943–1957. [Europe PMC free article] [Abstract] [Google Scholar]
62. Fiveash, A., Dalla Bella, S., Bigand, E., Gordon, R. L. & Tillmann, B. You got rhythm, or more: The multidimensionality of rhythmic abilities. (submitted). [Europe PMC free article] [Abstract]
63. Stepanov A, Pavlič M, Stateva P, Reboul A. Children’s early bilingualism and musical training influence prosodic discrimination of sentences in an unknown language. J. Acoust. Soc. Am. 2018;143:EL1–EL7. [Abstract] [Google Scholar]
64. Liu L, Kager R. Enhanced music sensitivity in 9-month-old bilingual infants. Cogn. Process. 2017;18:55–65. [Europe PMC free article] [Abstract] [Google Scholar]
65. Field J. Intelligibility and the Listener: The Role of Lexical Stress. TESOL Quarterly. 2017;39(3):399–423. 10.2307/3588487. [CrossRef] [Google Scholar]
66. Kohler KJ. Rhythm in speech and language: A new research paradigm. Phonetica. 2009;66:29–45. [Abstract] [Google Scholar]
67. Soderstrom M, Seidl A, Kemler Nelson DG, Jusczyk PW. The prosodic bootstrapping of phrases: Evidence from prelinguistic infants. J. Mem. Lang. 2003;49:249–267. [Google Scholar]
68. Woodruff Carr K, White-Schwoch T, Tierney AT, Strait DL, Kraus N. Beat synchronization predicts neural speech encoding and reading readiness in preschoolers. Proc. Natl. Acad. Sci. 2014;111:14559–14564. [Europe PMC free article] [Abstract] [Google Scholar]
69. Lundetræ K, Thomson JM. Rhythm production at school entry as a predictor of poor reading and spelling at the end of first grade. Read. Writ. 2018;31:215–237. [Europe PMC free article] [Abstract] [Google Scholar]
70. Brandt A, Gebrian M, Slevc LR. Music and early language acquisition. Front. Psychol. 2012;3:1–17. [Europe PMC free article] [Abstract] [Google Scholar]
71. Lerdahl F, Jackendoff R. An overview of hierarchical structure in music. Music Percept. Interdiscip. J. 1983;1:229–252. [Google Scholar]
72. Lerdahl F. Concepts and representations of musical hierarchies. Music Percept. 2015;33:83–95. [Google Scholar]
73. Brennan JR, Hale JT. Hierarchical structure guides rapid linguistic predictions during naturalistic listening. PLoS ONE. 2019;14:1–17. [Europe PMC free article] [Abstract] [Google Scholar]
74. Frank SL, Bod R, Christiansen MH. How hierarchical is language use? Proc. R. Soc. B Biol. Sci. 2012;279:4522–4531. [Europe PMC free article] [Abstract] [Google Scholar]
75. Patel AD. The OPERA hypothesis: Assumptions and clarifications. Ann. N. Y. Acad. Sci. 2012;1252:124–128. [Abstract] [Google Scholar]
76. Patel AD, Iversen JR. The linguistic benefits of musical abilities. Trends Cogn. Sci. 2007;11:369–372. [Abstract] [Google Scholar]
77. Fitch WT, Martins MD. Hierarchical processing in music, language, and action: Lashley revisited. Ann. N. Y. Acad. Sci. 2014;1316:87–104. [Europe PMC free article] [Abstract] [Google Scholar]
78. Patel AD. Rhythm in language and music: Parallels and differences. Ann. N. Y. Acad. Sci. 2003;999:140–143. [Abstract] [Google Scholar]
79. Heard M, Lee YS. Shared neural resources of rhythm and syntax: An ALE meta-analysis. Neuropsychologia. 2020;137:107284. [Abstract] [Google Scholar]
80. Fitch WT. Empirical approaches to the study of language evolution. Psychon. Bull. Rev. 2017;24:3–33. [Abstract] [Google Scholar]
81. Peretz I. Music, language and modularity framed in action. Psychol. Belg. 2009;49:157. [Google Scholar]
82. Asano R, Boeckx C, Seifert U. Hierarchical control as a shared neurocognitive mechanism for language and music. Cognition. 2021;216:104847. [Abstract] [Google Scholar]
83. McMullen E, Saffran JR. Music and language: A developmental comparison. Music Percept. 2004;21:289–311. [Google Scholar]
84. Leong, V., Kalashnikova, M., Burnham, D. & Goswami, U. Infant-Directed Speech Enhances Temporal Rhythmic Structure in the Envelope Centre for Neuroscience in Education, Department of Psychology, University of Cambridge pp. 2563–2567 (2014).
85. Kalashnikova M, Peter V, Di Liberto GM, Lalor EC, Burnham D. Infant-directed speech facilitates seven-month-old infants’ cortical tracking of speech. Sci. Rep. 2018;8:1–8. [Europe PMC free article] [Abstract] [Google Scholar]
86. Wermke K, Robb MP, Schluter PJ. Melody complexity of infants’ cry and non-cry vocalisations increases across the first six months. Sci. Rep. 2021;11:1–12. [Europe PMC free article] [Abstract] [Google Scholar]
87. Wermke K, Leising D, Stellzig-Eisenhauer A. Relation of melody complexity in infants’ cries to language outcome in the second year of life: A longitudinal study. Clin. Linguist. Phon. 2007;21:961–973. [Abstract] [Google Scholar]
88. Wermke K, Mende W. Musical elements in human infants’ cries: In the beginning is the melody. Music. Sci. 2009;13:151–175. [Google Scholar]
89. Saffran JR, Loman MM, Robertson RRW. Infant memory for musical experiences. Cognition. 2000;77:15–23. [Abstract] [Google Scholar]
90. Harris PA, et al. Research electronic data capture (REDCap)-A metadata-driven methodology and workflow process for providing translational research informatics support. J. Biomed. Inform. 2009;42:377–381. [Europe PMC free article] [Abstract] [Google Scholar]
91. Harris PA, et al. The REDCap consortium: Building an international community of software platform partners. J. Biomed. Inform. 2019;95:103208. [Europe PMC free article] [Abstract] [Google Scholar]
92. Ladányi, E. et al. Using Motor Tempi to Understand Rhythm and Grammatical Skills in Developmental Language Disorder and Typical Language Development. Neurobiology of Language. 1–28. 10.1162/nol_a_00082 (2022). [Europe PMC free article] [Abstract]
93. Ehrler, D. & McGhee, R. PTONI: Primary test of nonverbal intelligence. Pro-Ed. (2008).
94. Newcomer, P. L. & Hammill, D. D. TOLD-P:4 Test of language development (4th ed.). Pro-Ed. (2008).
95. Rice, M. L. & Wexler, K. Test of Early Grammatical Impairment. The Psychological Corporation (2001).
96. Grahn JA, Brett M. Impairment of beat-based rhythm discrimination in Parkinson’s disease. Cortex. 2009;45:54–61. [Abstract] [Google Scholar]
97. Essens PJ, Povel DJ. Metrical and nonmetrical representations of temporal patterns. Percept. Psychophys. 1985;37:1–7. [Abstract] [Google Scholar]
98. McAuley JD, Jones MR, Holub S, Johnston HM, Miller NS. The time of our lives: Life span development of timing and event tracking. J. Exp. Psychol. Gen. 2006;135:348–367. [Abstract] [Google Scholar]
99. Soman UG. Characterizing Perception of Prosody in Children with Hearing Loss. Nashville, Tennessee: Vanderbilt University; 2017. [Google Scholar]
100. R Core Team (2021). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL http://www.r-project.org.
101. Wickham H, et al. Welcome to the Tidyverse. J. Open Source Softw. 2019;4:1686. [Google Scholar]
102. Rosseel Y. lavaan: An R Package for Structural Equation Modeling. J. Stat. Softw. 2019;48(2):1–36. 10.18637/jss.v048.i02. [CrossRef] [Google Scholar]

Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

Citations & impact 


Impact metrics

Jump to Citations

Citations of article over time

Alternative metrics

Altmetric item for https://www.altmetric.com/details/142258903
Altmetric
Discover the attention surrounding your research
https://www.altmetric.com/details/142258903

Smart citations by scite.ai
Smart citations by scite.ai include citation statements extracted from the full text of the citing article. The number of the statements may be higher than the number of citations provided by EuropePMC if one paper cites another multiple times or lower if scite has not yet processed some of the citing articles.
Explore citation contexts and check if this article has been supported or disputed.
https://scite.ai/reports/10.1038/s41598-022-21902-0

Supporting
Mentioning
Contrasting
0
7
0

Article citations


Go to all (6) article citations

Data 


Data behind the article

This data has been text mined from the article, or deposited into data resources.

Funding 


Funders who supported this work.

NCATS NIH HHS (1)

NICHD NIH HHS (2)

NIDCD NIH HHS (3)