The final published version of this paper is available at:
http://www.springerlink.com/index/10.1007/s11192-010-0305-6
Please, quote as: Acosta, M., Coronado, D., Ferrándiz, E., y León, M. D. 2011. Factors
affecting inter-regional academic scientific collaboration within Europe: the role of
economic distance. Scientometrics, 87(1): 63-74.
Factors affecting inter-regional academic scientific collaboration
within Europe: The role of economic distance
Manuel Acosta
Daniel Coronado
Esther Ferrándiz
Mª Dolores León
UNIVERSITY OF CADIZ
Facultad de Ciencias Económicas y Empresariales, Universidad de Cádiz.
c/ Duque de Nájera, 8
11002-CADIZ
SPAIN
Corresponding author:
Daniel Coronado
e-mail: daniel.coronado@uca.es
Tf: 34-956015415
Fax: 34-956015388
Abstract: This paper offers some insights into scientific collaboration (SC) at the regional level by
drawing upon two lines of inquiry. The first involves examining the spatial patterns of university SC
across the EU-15 (all countries belonging to the European Union between 1995 and 2004). The second
consists of extending the current empirical analysis on regional SC collaboration by including the
economic distance between regions in the model along with other variables suggested by the extant
literature. The methodology relies on co-publications as a proxy for academic collaboration, and in order
to test the relevance of economic distance for the intensity of collaboration between regions, we put
forward a gravity equation. The descriptive results show that there are significant differences in the
production of academic scientific papers between less-favoured regions and core regions. However, the
intensity of collaboration is similar in both types of regions. Our econometric findings suggest that
differences in scientific resources (as measured by R&D expenditure) between regions are relevant in
explaining academic scientific collaborations, while distance in the level of development (as measured by
per capita GDP) does not appear to play any significant role. Nevertheless, other variables in the analysis,
including geographical distance, specialization and cultural factors, do yield significant estimated
coefficients, and this is consistent with the previous literature on regional SC.
Key words: economic distance, academic scientific collaboration, gravity equation, co-authorship
MSC classifications: 62P20, 62J02, 97K40
JEL classifications: R11, O33, I23
1
Introduction
In this paper, we provide some insight into the factors affecting SC using two lines of
inquiry. The first involves examining the university SC patterns across Nomenclature of
Territorial Units for Statistics (NUTS) II regions in the EU-15 (all countries belonging
to the European Union between 1995 and 2004). The second line of inquiry consists of
extending the empirical literature by introducing economic aspects, such as the role of
economic distance (regional differences in the levels of development and higher
education research and development (R&D) expenditures), along with other variables
suggested in the literature. This paper then addresses a problem in the small literature on
this topic, whereby there is some focus on the effects of different kinds of distances
(geographical, institutional and cultural), but the effects of economic factors at this scale
remain largely unknown, especially in Europe.
The dataset used to measure regional scientific production and collaboration consists of
a regionalized sample of 994,938 scientific papers by authors affiliated with European
universities from 1998 to 2004. The data were obtained from the Thomson ISI
(Information Sciences Institute) database and include papers from all scientific fields
(except the social sciences and humanities) for over 500 European universities at the
NUTS II level of regional aggregation. The empirical analysis involves two parts. First,
we carry out a brief descriptive study of the co-publication patterns across regions.
Second, we put forward an econometric framework to test the relevance of the
economic factors, along with other variables, on the intensity of collaboration. The
paper is organized as follows. Section 2 reviews the relevant literature. Section 3
describes the methodology. Section 4 details the data. Section 5 provides the results.
The main conclusions and policy implications are given in the final section.
2
Empirical background
As stated by the scarce literature on the collaborative patterns of research at the regional
level (Liang and Zhu, 2002; Okubo and Zitt; 2004; Ponds et al., 2007; Boshoff, 2010;
and Hoekman et al., 2009, 2010), geographical distance, similarities in the
specialization patterns of SC in regions and cultural proximity are significant factors
that positively affect SC between regions. However, no previous research has
considered the role of economic factors on SC at a regional scale. We need to turn to the
international literature on collaboration to find suitable references. In this regard, the
centre–periphery discussion at the international level (Schott, 1998; Schubert and
Sooryamoorthy, 2010) provides some clues on whether differences in economic
development between areas may determine the patterns of scientific collaboration.
According to this hypothesis, peripheral countries are willing to collaborate in order to
gain access to resources, while core (centre) countries collaborate for the purpose of
complementarities. For example, using interviews, Hwang (2008) concluded that the
main aim for Korean scientists and engineers in international collaboration was to
obtain advanced knowledge and technologies from core scientists in exchange for
funding core knowledge production. Sonnenwald (2007) described some examples of
collaboration between Africans and non-Africans; Africans granted access to local
communities and non-Africans provided free treatment, lab equipment and training.
Also between China and Taiwan; Taiwan provided the experienced, mid-career
scientists that China lacked because of the Cultural Revolution and China provided a
large number of younger scientists to increase the size of Taiwan’s scientific
community. Despite this promising work, empirical research including specific
economic indicators is limited and the evidence is weak. In light of this discussion,
3
some scepticism on the centre–periphery hypothesis remains. For example, Wagner and
Leydesdorff (2005) pointed out that the centre–periphery model does not explain the
dynamic through which scientific centres both collaborate and compete with one
another for partners at the international level. Some doubt also remains on the role of
the level of economic resources devoted to R&D on international collaboration. In fact,
among the scant empirical evidence, Kim (2005) surprisingly concluded a negative
relationship between R&D expenditure and international collaboration in Korea.
Methodology
In order to test the role of economic distance along with other variables on regional
academic scientific collaboration, we put forward a gravity equation. The gravity model
has been increasingly applied in a number of studies of the regional scientific
collaboration (Ponds et al., 2007; Scherngell and Barber, 2009; Hoekman et al., 2010).
Typically, a simple version of the gravity equation takes the form:
Fij = α 0Yiα1 Y jα 2 Dijα 3
(Eq. 1)
where F (scientific collaboration) is a function of characteristics of the origin (Yi),
characteristics of the destination (Yj), and some measurement of distance between both
areas (Dij). To account for deviations from the theory, stochastic versions of the
equation are used in empirical studies by adding an independent stochastic error term.
Given the count nature of our data and the large number of zero observations in the
sample, we estimate the gravity equation using a negative zero-inflated binomial
(ZINB) model in which the collaboration count between regions is specified as an
exponential mean regression model. The zeros for the several regions (observations) not
including collaboration potentially arise from two sources. The first distinguishes those
regions with no potential for scientific collaboration (for example, one or both do not
4
have any publications during the initial period of the sample). The second source stems
from those regions with the capacity to collaborate, but which did not present any
collaboration in the observed period. The model takes the form:
λij + (1 − λij )h( Fij = 0,θ | X )
for Fij = 0
Pr( Fij ) =
for Fij = 1,2,...
(1 − λij )h( Fij ,θ | X )
(Eq. 2)
where h( Fij ,θ | X ) is the negative binomial density with mean exp( X , β ) , dispersion
parameter α , and θ = ( β 'α )' . Here, λ is a zero-inflation parameter representing the
proportion of observations with a strictly zero count (0 < λ < 1) as determined by a logit
model on all (or several) observed explanatory variables: λij = exp( Xϕ ) / 1 − exp( Xϕ ) .
The dependent variable Fij = Ascijt represents the counts of academic scientific
collaboration between region i and region j for the period t. We use co-publications to
measure collaboration among regions, as it is a well-established indicator with a long
tradition in scientific collaboration studies at both the individual and international levels
of analysis (for reviews, see Melin and Persson (1996), Katz and Martin (1997) and
Laudel (2002)).
X is a vector including the following independent variables as suggested by the
literature:
Pubit0 is the number of academic scientific publications in region i for a period t0 before
the collaboration takes place.
Pubjt0 is the number of academic scientific publications in region j for a period t0 before
the collaboration takes place.
Gdistij is the geographical distance in kilometres between the capitals of regions i and j.
Contij is a dummy variable that takes a value of 1 if i and j are contiguous regions, 0
otherwise. Note that while this is an easy way to measure regional proximity, it does
have several drawbacks. Of these, the main limitation is that we are unable to capture
5
differences in the size of the regions, the number of bordering regions and the regional
concentration of higher education institutions using this simple dummy variable.
Countryij is a dummy variable that takes a value of 1 when regions i and j are in the
same country, 0 otherwise. This is because regions in the same country usually share a
similar culture, language, policies, etc. This variable helps capture these factors.
Specijt0 is the proximity in scientific specialization between regions i and j in period t0.
This variable is measured using an index similar to that proposed by Peri (2005) of the
correlation coefficient between the 12-field composition of scientific papers in regions i
and j.
In order to contrast the centre–periphery hypothesis in collaboration patterns across
regions, we also include in our model two variables to account for the effects of
economic distance between pairs of regions:
Edistijt0: economic inequality between regions i and j is proxied with the absolute
difference in per capita income for the period t0.
RDdistijt0: the difference in academic economic resources between regions i and j is
captured with the absolute difference in per capita higher education R&D expenditure
for the period t0.
To prevent endogeneity, the explanatory variables refer to the initial period t0, that is,
before collaboration takes place. This is because although the theoretical gravity
equation establishes that the number of collaborations between a pair of regions
depends on the “mass” of publications in each region, reverse causality is also possible,
given the effect that collaboration may exert on scientific productivity. Details on the
estimation procedure of the ZINB model can be found in Long (1997) and Cameron and
Trivedi (1998, 2009).
6
Data
The empirical data used in this study comprises a set of research articles published in
scientific journals indexed by the Science Citation Index Expanded (SCI). As is well
known, the SCI is a bibliographical database produced by the Information Sciences
Institute (ISI), which is in turn a part of Thomson Reuters’ Web of Science. The main
advantage of ISI citation indexes is that they provide a complete list of all authors and
their affiliations. There are also some known limitations of this database. For example,
it does not include all journals, and the ISI journal list is biased towards journals
published in English. At the regional level, collaboration takes place when a paper is coauthored by researchers affiliated with universities located in different regions.
The procedure to account for collaboration between pairs of NUTS II regions in the EU15 followed these four steps: i) Data on academic publications containing at least one
author affiliated with a university from an EU-15 country for 1998–2004 were retrieved
from the SCI. We included several search terms to help identify higher education
institutions in both English and other languages (fachhochschule, yliopisto, ecole,
institut nacional polytehcnique, politécnico, scuola, hogskola, etc.). This search resulted
in 994,938 publications.
ii) The second step involved regionalization at the NUTS II level of aggregation of the
academic publications obtained in Step 1 (213 regions1). We first identified the NUTS
II associated with each university using the list provided by the members of the
1
Number of regions in the EU-15 according to Regulation (EC) No. 1059/2003 of the European
Parliament and the Council of 26 May 2003 on the establishment of a common classification of territorial
units for statistics (NUTS) (excluding extra-regio).
7
European Indicators, Cyberspace and the Science-Technology-Economy System
(EICSTES). For those universities not included in the EICSTES list, we searched for the
address on each university’s website. Following the full-count process (assigning the
entire publication to those regions that collaborated in its production), we obtained
1,206,644 publications and 387,545 inter-regional collaborations. iii) The third step
involves classification by scientific field. We grouped the ISI categories into 12 broad
scientific disciplines using the Third European Report on S&T indicators2. iv) The
fourth step provided the collaboration matrix between regions. The data on scientific
collaboration was placed into a (213 × 213) symmetrical matrix containing all copublications between regions. Each cell then includes the number of scientific copublications between region i and region j, and therefore excludes domestic
collaboration (academic scientific collaboration between researchers in the same
region). Consequently, there are potentially (213 × 212) ÷ 2 =22,548 collaboration links
(observations) in the EU-15 at the NUTS II scale of analysis.
A summary of the main statistics from the full sample is reported in Table 1. Note that
the number of academic publications increased by 22.83% from 1998 to 2004, while the
number of regional collaborations increased by 51.28% over the same period.
[Table 1. About here]
The indexes in Table 1 reveal that both the production of scientific knowledge and the
patterns of scientific collaboration present a high level of concentration in a few regions.
As shown in Table 1, the Gini coefficient for publications takes a value of 0.61 for the
2
The classification was established by the Centre for Science and Technology Studies (CWTS) at Leiden
University (see Tijssen and van Leeuwen, 2003). For categories not included in the CWTS 2003
classification, we used an updated (but unpublished) classification kindly provided by the CWTS.
8
initial year (1998) and 0.59 for the latest year (2004) in the sample. The value of the
Gini coefficient is slightly lower for regional collaborations. Further, as shown, the
trend in the Gini coefficients for both publications and collaborations is slightly
downward over the period 1998 to 2004. The remaining concentration indexes in Table
1 lead to the same conclusion. For example, the value of the C5 index takes a value of
about 12 for collaborations, suggesting that just five regions account for 12% of papers
co-authored with academics in other regions. Likewise, the value of the C10 index is 21,
indicating that 10 regions provide 21% of co-authored papers.
Table 2 details the number of academic papers and the number of papers in
collaboration by type of NUTS region. Drawing on this Table, it is clear that 29% of the
less-developed EU-15 regions contributed to only 15.7% of the EU-15 published
papers. On average, the capacity for publication of a region in this group is about 45%
of the capacity of a developed region in the core group. The disparities are rather
stronger when we consider a classification of regions based on higher education R&D
expenditure. However, despite this apparently unbalanced picture of the generation of
academic papers, the intensity of collaboration is similar in both groups of regions. The
main question to respond to in the next section is to what extent economic distance is an
obstacle to collaboration between regions with different levels of development or
university R&D resources.
Results
The empirical equation of the gravity model was run using cross-sectional data where
the dependent variable, Asc, includes the counts of academic scientific collaboration
between EU-15 regions from 1998 to 2004. The explanatory variables capturing the
9
mass of publications, specialization and economic distances refer to the initial year,
1998. Descriptive statistics are presented in Table 3.
[Table 3. About here]
We estimated three models (Table 4). Model I includes the number of publications (in
logs) in each region, measures of distance and the dummy variable capturing cultural
factors. Model II contains the variable capturing the similarities in scientific
specialization between regions, along with the explanatory variables in Model I. We
first estimated these two models in order to determine whether the effects of the factors
affecting academic collaboration between regions provide similar behaviour to that in
the empirical literature. Model III adds two new variables capturing the differences
between regions in terms of economic development and resources. Note that the number
of observations is different for each model. To start with, Model I was estimated with
all possible observations between the pairs of regions. In Model II, some observations
were excluded on the basis of the variable Spec, because it does not make sense to
obtain the coefficient of correlation between scientific specializations in regions i and j
when one or both have no publications. Finally, Model III was obtained with fewer
observations because of missing data on higher education R&D expenditure for some
regions.
[Table 4. About here]
To test the reliability of these estimates, we follow a top-down procedure, and first
estimate the ZINB equations and then other count data models, including Zero-Inflated
Poisson, Negative Binomial and Poisson (results not presented). For the purposes of
comparison, we applied the usual statistics of over-dispersion and the LR test in Vuong
(1989) (see Table 4). In all cases, the ZINB models were preferred. Based on this table,
the main findings suggest that:
10
1. The coefficients of the variables capturing the differences in the levels of
development and economic resources devoted to R&D in universities (Model 3) both
display a negative sign, suggesting that the greater the economic difference between two
regions, the fewer the number of collaborations. However, the economic distance
measured as the absolute differences in per capita income does not affect collaborative
behaviour, while the absolute differences in the level of economic resources devoted to
university R&D are highly significant. Together, these results indicate that regions tend
to collaborate with other regions independently of their level of economic development.
They collaborate with regions with similar characteristics in terms of the level of
resources devoted to R&D3. These results are inconsistent with the SC centre–periphery
hypothesis because, as explained earlier and according to the SC centre–periphery
hypothesis, we expect a positive relationship between SC and economic distance.
Although our data do not provide reasons for this result, a tentative explanation is that
the economic distance between our pairs of regions is probably not sufficiently wide, as
it is in other contexts where this hypothesis holds (as in, say, Boshoff’s (2010) study of
SC between African and non-African countries). In addition, the significant negative
relationship between SC and R&D distance in European regions is not entirely
unexpected, as the greater the amount of resources, the greater the opportunities to
attend international conferences and to engage in collaboration. Moreover, core regions
may not find complementarities with less-developed regions (those with scarce
resources devoted to R&D).
3
Note that the level of development and university R&D expenditure (or their differences) do not have to
be necessarily related. For example, some regions may have a high level of development because of
tertiary activities (such as tourism) that have little to do with university R&D expenditure.
11
2. The other variables included in the models present signs and coefficient as expected
and consistent with previous empirical literature:
a) The coefficients of the variables capturing the number of publications for each pair of
regions are both significant and have positive signs. This is a natural result because the
mass of publications usually implies more researchers in each region and therefore more
opportunities for collaboration.
b) The coefficients of the two variables of distance are both significant, and also have
their expected signs. Accordingly, geographical distance and contiguity are both
relevant variables in explaining academic scientific collaboration between regions. The
negative sign of the first variable indicates that collaboration decreases with distance,
while the positive sign of the second variable shows that bordered regions explain their
scientific collaborative behaviour. The main argument explaining this result relies on
the fact that collaboration usually requires the mobility of researchers; that is,
coordination, knowledge sharing and feedback sometimes require face-to-face contact.
c) The variable capturing the correlation between the levels of scientific specialization
between regions displays a positive sign, suggesting that proximity in scientific
specialization is significant in explaining the number of collaborations between regions.
d) Finally, the binary variable capturing collaboration between regions in the same
country is also relevant. This suggests that cultural similarities and other characteristics,
12
such as a common language or policies, help explain scientific collaboration between
regions.
Note that these findings hold for all three models. This means that a reduction in the
number of observations used in estimating the models produces some change in the
estimated coefficients, but not in their levels of significance.
Conclusions
This paper attempts to identify the spatial distribution of academic scientific
collaboration patterns across European regions, and is mainly aimed at evaluating the
role of economic differences between regions. A preliminary descriptive analysis
suggests a growing trend in collaboration between regions, increasing from 28.35% of
co-authored publications in 1998 to 34.92% in 2004. The data also displays a high level
of concentration of SC in a few regions, with little change over the period 1998–2004.
The separation of regions according to different levels of economic development
indicates that an Objective 1 region (one with a GDP per capita less than 75% of the
EU-15 mean) produced on average less than half (45%) the papers of a more
economically advanced region. However, both groups of regions display a similar rate
of publications involving collaboration with other regions.
Another important question we responded to in the empirical analysis was the extent to
which economic distance is an obstacle to collaboration between regions with different
levels of development and/or university R&D resources. For this purpose, we estimated
a gravity equation using empirical ZINB models for the period 1998–2004. The results
lead to the following conclusions:
13
- The centre–periphery hypothesis applying to SC suggests that researchers in researchlagging countries are willing to collaborate with those in core countries in order to gain
access to resources, while researchers in core countries collaborate by seeking
complementarities. According to our analysis, this hypothesis does not hold at the
regional level in the EU-15. From a policy viewpoint, this finding suggests that if
collaboration becomes a priority, economic distance (in terms of per capita R&D
expenditures) needs to be reduced in order to successfully attain the fulfilment of a
European Research Area.
- Other results in the gravity model indicate that there are also other variables explaining
SC between European regions. In particular, we found that the number of publications
in the initial year, geographical distance and border contiguity, similarities in scientific
specialization between the two regions, and the sharing of similar languages, cultures
and policies, also help explain SC. Results concerning the relevance of these variables
are similar to those obtained in previous work.
Finally, the focus of this paper was to analyze the relationship between SC and
economic distance, along with other variables, but we have offered only a few clues on
the reasons for our outcomes. Further research is necessary to explain, for example, the
variables capturing economic distance in European regions, particularly whether
transport costs can explain the negative effect of geographical distance on SC.
14
Acknowledgements
The authors would like to thank Martin Feldkircher (Austrian National Bank) for
providing the spatial weight contiguity matrix. They are also grateful to Raffaele Paci
and Barbara Dettori from the Centro Ricerche Economiche Nord Sud (CRENOS) at the
University of Cagliari for their assistance in the construction of the distance matrix and
for providing the coordinates of the centre regions, and Robert Tijssen of the Centre for
Science and Technology Studies (CWTS) at Leiden University for providing the
updated classification. An early version of this paper was presented at the 12th
European Network on Industrial Policy (EUNIP) International Conference held in Reus,
Spain, from 9–11 June 2010. We thank the conference participants, particularly James
Wilson, for their helpful comments and suggestions. The authors are also very grateful
to the reviewers for constructive and insightful comments. This work was supported by
the Consejería de Innovación, Ciencia y Empresa (Junta de Andalucía-Spain) [Grants
P06-SEJ-02087 and P08-SEJ-3981].
15
References
Boshoff, N. (2010). South–South Research Collaboration of Countries in the Southern
African Development Community (SADC). Scientometrics, 84, 481–503.
Cameron, C., Trivedi, P.K. (1998). Regression Analysis of Count Data. Cambridge:
Cambridge University Press.
Cameron, C., Trivedi, P.K. (2009). Microeconometrics using Stata. College Station
Texas: Stata Press.
Hoekman, J., Frenken, K., Oort, F. (2009). The Geography of Collaborative Knowledge
Production in Europe. Annals of Regional Science, 43, 721–738.
Hoekman, J., Frenken K., Tijssen, R.J.W. (2010). Research Collaboration at a Distance:
Changing Spatial Patterns of Scientific Collaboration within Europe. Research
Policy, doi:10.1016/j.respol.2010.01.012.
Hwang, K. (2008). International Collaboration in Multilayered Center–Periphery in the
Globalization of Science and Technology. Science Technology Human Values, 33,
101–133.
Katz, J.S., Martin, B.R. (1997). What is Research Collaboration? Research Policy, 26,
1–18.
Kim, M.-J. (2005). Korean science and international collaboration, 1995–2000.
Scientometrics, 63, 321–339.
Laudel, G. (2002). What Do We Measure by Co-authorships? Research Evaluation, 11,
3–15.
Liang, L., Zhu, L. (2002). Major Factors Affecting China’s Inter-regional Research
Collaboration: Regional Scientific Productivity and Geographical Proximity.
Scientometrics, 55, 287–316.
16
Long, J.S. (1997). Regression Models for Categorical and Limited Dependent
Variables, Sage, Thousand Oaks.
Melin, G., Persson, O. (1996). Studying Research Collaboration using Co-authorships.
Scientometrics, 36, 363–377.
Okubo, Y., Zitt, M. (2004). Searching for Research Integration across Europe: A Closer
Look at International and Inter-regional Collaboration in France. Science and Public
Policy, 31, 213–226.
Peri, G. (2005). Determinants of Knowledge Flows and their Effect on Innovation.
Review of Economics and Statistics, 87, 308–322.
Ponds, R., van Oort, F., Frenken, K. (2007). The Geographical and Institutional
Proximity of Research Collaboration. Papers in Regional Science, 86, 423–443.
Scherngell, T., Barber, M.J. (2009). Spatial Interaction Modelling of Cross-region R&D
Collaborations: Empirical Evidence from the 5th EU Framework Programme.
Papers in Regional Science, 88, 531–546.
Schott, T. (1998). Ties between Center and Periphery in the Scientific World-System:
Accumulation of Rewards, Dominance and Self-Reliance in the Center. Journal of
World-Systems Research, 4, 112–144.
Schubert, T., Sooryamoorthy, R. (2010). Can the Centre–Periphery Model Explain
Patterns
of
International
Scientific
Collaboration
among
Threshold
and
Industrialised Countries? The Case of South Africa and Germany. Scientometrics,
83, 181–203.
Sonnenwald, D.H. (2007). Scientific Collaboration: A Synthesis of Challenges and
Strategies, in: Cronin, B. (Ed.), Annual Review of Information Science and
Technology, vol. 41. Medford NJ: Information Today, Inc.
17
Tijssen, R.J.W, van Leeuwen, T.N., (2003). Bibliometric Analyses of World Science,
Extended Technical Annex to Chapter 5 of the Third European Report on Science &
Technology Indicators.
Vuong, Q.H. (1989), Likelihood Ratio Tests for Model Selection and Non-Nested
Hypotheses. Econometrica, 57, 307–33.
Wagner, C.S., Leydesdorff, L. (2005). Mapping the Network of Global Science:
Comparing International Co-authorship from 1990 to 2000. International Journal of
Technology and Globalization, 1, 185–208.
18
Table 1. Evolution and regional concentration indexes of academic scientific
publications and collaborations 1998-2004
1998
1999
2000
2001
2002
2003
2004
1998–2004
157,446
164,492
166,669
170,603
174,266
179,770
193,398
1,206,644
Mean
739.19
772.27
782.49
800.96
818.16
844.00
907.98
5,664.99
Max.
5,794
5,950
5,887
6,162
6,186
6,401
6,701
43,081
Min.
0
0
0
0
0
0
0
0
Std Dev.
937.41
972.52
976.27
995.60
1,013.52
1,046.23
1,100.44
7,024.09
C. Var.(1)
1.26
1.25
1.24
1.24
1.23
1.23
1.21
1.23
Gini coeff. (2)
0.61
0.60
0.60
0.60
0.59
0.59
0.58
0.59
C5(3)
13.38
13.38
13.16
13.31
13.12
13.32
12.91
13.18
C10(4)
23.04
22.88
22.82
22.65
22.41
22.64
22.13
22.61
C25(5)
44.86
44.69
44.13
44.05
44.10
44.06
43.37
44.02
44,643
48,588
51,459
55,169
58,203
61,947
67,536
387,545
Publications
N1
N2
Collaborations
(N2÷N1)×100
28.35
29.54
30.87
32.34
33.40
34.46
34.92
32.12
Mean
209.59
228.11
241.59
259.00
273.25
290.83
317.07
1,819.46
Max.
Min.
1,374
0
1,566
0
1,665
0
1,821
0
1,882
0
2,020
0
2,180
0
12,508
0
Std Dev.
251.90
272.54
287.61
304.44
323.30
346.52
369.17
2,149.98
C. Var.(1)
1.20
1.19
1.19
1.17
1.18
1.19
1.16
1.18
Gini coeff. (2)
0.59
0.59
0.58
0.58
0.58
0.58
0.57
0.56
C5(3)
12.35
12.31
12.30
12.30
12.29
12.38
11.98
12.22
C10(4)
21.94
21.74
21.77
21.37
21.55
21.66
21.02
21.53
C25(5)
42.78
42.66
42.50
41.64
42.19
42.54
41.84
42.22
(1)
Coefficient of variation = Std Dev. ÷ Mean; (2) The Gini coefficient ranges between 0 and 1; the larger the value the higher the
level of regional concentration in publications or collaborations. (3)(4)(5) Concentration indexes of publications for the top 5, 10
and 25 regions with the largest number of scientific papers, respectively.
Table 2. Regional production of academic papers and collaborative papers by type of
NUTS region (*)
Regions with more than Regions with less than 75%
75% of the EU-15 average of the EU-15 average GDP
GDP per capita
per capita
1998
2004
A. No. Papers
20,996
30,463
Mean
338.65
491.34
Std. Dev
434.08
589.93
6,108
10,548
98.52
170.13
122.34
197.46
B. No. Coll.
Mean
Std Dev.
(B/A)*100
A. No. Papers
Mean
45.09
72.69
29.09
34.63
19.02
136,450
162,935
19.41
903.64
1,079.04
1,035.29
1,211.95
B. No. Coll.
38,535
56,988
Mean
255.20
377.40
Std Dev.
276.41
405.18
Std Dev.
1998–2004
% increase
47.89
Groups of regions according their level of higher education
R&D expenditure
1998–2004
1998
2004
% increase
Regions with more than Regions with less than 75%
75% of the EU-15 average of the EU-15 average R&D
per capita
R&D per capita
Groups of regions according to their level of development
A. No. Papers
12,064
15,855
Mean
236.55
317.10
Std Dev.
319.20
430.61
3,443
5,905
Mean
67.51
118.10
Std Dev.
78.00
150.39
B. No. Coll.
(B/A)*100
31.42
71.51
28.54
37.24
30.50
86,905
102,274
17.68
Mean
1,259.49
1,461.06
Std Dev.
1,054.00
1,214.21
B. No. Coll.
24,635
34,998
Mean
357.03
507.22
Std Dev.
278.48
398.17
A. No. Papers
42.07
(B/A)*100
28.24
34.98
23.85
(B/A)*100
28.35
34.22
20.72
(*) The group of less-developed regions comprises 62 NUTS regions, where the number of NUTs regions with more than 75% of the EU15 average GDP per capita is 151 (213 in total). Because of the lack of data, the number of regions with less and more than 75% of EU-15
average university R&D per capita falls to 120, with 51 NUTS regions in the first group and 69 in the second group.
19
Table 3. Descriptive statistics
Variable
Ascij
Mean
Std Dev.
Min.
Max.
23.9821
73.0597
0
1,487
Ln(Pubi)
6.1460
1.6060
0
8.6645
Ln (Pubj)
6.2067
1.5884
0
8.6645
10.9701
7.6767
0.4228
60.0071
Gdistij
Country
0.1855
0.3887
0
1
Contij
0.0341
0.1815
0
1
Specij
0.4618
0.3639
–0.5502
0.9905
Edistij
7.9900
6.1366
0.0023
36.7493
RDdistij
0.2890
0.2893
0
1.4652
Table 4. Results of zero-inflated negative binomial regressions
MODEL 1 (ZINB)
Coeff.
Std Err.
Constant
MODEL 2 (ZINB)
Coeff.
Std Err.
MODEL 3 (ZINB)
Coeff.
Std Err.
–6.9972
0.1044
***
–6.7297
.1041
***
–8.905
0.1635
***
Pubi
0.7614
0.0103
***
0.7083
.0104
***
0.8113
0.0149
***
Pubj
0.6310
0.0093
***
0.5865
.0096
***
0.7976
0.0155
***
Gdistij
–0.0152
0.0013
***
–0.0131
.0013
***
–0.1798
0.0027
***
Country
1.8186
0.0303
***
1.7509
.0303
***
1.9096
0.0352
***
Contij
0.8971
0.0615
***
0.9138
.0607
***
0.8846
0.0600
***
0.6405
.0389
***
***
Specij
0.5934
0.0483
Edistij
–0.0044
0.0027
RDdistij
–0.1716
0.0513
***
9.7186
0.7727
***
Inflated (logit)
Constant
8.3158
0.3389
***
8.4406
.3647
***
Pubi
–0.8210
0.0371
***
–0.8416
.03973
***
–0.9331
0.0801
***
Pubj
–0.8603
0.0289
***
–0.8071
.0338
***
–0.9498
0.0812
***
Gdistij
–0.0049
0.0054
–0.0086
.0055
–0.0187
0.0153
Country
–1.8372
0.1961
***
–1.9299
.2105
***
–3.4769
0.6475
***
Contij
–2.2896
0.6269
***
–2.2542
1.061
***
–0.7423
1.0202
***
–0.6781
.1731
***
0.2543
0.3212
***
Edistij
0.0189
0.0167
RDdistij
0.2187
0.3470
Specij
LnAlpha
Alpha
LR-test
Likelihood-ratio test alpha
Vuong test
No. obs.
–0.1089
***
–0.1441
12,627.72
***
1.2e+05
***
7.75
***
0.8967
22,578
***
–0.6509
12,863.10
***
6,899.73
***
1.2e+05
***
3.7e+04
***
8.37
***
4.24
***
0.8682
16,110
***
0.5215
5,978
* p < 0.10, ** p < 0.05, *** p < 0.01
20