Project report Linking Science to Technology Bibliographic References in Patents Volume 3 Literature Review IMPROVING HUMAN RESEARCH POTENTIAL AND THE SOCIO-ECONOMIC KNOWLEDGE BASE EUR 20492 Linking Science to Technology Bibliographic References in Patents Volume 3 Literature Review Authors : A. Verbeek, E. Zimmerman, P. Andries (Researchers) Prof. Dr. Ir. K. Debackere, Dr. M. Luwel, Prof. Dr. R. Veugelers (Promotors) In co-operation with Synes NV Katholieke Universiteit Leuven – Incentim, Belgium This publication results from work funded by the European Commission, Research DG, Directorate K – Kwoledge-based economy and society under the Common Basis of Science, Technology and Innovation Indicators (CBSTII) action of the Support for the Development of Science and Technology Policies in Europe sub programme of the Improving Human Research Potential and the Socio-Economic Knowledge Base specific programme of the Fifth Framework Progamme for Research and Technological Development (contract number ERBHPV2-CT-199303). The projects were managed by Brian SLOAN of the Competitiveness, economic analysis, indicators Unit of Directorate K of Research DG . 2 3 EUROPEAN COMMISSION RESEARCH Commissioner : Philippe Busquin Directorate-General for Research Director General: Achilleas Mitsos The Directorate-General for Research initiates, develops and follows the Commission’s political initiatives for the realisation of the European Research Area. It conceives and implements the necessary Community actions, in particular the Framework Programmes in terms of research and technological development. It also contributes to the implementation of the “Lisbon Strategy” regarding employment, competitiveness at international level, the economic reform and the social cohesion within the European Union. The Directorate " Knowledge-based economy and society" (Directorate K) contributes to the realisation of the European Research Area in the fields of the social sciences, economic, science and technology foresight, and the respective analyses. To this end, it monitors and encourages science and technology foresight activities, conducts the economic analyses necessary for the work of the Directorate-General, and co-ordinates policy as regards the relevant political, economic, human and social sciences. It prepares the European reports on science and technology indicators, and it contributes to the development and implementation of the Framework Programmes in these fields. It monitors the progress made in the implementation of the Lisbon strategy. Director : Jean-François Marchipont. Head of Unit “Competitiveness, economic analysis and indicators”: Ugur Muldur Scientific Officer: Brian Sloan [email protected] http://www.cordis.lu/rtd2002/indicators.html 4 PREFACE This report is one of a series of 9 volumes presenting the results of a study whose aim is to trace and quantify the linkages between scientific disciplines and fields of technology1. The study focuses on a specific form of S&T interaction: the presence of scientific research in the “prior art” description of a patented invention. It is a direct form of S&T interaction, which may be grasped through rich, plentiful, and directly available and accessible data: patents and publications. Establishing this bridge enables one not only to trace the linkages between specific fields of technology and related science disciplines, but also to measure the science intensity of such linkages (in general, a higher number of science citations is observed when a particular technology field is more sciencebased). Moreover, as most scientific articles are published by universities and public research centres, and most patents are granted to industry, these linkages may provide an insight into the effectiveness of the interface between publicly funded research and the industrial exploitation of science. The study forms a starting point for the qualitative understanding of the science-technology interaction by revealing networks and crossroads of scientific and technological activity. It may also provide insights into the rate and the speed of science diffusion into technology, as well as into commonalities within the science base that are relevant across different technologies, and that can help us to understand which technologies interact with one another to breed new hybrids. The study is presented in 9 volumes : Volume 1: Volume 2: Volume 3: Volume 4: Volume 5: Volume 6: Science and Technology Interplay : Policy relevant findings and interpretations Methodological Framework Leterature Review Detailed analysis of the Science-Technology Interaction in the field of Aeronautics & Space Detailed analysis of the Science-Technology Interaction in the field of Biotechnology Detailed analysis of the Science-Technology Interaction in the field of Environmental Technology Volume 7: Detailed analysis of the Science-Technology Interaction in the field of Information Technology Volume 8: Detailed analysis of the Science-Technology Interaction in the field of Telecommunication Volume 9: Detailed analysis of the Science-Technology Interaction in the field of Nanotechnology 1 This study was financed by the European Commission contract number ERBHPV2-CT 1993-03 under the activity “Common Basis of Science, Technology and Innovation Indicators” (CBSTII) of the 5th RTD Framework Programme. 5 6 TABLE OF CONTENT PREFACE ...........................................................................................................................................1 ABBREVIATIONS .............................................................................................................................9 GENERAL INTRODUCTION ..........................................................................................................10 PART I: SCIENCE, TECHNOLOGY AND ECONOMIC GROWTH – A SYNOPSIS OF RELATIONS ..............................................................................................................13 CHAPTER 1 - A BRIEF INTRODUCTION TO THE LITERATURE ON TECHNOLOGICAL CHANGE AND ECONOMIC PERFORMANCE .................................................................................................13 CHAPTER 2 - THE DIFFERENT CONCEPTS AND THEIR RELATIONS .....................................................16 I. Science and Technology ................................................................................................................................... 16 II. Research and Development............................................................................................................................. 16 III. Innovation and technical change ................................................................................................................... 17 CHAPTER 3 - THE ROLE OF KNOWLEDGE IN TECHNOLOGICAL AND ECONOMICAL PROGRESS ...........19 I. Introduction ....................................................................................................................................................... 19 II. Tacit and codified knowledge ......................................................................................................................... 19 III. Knowledge as a productivity factor............................................................................................................... 20 IV. Diffusion of knowledge ................................................................................................................................. 21 V. From the ‘linear model’ to the ‘network model’ of knowledge production, transfer and use ..................... 22 CHAPTER 4 - ECONOMICS OF SCIENCE: CONTRIBUTION AND ASSESSMENT ......................................23 I. Academic versus industrial research ................................................................................................................ 23 II. The contributions of academic research to industrial innovation.................................................................. 24 III. Assessment and rationalisation of government-supported scientific activity.............................................. 25 CHAPTER 5 - THE NEED FOR PRIORITISATION IN SCIENCE AND TECHNOLOGY ................................29 PART II: SCIENCE AND TECHNOLOGY - EXAMINATION AS TWO SEPARATE SPHERES30 CHAPTER 6 - EXAMINING SCIENCE AND TECHNOLOGY ...................................................................30 I. Introduction ....................................................................................................................................................... 30 II. Patents and publications as output indicators ................................................................................................. 32 III. Some considerations around the use of S&T indicators............................................................................... 33 CHAPTER 7 - PATENTS AND MEASUREMENT WITH PATENTS............................................................34 I. Patents, the patenting system and its economic rationale ............................................................................... 34 I.1. A patent and its economic rationale .......................................................................................................... 34 I.2. Patenting systems....................................................................................................................................... 36 I.3. Patent data sources..................................................................................................................................... 37 II. Patent statistics and trivialities around patents............................................................................................... 38 II.1. Rationale of patenting: patenting behaviour............................................................................................ 39 II.2. The Classification problem ...................................................................................................................... 42 II.2.1. The International Patent Classification (IPC) .................................................................................. 42 II.2.2. Relating patents to economic categories .......................................................................................... 42 II.3. The problem of the ‘value’ of patents...................................................................................................... 44 III. The use of patent statistics for technology measurement ............................................................................. 47 III.1. Measuring and comparing with simple output indicators; ‘foreign’ patenting .................................... 47 III.2. Specialisation indices.............................................................................................................................. 48 III.3. Impact indicators ..................................................................................................................................... 51 III.4. Maps of technology................................................................................................................................. 51 7 CHAPTER 8 - BIBLIOMETRIC INDICATORS AND ANALYSIS OF RESEARCH SYSTEMS ...........................52 I. Introduction to ‘bibliometrics’ ......................................................................................................................... 52 II. Bibliometric information: publications and data sources............................................................................... 53 II.1. Publications as data .................................................................................................................................. 53 III. Limitations to the use of bibliometric data.................................................................................................... 56 III.1 Completeness of bibliometric data .......................................................................................................... 57 III.2. Coverage of scientific literature databases............................................................................................. 57 III.3. The problem of ‘statistics’ ...................................................................................................................... 58 IV. Citation analysis ............................................................................................................................................. 59 IV.1. Introduction ............................................................................................................................................. 59 IV.2. The use of citations as ‘information’ ..................................................................................................... 60 IV.3. Basic assumption and problems around citation analysis ..................................................................... 60 IV.3.1 A general overview........................................................................................................................... 60 IV.3.2. Detailed discussion: basic assumptions underlying citation analysis............................................ 61 IV.3.3. Possible problems/pitfalls related to ‘Citation Analysis’............................................................... 62 V. Use of bibliometric data in a science policy context ..................................................................................... 65 VI. Bibliometric performance indicators............................................................................................................. 66 VI.1. Indicators of publication output and productivity ............................................................................. 66 VI.2. Impact indicators..................................................................................................................................... 67 VI.2.1. The relationship between ‘quality’ and ‘impact’........................................................................... 67 VI.2.2. Short-term vs. long-term impact .................................................................................................... 68 VI.2.3. Actual vs. expected impact............................................................................................................. 69 VI.2.4. Overview of impact indicators ....................................................................................................... 70 VI.2.5. Graphic representation of impact analyses ..................................................................................... 72 VI.3. Indicators of collaboration...................................................................................................................... 73 VI.4. Indicators of scientific activity ............................................................................................................... 74 VI.5. Intermezzo: one-dimensional vs. two-dimensional measurement and assessment.............................. 75 VI.6. Mapping of science: techniques and utility for science policy purposes.............................................. 76 VI.6.1. Mapping of science: an introduction............................................................................................... 76 IV.6.2. Several techniques ........................................................................................................................... 77 IV.6.3. Utility of mapping exercises for science policy purposes.............................................................. 80 PART III: SCIENCE AND TECHNOLOGY - EXAMINATION OF THE INTERACTION........82 CHAPTER 9 - LINKING SCIENCE TO TECHNOLOGY ...........................................................................82 I. General introduction ......................................................................................................................................... 82 II. Background on the science and technology interaction................................................................................. 82 II.1. Importance of the S&T interaction .......................................................................................................... 82 II.2. Science and technology interrelation: who leads and who follows?...................................................... 83 A. The changes in the knowledge production and diffusion system.................................................. 83 B. Discussion of the science – technology interaction in the light of cross-citation analysis........... 85 III. Implications from a policy perspective ......................................................................................................... 86 IV. Exploration of the science – technology interaction .................................................................................... 87 I. Direct/explicit S&T interrelations.................................................................................................... 89 IV.1. Citations to scientific publications in patent documents....................................................................... 89 IV.1.1. Non-patent references (NPRs)......................................................................................................... 90 IV.1.2. Differences in citation intensity ...................................................................................................... 93 IV.1.3. Interpretation of the science – technology linkage based on citations .......................................... 96 IV.1.4. Relevance of direct linkage approach from a policy perspective .................................................. 97 IV.1.5 Science – Technology linkage indicators ........................................................................................ 97 II. Indirect/implicit S&T interrelations ............................................................................................. 110 IV.2. Patents of scientific institutions and publications of industrial enterprises........................................ 110 IV.3. Tendency to integrate scientific and technological activities ............................................................. 112 IV.4. Co-activities (joint activities) between scientific institutions and industrial enterprises................... 112 IV.5. Parallel observation of patents and publications ................................................................................. 113 IV.6 Cartographical approach based on co-occurrences of publication and patent keywords.................... 114 APPENDIX I: KEY CHARACTERISTICS OF SEVERAL EMPIRICAL STUDIES ON S&T ANALYSIS..............115 REFERENCES ................................................................................................................................120 8 ABBREVIATIONS CHI ISI EC EPC EPO NPR SCI USPTO S&T DST REFI FhG Computer Horizons Inc., Haddon Heights, NJ, USA Institute for Scientific Information European Community European Patent Convention European Patent Office Non-patent reference Science Citation Index United States Patent Office Science and Technology Decision Support Tool REference FIle (document citations of the European patents) Fraunhofer Gesellschaft 9 GENERAL INTRODUCTION The purpose of this first chapter is to set the stage for the remainder of the report, and, even more important, to provide the reader with a number of points of departure that should be kept in mind when reading through the rather detailed and extensive description of the existing literature in the areas of sciento-, techno-, and econometrics presented in this report. In the early work of Adam Smith and Karl Marx science and technical change have been considered as driving factors behind economical progress. Adam Smith pointed out that ‘each individual becomes more expert in his own peculiar branch, more work is done upon the whole, and the quantity of science is considerably increased by it’. The science factor underlying growth and wealth is apparent. Ever since the seminal works of Joseph Schumpeter (see Hanusch (1999) for a comprehensive overview), the role of and the interaction between science and technology (S&T) in explaining patterns of economic growth and development have received widespread scholarly attention (see for example Nelson and Winter, 1982; Scherer, 1985; Dosi and Fabiani 1994; Freeman, 1994; Silverberg and Soete, 1994; Nelson, 1994 or more recently, Dosi, 2000). Technical progress and technical change therefore have become fundamental issues in many studies on innovation and economic development (Freeman, 1994; Grupp, 1998; Dosi 2000). Scientific studies have explored, developed and adopted complementary approaches to understand the intertwined dynamics of technological progress and economic development, including the effects of the S&T interaction on these dynamics. This S&T interaction is, however, far from linear and sequential. Instead, it is dynamic, heterogeneous, and highly complex. Although the systems of science and technology are since long assumed to be converging (Toynbee, 1963), this convergence is at best complex and requires a detailed understanding of its modus operandi (Freeman, 1982). Toynbee (1963) compared the S&T interaction to a ‘pair of dancers.’ De Solla Price (1965) uses the same simile of dancing partners. Only in his metaphoric adaptation, they are dancing to the same music, having their own steps. The emergence of biotechnology (Bud, 1994), a prototypical ‘sciento-technology’, has further stimulated scientific inquiry into technology’s increasing dependence on scientific discovery. Especially in policy oriented (research) circles, this dependence has been subject to much examination and speculation. Processes of ‘knowledge creation’ and the variety of possible modes of ‘knowledge diffusion’ are elements, even central themes, in the ongoing debate on science, technology, and innovation and their interaction (Gibbons et al., 1994). The nature of the knowledge generation process itself is evolving towards more network-embedded structures, with more emphasis on partnerships, the interplay between knowledge demand and supply, as well as a growing transdisciplinarity and heterogeneity of the actors involved in the dynamics of technology convergence. The network-embeddedness of the processes of knowledge creation and diffusion is becoming increasingly manifest in phenomena like universityindustry interactions and the advent of technoscientific breakthroughs like “nanotechnology” and “biology in silico.” 10 Since a couple of decades, it has become obvious that the linear model of the knowledge creation, transfer and diffusion “value chain” is no longer valid since it does not capture the current complexity and multiplexity of the relationships involved along the chain. This is despite the fact that some technological fields, with biotechnology as a prototypical example, are heavily based on scientific discovery. This complex and multiplex nature of the S&T interaction triggered the theoretical development and the empirical testing of knowledge production functions (Grilliches, 1990). Before one will be able to model an entire knowledge production function, a more detailed understanding of the interaction between science and technology is needed. A central issue hereby is the quantification and the modelling of the complex web of linkages and interactions between S&T development and progress. In this report, we attempt to describe the ‘state of the art’ on science, technology, and their interaction. During the past two decades, wide-ranging socio-economic and technological transformations have caused European governments to reformulate their policies concerning government-supported scientific activity. This reformulation has been accompanied by shifts and even complete turnarounds in research funding across fields of inquiry as well as concerning the orientation of the research conducted (basic versus applied). The present constraints on public expenditures in general, the enormous investments involved in sustaining the econo-techno-scientific complex, and the actual debate on the effectiveness of government supported scientific research, all increase the need for more accountability and effectiveness in the area of publicly funded research (Hanusch, 1999; Ziman, 1994; Moed, 1989). Therefore, at a policy level, disentangling the S&T interleaving is assumed to lead to considerable insights as to how to handle those challenges. More precisely, it can lead to an ex-ante decision support instrument and an ex-post evaluation tool. The purpose of this report is to provide the theoretical background upon which the further discussion on science – technology interaction can be based. The structure of this theoretical background is as follows. The report consists of three broad parts. Part I presents and discusses some major theoretical contributions on the interrelation between science – technology – economical development thereby paying substantial attention to the role of knowledge as one of the production factors. Part II, one of the key parts of this report, discusses different methodological approaches for measuring, evaluating, and as such reviewing science and technology independently. The focus lies on patents and publications as respective proxies of technology and science. In part III of this report, the central issue of the present project, the science – technology interrelation, will be extensively discussed. In figure 1 we present the internal structure of this report and the theoretical building blocks it consists of. 11 Figure 1 - Internal structure of this report PART I Science, Technology and Economic Growth - A synopsis of relations PART II Science and Technology - Examination as two separate spheres Foresight Technology Science measurement PART III Science and Technology - Examination of the interaction Science Technology 12 Part I: Science, Technology and Economic Growth – A synopsis of relations CHAPTER 1 - A BRIEF INTRODUCTION TO THE LITERATURE ON TECHNOLOGICAL CHANGE AND ECONOMIC PERFORMANCE Before a detailed elaboration on the actual subject of our study, the science and technology interface, we will provide the reader with a short overview of a number of approaches, contributions, in regard with the impact and the nature of the relation between technological change and economic performance. It stands to reason that understanding the nature of the interaction offers a scala of possibilities for policy makers to actively intervene in this interaction. There is a wide consensus concerning the importance of the technical progress and its contribution to economic development. Technical progress and change are fundamental issues in economics (Grupp, 1998). Mansfield (1969) states: ‘Technological change is an important, if not the most important, factor responsible for economic growth’. A number of schools have developed complementary approaches in order to understand the dynamics of technological progress and economic development. There are several possibilities for differentiating among the existing literature that contributed to a greater understanding of the technology – economy interaction. A distinction can be made between Neo-classical approaches (Schumpeter, Arrow, Von Hayek and others) and the Institutional or Evolutionary approaches (Freeman, Dosi, Pavitt, Nelson and others). An unambiguous distinction between neo-classical and evolutionary approaches, or theories on innovation research, is not feasible. This is partly due to the non-contradictory nature of both approaches (Grupp, 1998). Grupp used three pairs of opposites in order to categorise the various approaches. The first pair of opposites relates to the view on technological change as exogenous or endogenous. The second opposite concerns the description of the economic system by equilibrium states or developing processes. The last pair of opposites encompasses the interaction of economic actors based on rational decision theory or on selfregulating empirical processes. The exogenous/endogenous criterion refers to the process of technology genesis and its determinants. An endogenous approach, closely linked to the knowledge creating process, considers technological change to originate within the economy and society. Amable (in: Silverberg & Soete, 1994) states that endogenising the technical progress implies focussing on learning effects as a source of improvement in technology. In a typical neo-classical approach, technical change is made endogenous because economic agents choose to allocate certain amounts of resources to its development (for instance R&D expenditures). However, technologies are conceived, developed and diffused under multiple economic constraints (OECD, 1992). Nelson (1994) differentiates between two aspects of economic growth modelling that make technological advance endogenous. First, R&D investments are profitable for firms because they can make proprietary at least a part of the value of the increased productivity or better product performance won through R&D. Secondly, to deal with the recognition that technology is in a way proprietary and that support of R&D is possible only if price exceeds cost of production, markets are assumed to be imperfectly competitive. The neo-classical growth theory acknowledges the centrality of technical change in the growth process (Nelson & Winter, 1974) 13 The second criterion differentiates among various approaches by emphasising the time dimension of an innovation event. The neo-classical approaches pay significant attention to the properties of the equilibrium state but less attention to the aspects of how the system has arrived at that point. In the evolutionary and institutional approaches, also referred to as new neo-classical approaches, the relevant processes and pathways are defined carefully in contrast to the specification of the final state. The last criterion deals with whether a system is regarded in terms of carefully defined, stable relationships between its constituents, as in the rational actors approach, or that the whole cannot be so reduced. Instead, the properties of complex systems develop from non-linear interactions between their selfregulating components. Although neo-classicism and institutional economics share many key areas, the latter is primary concerned with treatment of the following subjects (Grupp, 1998): § The formation of institutions § The variable relationship between the economics and legal systems § The effects of technical change on institutional structure Discussing all the contributions from either the neo-classical scholars either the institutional or evolutionary scholars, would lead us too far. However, special attention should be paid to the contribution of Freeman and his colleagues from the SPRU University in Sussex, who are regarded as the grounders of the Institutional innovation theory. By analysing the most intensive twentieth century branches of the economy (chemicals, mineral oils, plastics) Freeman points out that industrial innovative activity is always professionally managed and that technological development is increasingly dependent on scientific knowledge. However, according to Freeman, the professionalization of the R&D system is not the only agency that affects the increasingly scientific nature of technology. Other factors are growing complexity, highly developed system and network techniques and the general trend towards division of labour and specialisation. Pavitt (1984), in a study aiming at identifying and describing sectoral patterns of technical change, introduces the ‘science based’ sector, which can mainly be found in the chemical and electronic/electrical sectors. Innovations are directly linked to scientific progress. Bart Verspagen (In: Silverberg & Soete, 1994) categorises the empirical literature on the interaction between technology and growth mainly into two streams. The first one has its roots in the notion that R&D can be viewed as an additional production factor (see also the work of Arrow, 1994) entering the production function much in the same way as various types of fixed capital or labour. Mohnen (1992) discusses this approach in more detail. A second stream tries to establish an empirical relation between knowledge accumulation and growth of output or productivity, focusing on cross-country samples. Inter-country knowledge spill over is the most important factor leading to knowledge accumulation and thus growth. For a detailed and excellent overview of growth literature in retrospect, we can refer to Nelson (1994) or Dosi & Fabiani (1994) and Grupp (1998). 14 Silverberg and Lehnert (in: Silverberg & Soete, 1994) formulate four basic characteristics of innovation and productivity growth, to which researchers of all persuasion would probably subscribe. I. The innovation process is inherently uncertain and irregular, and neither the exact nature nor the timing of innovations is under the control of actors. II. Although the Schumpeterian description of entrepeneurship hinges on imperfect competition and appropriability, technical progress has very rarely been monopolisable for any length of time by a single party. Instead, it has been the joint product of the actions of many agents and therefore only imperfectly appropriable at best. III. The productivity enhancing effects of innovations only become manifest during diffusion. Investment and capital formation, learning and organisational change thus play an important role in translating innovation into ongoing technical change. Furthermore, diffusion often requires considerable time. IV. It follows from the simultaneity of diffusion processes that the economy at any time will consist of a superposition of different technologies inherited from the past, as well as a multitude of ‘cutting edge’ technologies competing for dominance on the frontier. The nature of competition determines replacement dynamics and choice of technique. One of the conclusions based on these four general points is that innovation in a multi-actor and thus dynamic environment is to a certain extent uncontrollable. Furthermore, the effects of innovation on economical growth become manifest only after diffusion of knowledge incorporated in the form of an innovation. Knowledge and the diffusion of knowledge are thus crucial in realising economic growth. Stoneman (1987) states that technology will have impact on productivity, trading performance, employment, investment, income distribution, quality of goods, growth, inflation, environment, security and defence and the industrial structure of an economy. Some impacts may be beneficial while others may be harmful. The effect on national growth in GDP (per capita) has been discussed in detail by Fagerberg (1994). He states that national growth curves are determined by the level of education and training, and by the level of R&D, consisting of scientific research and technological development. Narin (1994), who considered publications and patents as proxy-measures of knowledge production, already found this positive effect of R&D on Gross Domestic Product. Only by understanding the whole mechanism can policy makers construct policies to discourage harmful effects and to encourage beneficial effects. According to the 1998 report of The National Science Board on science and engineering indicators, the relationship between science and technology on the one hand, and economic development on the other hand, appears to be self-propelling. Science and technology lead to competitiveness and commercial success. This in turn generates additional resources that can be spent on more scientific research and technological development, thus closing the circle. 15 CHAPTER 2 - THE DIFFERENT CONCEPTS AND THEIR RELATIONS I. Science and Technology The first issue to be addressed concerns the definition of both science and technology. The term science is understood to cover the creation, discovery, examination, classification, reorganisation and dissemination of knowledge on physical, biological or social subjects. In a recent study carried out by Senker et al. (1999), in the context of a TSER-project on the European comparison of Public Research Systems, research has been defined as ‘original investigation undertaken to acquire new knowledge in the natural sciences, social sciences and humanities’. Technology is the science application know-how. In other words, it emphasises the blueprint of the application of knowledge. As such, it belongs to a larger group of activities that embrace the creation and use of artefacts, crafts and items of knowledge as well as various forms of social organisation (Grupp, 1998). Mansfield (1969) points out that it is important to distinguish between technological change and scientific advance. Pure science is directed towards understanding, whereas technology is directed towards use. Technological change or evolution as a result of inventions is not necessarily based on new scientific principles. It can also be based on alternative evolutionary use of existing knowledge. One of the key terms in the above-presented definitions is ‘knowledge’. Science includes processes of knowledge creation and diffusion. Technology on the other hand focuses on the application or usage of the created knowledge. The research process is of major importance for the materialisation of innovations. This is a widely accepted principle. In present research statistics, a distinction can be made between fundamental or ‘basic’ research, applied research and experimental development. These three approaches are often combined under the heading ‘research and development’ and will be discussed further in the next paragraph. II. Research and Development ‘Research and Development is systematic, creative work that advances the state of our knowledge, whether in connection with man, culture or society and uses this knowledge to identify new potential applications’ (Grupp, 1998). The Frascati manual issued by the Organisation for Economic CoOperation and Development (OECD, 1994b) provides definitions and conventions for the measurement of research and experimental development (R&D) aiming at a form of European standardisation. In the Frascati manual, R&D comprises the creative work undertaken on a systematic basis in order to increase the stock of knowledge, including knowledge of man, culture and society, and the use of this stock of knowledge to devise new applications. This definition is very similar to the definition used by Grupp (1998). As mentioned above, R&D covers three activities: basic research, applied research and experimental development. These three activities will be discussed below. Basic research or fundamental research refers to experimental or theoretical work, geared ‘primarily’ to the acquisition of new knowledge about the basic origin of phenomena and observable events without targeting a particular application of use. It focuses on the advancement of knowledge without any economic or social targets and it is certainly not dedicated to solving practical problems. The term 16 application-oriented fundamental research is used in situations where basic research targets certain areas of general interest or is focused in their direction. Applied research is predisposed to specific and practical purposes or objectives. It also includes a new knowledge generation process but always in regard of the practical application. The results of applied research are intended to be valid for a limited range of products of processes. Experimental development is systematic work structured on existing knowledge with either research of empirical origin, and which is directed towards production of new materials, products, equipment or the installation of new processes, systems or services. Research and development is perceived as the driving force behind the innovation event. Science and technology need to be distinguished from research and development. Scientific research, in the sense of research carried out by academia, can be regarded as taking place outside the company, in the public domain (cfr. section 5.I). The latter contrasts with research and development itself, which can be attributed more to the private domain (Grupp, 1998). In view of the impact of knowledge production on the economic growth, discussed extensively among others by Arrow, Amable and Verspagen (In: Silverberg & Soete, 1994), R&D can be regarded as an identifiable economic activity. Due to its characteristics, science is increasingly being reviewed as an economic activity as well, as it can lead to the introduction of return requirements (financial and non-financial) on public and private funding of science (see chapter 5). For a more detailed discussion of R&D in terms of activities covered, statistical measurements and budgeting issues, we refer to the Frascati manual (OECD, 1994b). III. Innovation and technical change A third important concept related to science and technology in a broader sense, is innovation. Several definitions on innovation can be found in literature. Grupp (1998) defines it as follows: ‘Innovation relates to the attained quantity of ideas’. Verspagen (In: Silverberg & Soete, 1994) associates innovation with an original contribution to the stock of knowledge in the economy. Innovations can occur in the form of new consumer goods, new production of transport methods, new markets or new organisations. In the Frascati manual (OECD, 1994b) innovation concerns new products and processes and significant technological changes in product and processes. An innovation is implemented, if it has been introduced on the market (product innovation) or used within a production process (process innovation). Innovations therefore involve a series of scientific, technological, organisational, financial and commercial activities. It is beyond discussion that innovation can basically be characterised by new ideas, concepts, applications and, generally speaking, new knowledge finally contributing to a new product of process, including service-processes. The result of the innovation process, a new product or process, can be characterised as ‘innovation’. As Grupp (1998) points out, the novelty concept can be regarded as a synonym for the innovation concept. The R&D activity is just one of the activities of the complex process of innovation. R&D can be used in any phase of the innovation process, not only as an original source of inventive ideas, but also as a catalyst for solving problems that can occur at any stage in the implementation process. Besides R&D as an activity in the whole innovation process, six fields of innovative activities are often distinguished (OECD, 1994b): 17 § § § § § § Tooling-up and industrial engineering Manufacturing start-up and pre-production development Marketing for new products Acquisition of disembodied technology Acquisition of embodied technology Design Another concept, closely linked to innovation, is imitation. Imitation can be understood as the diffusion of knowledge to agents other than the original innovator. As we shall also see in the discussion of the different theories on technology and economic growth in the next chapter, innovation and imitation represent different aspects of technological change and also lead to different economical results. Research and development is associated with both innovation and imitation, as it can be directed towards the creation of new knowledge and/or imitation. This also implies that behind R&D lies a set of profound choices. Verspagen (In: Silverberg & Soete, 1994) offers another view on innovation in relation to imitation. He connects imitation to knowledge spill over and innovation to patenting and knowledge creation. The most important criterion for an innovation to be patented is undoubtedly ‘novelty’. This brief discussion of the concepts allows us to conclude that a clear and unambiguous distinction between the concepts is a rather difficult task. This is partially due to the fact that the explanation of the concepts is largely contingency-based. Science and technology are two different sides of the same coin. Science aims at the expansion of the knowledge base, whereas technology refers to the application of knowledge in order to realise an innovation, a new product or process. R&D, which is only one of the activities of the process of technological innovation and clearly has an economic impact, seems to be the trigger for research and at the same time for the creation of new applications. Finally, all this may result in a single innovation, the result of the whole process of technological innovation. In the subsequent discussion, this will be the context in which the above-mentioned concepts will be used. 18 CHAPTER 3 - THE ROLE OF KNOWLEDGE IN TECHNOLOGICAL AND ECONOMICAL PROGRESS I. Introduction As we have seen in the concepts of science, technology and innovation, knowledge plays an important role in technological progress and subsequently in economic growth. Knowledge is undoubtedly connected with the idea of technical advance. According to Gibbons et al. (1994) the very nature of knowledge is evolving to a more network-oriented structure, with greater emphasis on strategic alliances, knowledge demand and supply chains and a growing transdisciplinarity and heterogeneity. As a consequence, the identification of scientific priorities in the actual socio-economic environment becomes increasingly complex. The term ‘knowledge society’, recently introduced by Drucker (1993), reflects the increasing importance attached to knowledge as the key resource for progress, and not just as another resource alongside traditional factors of production - labour, capital and land (cfr. section 4.II.). The increasing importance of knowledge as a competitive resource has stimulated and enhanced the debate on the mechanisms and processes by which knowledge is created, both at individual and organisational level (Nonaka and Takeuchi, 1995). Understanding these mechanisms is a complex challenge, given the blurred boundaries between tacit and codified knowledge and their complementary and dynamic nature. II. Tacit and codified knowledge Codified knowledge is knowledge that is ‘transmittable in formal, systematic language’ (Senker, 1993, 1995). It is mainly embodied in manuals, textbooks, scientific and technical journals, technical specifications of materials or components, operating manuals for commercial process plant and research equipment. Codified knowledge is an important component of knowledge creation, although it is not easily accessible by individuals and organisations. Martin et al. (1996) point out that in order to interpret en derive value from the growing pool of codified knowledge, organisations have to invest substantially in research capacity. Codified knowledge becomes increasingly available and accessible. Due to the advances in information technology (David and Foray, 1995) and the new combinations of existing stocks of knowledge stored in different electronic resources, a significant variety and novelty may be generated. The process of codification is thought to be diminishing the conventional borders between science and technology and to be increasing the extent of knowledge formalisation, with direct impact on the production of new products and processes, facilitated by advanced simulation in the design process. Codified knowledge is complemented by tacit or person-embodied knowledge, of which the essence can de defined as ‘One can know how to do things, without necessary being able to describe how’ (Arrow, 1994). The increased use of computer simulations and information technologies can be seen as a means of distributing tacit knowledge and person-embodied skills, rather than a new source of new codified knowledge. Senker (1995) argues that there are limits to the codification of knowledge, the process of conversion of tacit knowledge into codified knowledge. These limitations have a number of reasons, including: the difficulty of acquiring specific scientific or technological expertise within specific firms or sectors, the emergence of new technologies, and the need for interaction between producers and users of new technologies in order to appropriate tacit knowledge, skills and techniques. 19 Scientific research, especially through basic research activities, was considered to primarily produce codified knowledge, which could afterwards be used by other individuals or firms as a ‘public good’. Several authors claimed the importance of tacit knowledge and skills, accumulated during university education and further developed through industrial training, especially in emerging technologies and fast-moving scientific fields. Large amounts of tacit and codified knowledge are acquired by universitytrained scientists, researchers and engineers during their university education, and are further enhanced during industrial in-house training provided by the firms who recruit them. For an excellent discussion of the knowledge creation process on company level, including group-level, we refer to the work of Nonaka and Takeuchi (1995). III. Knowledge as a productivity factor From an economic modelling point of view, knowledge can also be seen as a productivity factor, in addition to the important factors labour and capital. Knowledge as a productivity factor can be related with input and output processes. Kenneth Arrow (1994) discusses knowledge as one of the major input factors for the production of goods. This principle can also be applied to the production of services. Know-how about how to produce a certain product or service is a critical success factor. On the other hand, knowledge can also be regarded as output, a result of the transformation of goods. When producing a product or service, knowledge increases about the production process and its results, being the product or service. From a systemic point of view, this also increases the effectiveness of the feedback actions, thus enabling double loop learning2. Knowledge can therefore be seen as a by-product of the transformation process. In order to produce new knowledge, additional resources have to be combined, such as capital and labour and of course existing knowledge. Devoting more resources to the production of knowledge would then increase the rate of growth of the knowledge stock. However, an exogenous stock of knowledge exists as well, in the form of public knowledge. For example, the knowledge creating function of a university is partly based on the further development of existing knowledge. A large part of basic science is self-generated. But the resources devoted to science play an important role. And this is precisely the issue that is subject to policy decisions. In this context, it is essential to recognize the need for a sufficient and well-composed knowledge base that is necessary for acquiring and employing external knowledge in an effective and efficient way. It is important for a country or region to have a critical mass of research and production competencies. Especially in the case of exogenous knowledge input – for example through foreign investment in a country – a critical knowledge mass is indispensable in order to ‘absorb’ this external knowledge and ‘create’ additional knowledge on this basis (‘absorptive’ capacity). Without building a certain scientific level, a country cannot adequately participate in science-based technology (Schmoch et al., 1993). This finding was confirmed by Tidd & Brocklehurst (1999) in their study of the Malaysian government policy on high-tech stimulation. The elaborated government support for high-tech industries remained without effect, due to the fact that a critical mass of local knowledge was lacking. It takes serious efforts and time to build this critical knowledge mass in a specific industry sector. 2 Double loop learning is a concept linked to the control cycles. A control cycle of a production process consists of the activities: measuring input/output, evaluating, interfering and norming that is the generation of norms or parameters in order to control the process. A fifth activity that has the biggest learning effect and improves the whole control process is ‘evaluating’. Evaluation concerns the discussion and review of the existing norms based on experience and results. This last activity creates a double loop learning cycle (Van Amelsvoort, 1992). 20 The previous discussion of knowledge makes us conclude that knowledge is a multidimensional concept, which makes it inappropriate to only identify knowledge with the factor productivity. Various studies have looked at the positive effects of knowledge creation for the firm through scientific and technological activities. Deeds, DeCarolis & Coombs (1999) found that new product development capabilities are enhanced by the quality of the scientific team, under the condition that the scientist’s attention is not distracted from research by involvement in management issues. In an earlier study (Deeds, DeCarolis & Coombs, 1997), they found that the money raised in an initial public offering (IPO) was positively influenced by the quality of the scientific team, measured by the number of citations to their work. This has also been confirmed by McMillan, Narin & Deeds (2000). They attributed the importance of science for individual companies to the fact that social norms in science offer better knowledge protection than legislation. They also indicated the importance of social networks, with special attention to the role of joint research with the academic world. Allen, Booz & Hamilton (1982) found that the share of innovations in companies’ profits surged from a modest 20% in the 1970s to 33% in the 1980s. However, when emphasising the importance of internal knowledge creation, we cannot forget that process or product innovation, as an outcome of internal knowledge creation, is not the only way for firms to gain success. R&D is not the only factor through which a firm can achieve growth. Intelligent use of knowledge spillovers, leading to imitation, is another. This leads us to the more general methods of knowledge diffusion (cfr. infra). IV. Diffusion of knowledge Another important aspect in technological progress and evolution is the actual spread out of knowledge. Firstly, diffusion of knowledge enables technological progress, due to the availability of knowledge and its possible applications. Secondly, available knowledge will be used as a basis for creating new knowledge, re-generation of knowledge. Tacit or personal knowledge is, in contrast to codified knowledge, complex to transfer. There are several mechanisms of diffusion (Arrow, 1994): § Through some mechanical means § As a result of deliberate policy § By special activities of people in order to obtain knowledge With regard to the second group of mechanisms, there is one specific mechanism that should be emphasised: imitation. Verspagen (In: Silverberg & Soete, 1994) also discusses the importance of imitation for the spillover of knowledge. The firm goes out and looks at other firms and finds out what they are doing and what can be good for them. Firms watch their more sophisticated competitors in order to absorb and use the techniques of the latter ones. Not all knowledge can be diffused in an easy manner. Tacit knowledge, which is the more personalised knowledge, will be more difficult to transfer than codified knowledge, the more formalised knowledge. Learning by doing provides a possibility for transferring tacit knowledge. Innovation and imitation represent different aspects of technological change and also lead to different economical results. Research and development is associated with both innovation and imitation, as it can be directed towards the creation of new knowledge as well as towards imitation – see for example the practise of 'reverse engineering'. 21 V. From the ‘linear model’ to the ‘network model’ of knowledge production, transfer and use The traditional understanding of the contribution of basic research to industrial innovation is related to the ‘linear model’ of knowledge production and transfer. Steinmueller (2000) links this model to the view of science as a Social Instrument: a social investment in the production and dissemination of knowledge that is expected to generate economic returns as this knowledge is commercially developed and exploited. According to this approach, basic research is a source of ideas, theories and discoveries, that are further developed through applied research, then tested as a part of the ‘development’ process and translated into industrial innovations. Finally, through various stages of production a further materialisation in new products and processes takes place. This approach has been the basis for many economic studies and policy analyses. Some industrial sectors like biotechnology or the pharmaceutical industry, which rely heavily on advances in basic research, still reflect to a large extent the stages of the ‘linear model’. The usefulness of this model lies mostly in its functioning as a starting point, heuristic for further research and as a way of understanding the function of knowledge as input for industrial innovations. However, the linear approach is an oversimplified representation, ignoring the evidence that technological change is often built on experience and ingenuity, divorced from scientific theory or method and ignoring the role of technological developments in motivating scientific explanation and the technological sources of instruments for scientific investigation (Rosenberg, 1982) In the light of more recent studies, the relation between academic research and industrial innovation has been considered to evolve according to much more complex patterns of knowledge production, transfer and use, which the ‘linear model’ fails to capture. For instance, Steinmueller (1994) argues that the ‘linear model’ does not reflect the economic and social determinants of scientific research activity and overlooks the influence of technology on the scientific agenda. The ‘network model’ of knowledge production, transfer and use is likely to characterise more adequately the complex interactions between knowledge producers and users. Analysing the relations between basic research and industrial innovation, Steinmueller (1994) stresses the fundamental links between basic research and the economic and social determinants. He considers the fact that the ‘network’ approach opens new and useful economic perspectives, like the increasing network value with the number of participants, the decreasing rate of overlapping research projects through network centralisation, and the complementary investments for information dissemination that may lead to economic benefits. Information flows within the network appear as more easily accessible by governments and firms, increasing their choices about specialisation, co-operation and competition. The network model can be associated with a view of science as a Social Institution. Science is assumed to be a social institution of which the norms and practices are distinct from, and only partially reconcilable with, the institutions of market (Steinmueller, 2000). 22 CHAPTER 4 - ECONOMICS OF SCIENCE: CONTRIBUTION AND ASSESSMENT I. Academic versus industrial research The views on research presented so far have focused on the academic and industrial research communities without explicitly differentiating between those specific dimensions/activities. In the discussion of the concepts, we highlighted that R&D activities are more related to the private domain, in contrast to science, which is more related to the public domain. In this section, we will focus on science that is carried out by the public sector and financed with public funds. Universities and other higher education institutions can be considered as institutions in the public domain. Among the two basic functions, education and research, which universities and other higher education institutions perform and which are both essential in the process of innovation, it is academic research that is addressed specifically within the scope of our analysis. A further distinction will be made between two aspects of academic research, basic and applied research. This distinction comes mainly from their relation to commercial purposes. In the spirit of a traditional ‘division of labour’ between industry and academia, the largest part of academic research is concentrated in basic research, although considerable applied research efforts are undertaken in many academic departments, such as engineering, new materials or computer sciences. Basic research is considered to encompass: ‘...both ‘curiosity-oriented’ research (experimental or theoretical research undertaken primarily to acquire new scientific or technical knowledge for its own sake) and ‘strategic’ research (undertaken with some instrumental application in mind, although the precise process or product is not yet known).’ (Martin et al., 1996: 2) In ‘Industry’, R&D function of a firm, the same distinction between basic and applied research can be used (cfr. section 2.II.). It has to be stressed that the importance attached to performing basic research by companies lies in acquiring ‘a ticket of admission to an information network’ (Rosenberg, 1990) or in a prerequisite to better assess and perform applied research: ‘For one thing, firms often need to do basic research in order to understand better how and where to conduct research of a more applied nature... For another thing, a basic research capability is essential for evaluating the outcome of much applied research and for perceiving its possible implications... In an even more general sense, a basic research capability is often indispensable in order to monitor and evaluate research being conducted elsewhere... A firm is much less likely to benefit from university research unless it also performs some basic research.’ (Rosenberg, 1990: 170-1) A sharp distinction between basic and applied research is often very difficult to make, given the very high degree of interaction between them, especially in some areas and disciplines, like health, medicine and agriculture. Pavitt (1997) distinguishes between academic and industrial research and development, by examining their purposes. In his view, academic research as a main source of codified knowledge is aimed at explaining and predicting reality, while industrial research and development is mainly concerned with the design and development of artefacts, focussing on the more practical application. 23 There is a strong interaction between academic and industrial research. Academic research increases the research capacity of business research to solve complex problems and challenges. II. The contributions of academic research to industrial innovation In this section we will discuss how academic research, public research carried out by universities, contributes to technological progress. Following the title of this chapter, we will focus on the economic aspects, that is the aspects that influence economical progress through industrial innovation. Martin et al. (1996) classified the various forms of economic benefit from academic research into six broad categories, interconnected and mutually supportive. The six categories are: 1. 2. 3. 4. 5. 6. Increasing the stock of useful information; New instrumentation and methodologies; Skilled graduates; Professional networks; Technological problem solving; Creation of new firms. The importance of academic research in increasing the stock of useful information has been addressed in the previous section. Furthermore, we have to take into account the complex interaction process between the tacit and the codified knowledge that is transferred across University-Industry institutional borders. Traditional representations of this contribution tend to draw implicitly upon the ‘linear model’ as offering new technologies for use in industry, but the greater emphasis here on tacit knowledge transfer implies a greater emphasis on the mobility of skilled graduates that enable diffusion of tacit knowledge. As Meyer-Krahmer and Schmoch (1998) point out, there is a ‘two-way interaction’ between public and private knowledge generation and diffusion (cfr. section 4.IV.) New instrumentation and methodologies play an important role both as a source of scientific progress and as one form of economic benefit from academic research. Rosenberg (1992) considers scientific instruments as ‘capital goods’ of the scientific research industry, of which the development cannot be separated from advances in basic research. He identified two main diffusion flows of instruments: firstly, from one scientific discipline to another, and secondly, from basic research laboratories to industry. Scientific instruments have become almost indistinguishable from industrial capital goods (ibid.). The third benefit lies in skilled graduates, who not only bring knowledge of recent scientific research into industry, but who also have an ability to solve problems, perform research and develop new ideas. These skills, developed during their education, may be valuable. The process of graduates entering an industrial environment needs large investments (for example training in industrial practice). The provision of access to professional networks is a major benefit of academic research. Government funding allows individuals and organisations to participate in the world-wide community of research and technological development. Within the professional networks, publications are very important not only as a source of codified knowledge, but also for the tacit knowledge, which they signal. Publications help to consolidate the scientific credibility of researchers within firms and serve as a vehicle in the relation with academics, thus acting as an ‘entry ticket’ to professional networks (Hicks, 1995). 24 Rappa & Debackere (1992) showed that, although academics are more willing to publish their findings, industrial actors also seek an exchange of ideas within their technological communities. Network relations between academic and industrial actors are profound. Technological problem solving is the most widely recognised benefit from academic research. The extent of interaction between academic research and technological problem solving, is highly variable across research fields and industrial sectors, coming from their differing origins and development. A positive influence of academic research on the creation of new firms cannot be generalised (Salter & Martin, 1999), despite the substantial and varied links between different industrial fields and universities that sometimes lead to the creation of new companies aiming at further (commercially) exploiting an innovation (spin-offs, join ventures etc.). It cannot be concluded unambiguously that significant investment in basic research generates spin-off companies even if the correlation between university research and firm birth is positive. Martin et al. (1996) have identified three general methods for measuring the economic benefits of academic research: a) econometric studies; b) surveys and c) case studies. Econometric studies focus on large-scale patterns and are effective only in providing an aggregate picture of statistical regularities and in estimating the rate of return on research and development expenditures. A big disadvantage of this type of studies is that the results might be misleading due to the simplistic and unrealistic approaches. Surveys analyse the extent to which government-funded research constitutes a source of innovative ideas for firms. Surveys have focussed on the interaction between academic research in both its aspects (basic and applied) and industry. Case studies are the best tool for examining the innovation process directly. They generally provide support for the main findings from econometric studies and surveys. Until recently, few attempts were made to measure the rates of return on funded research and development. Mansfield (1991) made substantial contributions in measuring these benefits; he focussed on research within 15 years of the innovation under consideration. Using a sample of 76 US firms in seven industries, he found out that 11% of new products and 9% of new processes could not have been developed without academic research. Furthermore, he estimated the rate of return form academic research to be 28% (ibid., p.10). Narin et al. (1997) have developed a new approach to evaluating the benefits of publicly funded research based on analysing scientific publications cited in US patents. For 42.000 papers with at least one US author, they determined the sources of US foreign research support acknowledged in the papers. Their conclusion was that, based on the increasing number of scientific references cited in patents, the knowledge flow from US science to US industry has multiplied. Of course some critical notes can be placed. III. Assessment and rationalisation of government-supported scientific activity During the past two decades, wide-ranging socio-economic and technological transformations have caused European governments to reformulate their policies for public sector funding (PSF). Such policies affect the organisation and location of research, the prioritisation in specific fields, the agencies responsible for funding research, and the mechanisms for allocating research funds. Determining the principles governing the allocation of resources to science as well as the management and consequences of the use of these resources, are the central issues of the economics of science (Steinmueller, 2000). 25 In a study performed by Senker et al. (1999) on Public Science Research (PSR) among 12 European countries, clearly illustrated that governments had either explicit or implicit expectations that their investments in PSR would achieve some of the following functions (in relative importance): i. The advancement of knowledge ii. The support of policy transformation and implementation iii. The support of public welfare (e.g. health, environment etc.) iv. The support of economic development (technology transfer) v. Programmes to build and support prestige activities and capabilities in ‘frontier’ science Furthermore, it appeared that all the countries traditionally investing in PSR aimed at advancing knowledge and promoting economic development, being central pillars in their science and technology policies. This illustrates the awareness of the impact of knowledge creation and distribution on the economical progress. Another interesting result of the study was the apparent growth in resources for ‘application-oriented’ research in new and key technologies, including biotechnology and the environment. It appeared that there have not only been shifts in allocation between fields of research, but also changes in the type of research funded. Let us take a look at the possible explanations for the changes in this funding behaviour. Beside the socalled ‘sophistication factor’ (Martin, 1996), i.e. the growing costs conducting scientific research − instrumentation, facilities and infrastructure to conduct frontier research became more expensive −, two main reasons can be identified for the fact that assessments of government-funded research have become increasingly necessary in recent years. The first reason has to do with the perceived increasing constraints on public expenditure, including spending on research and, as a result, the requirement for greater public accountability in all areas of public expenditure. The second reason relates to the increasing pressure on the peer review system that was normally used to address these types of problems (Martin, 1996; Moed, 1989). Both reasons will be discussed in more detail below. § Increasing constraints on public expenditure and requirement for greater accountability During the 1960s, funding of scientific research occurred in a relatively unrestricted way in most western countries. How to allocate an ever-growing science budget in a sensible way appeared to be the main problem. However, from the start of the 1970s on, greater selectivity concerning the allocation of research funds had to be displayed by government and industry (Moed, 1989). Science was no longer seen as cheap. Moreover, it does not offer any immediate return to an investor. As a result, science, requiring a total layout of several percent of a country's Gross National Product, found itself in serious competition with other desirable items of public or private expenditure. Science entered in a ‘steady state’. As such, funding of scientific research has to be justified in this context (Ziman, 1994). Since the resources needed to develop a ‘science base’ can no longer be regarded as a marginal item in the national budget, science will have to compete with rival institutions, if it aspires to gain a bigger piece of the pie (Ziman, 1994). In this context, demands for evaluation and for performance indicators are put forward to assure both the government and the public that public money is being spent well (Martin, 1996). At the end of the 1980s and in the 1990s, although the economic situation in many western countries improved significantly, the general attitude on the funding of scientific research activities did not change very much. Nowadays, there is less willingness to fund research ‘carte blanche’. As Moed (1989) indicates, ‘value for money’ is often the main objective. Greater emphasis is placed on 26 selectivity and competition of research programmes for funding. Consequently, systematic evaluation and monitoring of scientific programmes became customary. Under these circumstances, the publicsector funding agencies search for a more efficient resource allocation to scientific and technological activities, for better meeting the nation’s economic and social needs3. If science is instrumental in technological progress and ultimately in economic growth and prosperity, as indeed we have seen, then the economic theory of resource allocation should be applicable to science (Steinmueller, 2000). § Pressure on the peer review system In order to be able to obtain a truthful evaluation and monitoring of a particular scientific programme, information on, for example, the quality or benefits of such programmes is needed. Traditionally, this information was provided by the scientists themselves, and this seemed to work well in periods such as the 1950s and 1960s, when government spending was increasing by five to ten percent a year. However, the pressure on the peer review system increased in periods that were characterised by an essentially level budget. Evaluation of scientific research was always done in a rather qualitative way, mostly by means of expert panels. Peer judgements are assessments of the ‘quality’ or ‘significance’ of scientific research performance, as perceived by experts in the particular field involved (Moed, 1989). Peer evaluation is thus inherently based on scientists' perceptions of contributions by other researchers. As such, statements of well-informed experts are clearly prone to subjectivity (Tijssen, 1992). Judgements from the peer review system have been criticised on several occasions, both from a reliability and a validity point of view. In short, Martin (1996) identifies three main problems. First, due to the political and social pressures within the scientific community, such as the increasing competitiveness and the concentration of research resources in an ever-smaller number of large research centres, it becomes more difficult to find truly impartial peers. Second, peers in different cognitive and social locations may evaluate scientific contributions quite differently. Third, peers will often base their judgement on limited or imperfect information on the contribution being evaluated. As a result of these and other problems, their perceptions may provide no more than a partial indicator of scientific contributions. Nowadays, as research funding is supposed to cover wider areas of scientific activity, the expertise of scientific peers is considered to be a base too small and too subjective to found allocation decisions on (Tijssen, 1992). As a consequence, the need arose to develop a method for evaluating relevant aspects of the input, throughput, and output of scientific research in a more ‘objective’ − that is, a more quantitative − way. Hence, analytical tools based on readily available quantitative data such as, in particular, bibliometric information, were developed to be used complementary to expert judgements (Tijssen, 1992; Moed, 1989). Since the beginning of the nineties, a considerable amount of experience has been gained in applying bibliometric analysis4 in the assessment of research performance (Van Raan, 1997). Following the trend towards an increasing quantification of science, the use of bibliometric analysis for the evaluation and monitoring of scientific outputs has become widespread (Tijssen, 1992). Partly due 3 The study by Senker et al. (1999), pointed out that most countries seem to focus their science policy on four main themes: evaluation, coordination, prioritisation and technology transfer. Especially prioritisation is developing slowly through ‘Prospective’, ‘Forecasting’ and ‘Foresight’ activities. 4 Bibliometrics and the different techniques involved for measurement and analysis of science will be the central issue of chapter 4) 27 to the existence of large computerised bibliographic databases, bibliometric research performance analysis can be characterised as a methodologically and technically sophisticated field, as shown by the variety of indicators and its application to different levels of aggregation. Recently also, more and more bibliometric applications become apparent in fruitful combination with assessments by means of peer review (Van Raan, 1997). Science as a social institute has a high impact in many parts of society. As such, the assessment of scientific activity and the prioritisation in science policy become more and more a public subject. Certain European countries, especially The Netherlands and Scandinavia, which have a tradition of public participation, are involving the public more in debates and priority setting on scientific and technological issues. This has been described as a ‘post modern’ research system (Rip, 1992). Apparently, apart from the increasing globalisation and world economic competition, a number of drivers has put increasing pressure on government spending. Among them are the political dimensions, international and EC influences (looking at Europe), industrial needs and the emergence of new technologies (IT, biotechnology and new materials) leading to a shift in funding patterns and thus priorities. Beyond that, governments are also facing hard constraints to balance their budgets, to decrease the public spending allocated to scientific research and to cut taxes, which generates a high demand for greater public accountability and better ‘value for money’. 28 CHAPTER 5 - THE NEED FOR PRIORITISATION IN SCIENCE AND TECHNOLOGY The variety of approaches on the relationship between science, technology and economic growth proclaimed by the different scholars in the field, all share the statement that technical progress, in one or another way, is seen as very important for economic growth. The creation and the use of knowledge, generally referred to as science, plays a dominant role in technical progress. A number of studies have pointed out that the relationship between the knowledge production, transfer and usage, broadly speaking the view on the relation between science and technology, has shifted from linear to reciprocal or network oriented. The ex-ante identification of scientific priorities in the actual socio-economic environment, and the ex-post justification and evaluation of government support, becomes increasingly complex. The growing impact of emerging technologies, such as ICT, electronics, advanced materials, etc. on the socio-economic development has added new dimensions to globalisation and world industrial and economic competition. The emergence of knowledge-based industries, firms and services, generates an enormous competitive pressure on international markets. The emergence of a new ‘long wave’ or Kondratieff cycle of economic development (Freeman and Perez, 1988) has raised and is still raising complex challenges for a better understanding of ways in which progress in new technologies is science-dependent. On the other hand governments face hard constraints to balance their budgets, to decrease public spending allocated to scientific research and to cut taxes, which in turn generates a high demand for greater public accountability and better ‘value for money’. The simple interaction between demand and supply, which works fairly well for products with a clear cost and benefit structure, fails, due to the absence of supply in the case of science. Governments therefore have to intervene and correct the market imperfections. Martin (1996) mentions two reasons why, in the case of government-funded research, objective evaluation seems to be necessary. The first reason relates to the growing costs of scientific instrumentation, facilities and infrastructure required to conduct frontier research – the so-called ‘sophistication factor’. The second reason concerns the emerging problems, as Martin (ibid) perceives them, with ‘peer review’. He pointed out that peer review groups worked well when government spending was increasing by 5-10% a year. New and promising areas of research could be identified successfully, but the results are less satisfactory when it comes to identifying declining areas and groups. Furthermore, peer review groups cannot fulfil the task of showing public accountability in government spending, as they cannot be seen as totally unbiased themselves. Thus, other instruments on which evaluation can be based are more than welcome. Policy interventions also focus on a broader social integration of science and technology issues, which should not be accessible exclusively to scientists and engineers, and which should not be reduced to those technologies with a well-defined economic impact on productivity growth or supply and demand mechanisms. The social shaping of science and technology issues also raises the point of the policy choices that need to be made in relation to the endogenous nature of technological change, stemming from a given economic and social system, and not as a consequence of factors external to this system. Again, this points towards the need for a larger comprehension of the factors shaping the science and technology interaction. 29 Part II: Science and Technology - Examination as two separate spheres CHAPTER 6 - EXAMINING SCIENCE AND TECHNOLOGY I. Introduction Understanding innovation presupposes an understanding of S&T activities. Measurement of S&T activities enhances the understanding of the S&T interaction and provides useful insights for policy makers. One major concern of analysts is to describe S&T activities in qualitative as well as in quantitative terms, so that the indicators can be used in the context of models, explicit or implicit. However, the general problem is that S&T can only be measured indirectly, using input, output of impact indicators (OECD, 1994a). Measuring S&T activities should not be performed solely in a quantitative manner but also in a qualitative manner. However, the importance of quantitative assessment should not be underestimated. Martin (1996) pointed out that quantitative assessment of government-funded research is important and necessary, also in view of the raising concerns around peer review groups (qualitative). Both forms of measurement are necessary in order to perform a balanced evaluation. Besides the various concerns about the measurement tools themselves, one should also have in mind ‘what’ the different indicators and models actually measure. As already mentioned in part I, S&T, and even more R&D, is only one element in the whole innovation process. R&D processes are not necessarily sequential and the borderlines between the stages in the R&D process are not clear-cut. There is no universal indicator appropriate for the description of each phase of the R&D process. Thus, the results of these activities cannot be measured in the customary scientific sense of ‘measuring’ a variable. One of the solutions is to use indicators that are proxies, rather than direct measures of the output of the R&D process. Certainly, the interpretation of these measurements should be done with great caution. In innovation economics, indicators can be classified into three categories: input indicators (or resource indicators), byput (or R&D result) indicators and output indicators (or progress indicators) (Grupp, 1998). ‘Input indicators’ or ‘resource indicators’ should be regarded as a generic term embracing every possible means for measuring personnel, monetary, investment and other expenditure on research and development and innovation. They can contain for example R&D outlays, R&D personnel, investment statistics, royalties paid, etc. ‘Byput indicators’ or ‘R&D result indicators’ measure sub-phenomena of technical change. Grupp (1998) states that byput or throughput indicators measure ‘attendant’ or ‘partial’ effects of technical progress and can thus be regarded as the result of R&D activities, regardless of whether or not these results are important for the success of the innovation. Patent, publication and citation statistics are the most utilised R&D result indicators. ‘ 30 Output indicators’ or ‘progress indicators’ relate to the economic effects of the innovative performance, rather than to detailed R&D activities. They represent the qualitative, quantitative or value-rated advances in production processes of products, which are not solely caused by R&D activities, according to the author. Clearly, the scope of the analysis, the innovation system as a whole or the R&D system, influences the operationalisation and thus the construction of input, byput and output indicators. Figure 2, clarifies the relation between the R&D functions, the input and output indicators and their application. The specific relation between patents and R&D will be discussed in the next chapter. Figure 2 - R&D functions and their linkage to in- and output indicators Input indicators Throughput & output indicators Functions Applications Knowledge R&D personel citations Research (fundamental / basic) R&D expenditures Scientific papers citations Research (strategic / applied) paper citations S&T transfer Patents Industrial development patent citations Technometric specifications Innovation & imitation Foreign trade with R&D intensive products Diffusion (technological/ product) Source: Grupp & Schwitalla (1989) In the middle of the figure, we see the different functions of the R&D or invention process. It is needless to say that the borderlines between the different stages are unclear and that innovation lines are neither linear nor simple. However, a crude structure between R&D stages and the various indicators does exist (Grupp & Schwitalla, 1989). Attention should be paid to the role attributed to ‘citations’ – i.e. publications being cited - and ‘paper citations’ – i.e. literature citations in patents – in the S&T transfer, the diffusion of knowledge. As an introduction, these two output indicators will be discussed in the next two sections. 31 II. Patents and publications as output indicators The most frequently used indicators for measuring technological output are based on patents. Patents, as a detailed source of information on inventive activity, offer an interesting monitor device to identify main lines and trends, and even, under specific conditions, the possibility to analyse R&D processes in more detail (Engelsman & Van Raan, 1994). But what exactly do patents measure? A patent does at least represent a minimal amount of invention that has passed a thorough examination by the patent office on both the novelty of the claimed item and its potential utility (Grilliches, 1990). On the input side, we see various R&D indicators such as research personnel and internal and external R&D expenditures. The total image of relationships has to be taken into account. Scientific publications are considered as output for theory and model development, basic and applied research. The field of Scientometrics covers the measurement of scientific and technical research activity (Gauthier, 1998; Van Raan, 1997). Bibliometrics is a branch of scientometrics that focuses on the quantitative study of scientific publication (scientific output) for statistical purposes. Bibliometric methods serve three main functions: description, evaluation and scientific and technological monitoring. Patents and the use of bibliometrics as technology and science indicators will be discussed extensively in chapters 7 and 8. The science base of technologies can be analysed by ‘linking’ areas of science to technological domains. Patents and publications, as closely related to the S&T interaction, play a central role in the linkage. The relationship between science and technology can be of a direct nature, through literature references in patents, or of a more ‘indirect’ nature, for example inventor – author relationships and co-operations. Both approaches are feasible and have frequently been empirically tested leading to different results (see chapter 9). In our study we will focus on the ‘direct’ linkage approach. In this approach the non-patent references (references to literature) play a central role. Patent inventors occasionally refer to (scientific) literature as a source of inspiration and knowledge for the creation of their invention, also known as ‘prior art’. Extraction of the non-patent references and the identification of the relevant science domains referred to, enables the identification of the science base of the technology under review. Further analysis of the characteristics of the linkage such as linkage intensity, distribution references, validity range etc., will enrich our understanding of the linkage and increase its applicability as an S&T policy decision support tool, ex-ante and ex-post. 32 III. Some considerations around the use of S&T indicators Before starting a more in depth discussion of patents, publications and linkage, we would like to briefly discuss a number of issues related to S&T indicators. It was already pointed out that patents and publications are only partial indicators of respectively technology and science. Therefore, when interpreting results and drawing conclusion based on indicators, one should be very cautious. In a detailed study on knowledge formation and technical development in the pulp and paper sector, Laestadius (1998) argues that aggregated published statistics are not reliable for a deeper understanding of the R&D activities and specifically of the radical technological change that took place in this sector. Furthermore, he concludes that “they are thus also of little use for ranking and comparing countries and or industries” (ibid., p. 393). Thus, statistics (or indicators) should, because of their nature as proxy measures, be interpreted with great care. A second related issue, pointed out to us by Martin (1996), is that evaluation of basic research is best carried out using a range of indicators, or what we could be referred to as a ‘scorecard’ of indicators, for example including peer evaluations and other qualitative as well as quantitative elements. Indeed, the more indicators, the more balanced and reliable the evaluation will be. This approach may very well be projected on the evaluation of not only basic research but also on the evaluation of a range of other activities such as patenting and technological progress. The last general issue that we would like to touch upon concerns what Glänzel (1996) refers to as “The need for standards in bibliometric research and technology”. In order to increase the reliability and validity of bibliometric analyses, standards need to be developed. Standardisation is not only important in regard of used techniques but also in regard of data sources on which the techniques are applied. Differences between classifications, terminology, methodology and databases need to be taken into account when working with S&T indicators. Despite of all this, the construction of indicators is and remains a highly important tool in the analysis of S&T and, as we will see further on, in many cases the only tool available. 33 CHAPTER 7 - PATENTS AND MEASUREMENT WITH PATENTS I. Patents, the patenting system and its economic rationale I.1. A patent and its economic rationale The analysis of patent information is considered to be one of the most established, directly available and historically reliable methods of quantifying the output of a science and technology system (Soete & Wyatt, 1983). Patents, more than any other innovation indicator, occur widely in economic literature. Or like Grupp (1998) says: ‘No other innovation indicator can be traced back over comparatively long periods of time, may at the same time be disaggregated at a very low level allocable to individual economic units, and is also precise and accurate insofar as identification of the timing of the innovation event is concerned’ (Grupp, 1998, p.144-145). Before looking into patents as technology indicators, it may prove useful to identify what exactly a ‘patent’ is. Grilliches (1990) gives a clear description of what is understood under a patent: A patent is a document, issued by an authorised governmental agency, granting the right to exclude anyone else from the production or use of a specific new device, apparatus, or process for a stated number of years. The grant is issued to the inventor of this device or process after an examination that focuses on both the novelty of the claimed item and its potential utility. The right embedded in the patent can be assigned by the inventor to somebody else, usually to his employer, a corporation and/or sold to or licensed for use by somebody else. This right can be enforced only by the potential threat of or an actual suit in the courts for infringement damages (Grilliches, 1990, pp. 1662-1663). A patent is a ‘property right’ based on an ‘officially sealed’ claim (Grupp, 1998). For the claim to be recognised by other competitor companies, all property right details have to be public. The purpose of the patenting system is the protection of the inventor. Without property rights, technological knowledge would be public property and competitors would be able to imitate without penalty and claim new knowledge to be their own. The inventor is granted a temporary monopoly situation and, by doing so, ensuring him of sufficient benefits for his innovative efforts. Patenting also keeps incentives to innovate high enough for private inventors, so that a sufficient number of innovative ‘efforts’ are made, which in turn favours technological advance and economic growth. 34 According to Grupp (1998), a patent has three qualitative properties that require attention in connection. On one hand, a patent grants to the owner the exclusive right of exploitation of a precisely defined technical knowledge for a specific period of time. Three conditions for this grant need to be fulfilled: novelty, quality and the possibility of being commercially applicable. The stimulating function of the patent facility is supplemented by the information function, which manifests itself through publication. The information function is the second qualitative property of patents. Patents can be used by others than the inventors, with the purpose of obtaining knowledge about the progress of technological knowledge. This is mainly possible through the patent and non-patent citations that have a major knowledge component. Thus, from a social-economic point of view, patents have the advantage that the information contained in the patent administration is publicly accessible (a slight ‘contradictio in termini’ when we look at the nature of patents), which leads to a greater diffusion of technological innovations. The patenting system fulfils an important role in the information distribution in the sense that it avoids needless duplication of R&D efforts, which in turn can accelerate technological progress. The third function relates to the output function of a patent document. Successful R&D activity is usually followed by a patent giving detailed information (date, time, circumstances, locations etc.) on the activity itself. The last mentioned property is mainly used in connection with innovation measurement. With regard to the second function of a patent, it is not clear whether the patenting system also enhances social welfare. The system stimulates innovations and technological competition. Simultaneously though, it grants monopoly rights in order to keep the incentives for private inventors as high as possible, which disturbs regular competition, albeit temporarily. Economists like to point to the fundamental discord between appropriation and diffusion of knowledge, a very important factor in economical progress (cfr. section 4.III.). The economic literature suggests various instruments that can be used by policy-makers to deal with this discord. In particular, both length − the duration in time − and width − the range − of the patent protection can be used to influence social welfare in an optimal way. The relationship between patents, as a measure for innovative output, and R&D, as a measure of innovative input, has repeatedly been investigated. The results in the literature indicate a strong statistically significant relationship between R&D expenditures and patent numbers in the crosssectional dimension, i.e. across firms and industries. Pakes and Grilliches (1984) for example found that firms that spend more on R&D, possess more patents. In the within-firm time-series dimension, a statistically significant relationship between R&D and patent counts is found too, but here the relationship appeared to be much weaker. In a study performed by Arundel and Kabla (1998) it was shown that large R&D intensive firms do not patent a higher percentage of their innovations than firms with low R&D intensities. 35 I.2. Patenting systems An inventor (individual, agency or company) wanting to protect an invention in a particular country files an application in that country’s patent office. In the application, one or more claims will be stated, referring to the aspects in which the product or process has to be considered innovative. In the so-called prior art examination, these claims will be subject to further assessment and comparison. It is also possible to protect an invention in more than one country. Up to one year after the ‘priority year’ the applicant can file for a patent in any desired country, at a regional patent office such as the European Patent Office (EPO) for simultaneous protection in more than one country or at the international patent office, such as the World Intellectual Property Institute (WIPO), for a broader protection. Two of the major regional patenting systems are the European Patent Office (EPO) and the United States Patent and Trademark Office (USPTO). As a general rule, a patent office acts only for its own country. An exception on this rule is the EPO, which in some matters can act on behalf of a group of European countries. The difference in patenting procedures is naturally reflected in the patent data made available by each of the patent offices. The major points of difference between the EPO data and the USPTO data are summarised below (Debackere et al., 1999). § "First-to-file" vs. "first-to-invent" Whereas the European Patent Office assigns property rights to the one who is first to file a patent application, the US Patent and Trademark Office employs the "first-toinvent" principle. That is, whoever can demonstrate that he was the first to invent something (for example, by means of "laboratory notebooks"), will be assigned all intellectual property rights, irrespective of the fact that someone else has filed a patent application for the same invention. § Patent applications vs. patent grants Within the system employed by the European Patent Office, patent applications without exception are published 18 months after the priority date. In this system, a patent application is also published, even if the patent is not granted in a later stage. In the United States, however, until March 2001, only patent grants are published by the USPTO. Grants do not follow a strict timetable, and can in many cases take up to five years (OECD, 1994a). After March 2001 however, a separate application-database has been made available by the USPTO. In terms of statistical comparisons, obviously, this will result in a difference in terms of "presence" and "visibility" of companies or other actors in the respective patent databases (see figure 3). Figure 3 – The different lead between EPO and USPTO application procedures (prior to March 2001) National priority (first appl. EPO publication EPO grant Years since priority 1,5 1 Priority year 2 3 4 US publication and grant (patent date) 36 5 § Patent and non-patent references Unlike in the EPO database, the USPTO database also includes citations to documents considered to be relevant. That is, citations to relevant patents as well as to scientific publications are included in the database. As a result of the aforementioned differences, information extracted from the EPO and USPTO databases is not fully comparable. In addition, we also have to be aware that, when making international comparisons with respect to patenting activity of various actors and/or countries, such comparisons are hampered by international differences in the legal conditions surrounding the granting of a patent. In practice, this results in US companies having a comparable advantage with respect to patent grants in the USPTO system, relative to foreign companies. This is because US companies have better knowledge of how the system works. In the EPO system, however, such distortions seem to influence the granting of a patent to a lesser extent. This is mainly due to the fact that the European Patent Office is a supranational agency. European applicants therefore will have little, if any "home advantage" in the EPO system, when compared to the US situation. I.3. Patent data sources In short, patents, because of their electronic availability (e.g. CD-ROMs of the USPTO or "Espace Bulletin" of EPO), make it possible to investigate a whole lot of research questions. Besides the already mentioned "official" patent databases, such as EPO and USPTO, there are various ("commercial") information providers that offer patenting information both in "raw" and in "edited" format, such as e.g. Dialog and Datastar, the most well-known patent database being DERWENT. The two most widely used patent databases are (1) the USPTO patent database, administered by the United States Patent and Trademark Office (USPTO), and her European counterpart (2) the EPO patent database, administered by the European Patent Office (EPO). USPTO Patent Database The USPTO patent database is administered by the United States Patent and Trademark Office (USPTO). The database contains data on all granted patents over the period 1976 up to now. A weekly update is possible through USPTO's FTP-site. The information available consists of: § Title and description of the patent; § Main IPC-code and supplementary IPC-codes; § All relevant time data (application filing date, issue date, priority date); § Assignee information (e.g. name and address); § Inventor information (e.g. name and address); § References to other patents - U.S. Patent Reference and Foreign Reference - and references to non-patent documents - Other References. q EPO Patent Database The establishment of the European Patent Office (EPO) was a consequence of the 1973 Munich Convention that laid down a number of rules concerning filing procedures and the drafting of patent applications. As a result of this convention, it became possible to file one single application for all associated European countries. After granting the patent, it is being split up in a bundle of national patents for each country for which legal protection is requested. q 37 The EPO patent database contains information on all European patent applications as well as on all patent grants since 1978. It contains all relevant information with respect to the technical side of the invention, as well as the administrative and legal history of the patent application and possible grant. In particular, the database contains information on: § Title and description of the patent; § Main IPC-code and supplementary IPC-codes; § All relevant time data (application filing date, issue date, priority date); § Assignee information (e.g. name and address); § Inventor information (e.g. name and address); § Opposition against the patent. Contrarily to the USPTO patent database, the EPO database does not contain information on possible references to other patents or scientific literature. This information is nevertheless included in the REFI database. The REFI (Reference File) database contains all references that are included in European patents from 1978 on. These references consist of citations to other patents, as well as citations to scientific publications that served as an inspiration source for the invention described in the patent document. II. Patent statistics and trivialities around patents As mentioned before, if we want to address questions about the sources of economic growth, the rate of technological change or the competitive position of different firms and countries, it appears that we have almost no good measures. As a consequence, we are reduced to pure speculation or to the use of various, only distantly related, ‘residual’ measures and other proxies. Patents are one of these residual measures. As Grilliches (1990) indicates: ‘In this desert of data, patent statistics loom up as a mirage of wonderful plenitude and objectivity’. Patents offer a host of methodological-technical advantages. However, like with any indicator or methodology, several biasing aspects have to be kept in mind. The overview below illustrates the different pro’s and contra’s (the latter mainly biases that have to be kept in mind during interpretation). Advantages of patents indicators § The special proximity of patents to the output of industrial R&D and other inventive and § Patents cover virtually every field of technology useful for the analysis of the diffusion of key innovative activities implies that there is no other equivalent for this purpose technologies (excepted software, which is generally protected by copyright and can be patented only when it is integrated in a technical process of product) § Patents offer a world-wide geographical coverage § The very detailed classification in patent documents, which allows for almost unlimited choice of aggregation, levels from broad fields to single products § Patent documents include many details of interest, such as year of invention, technical § The statistical processing of data is largely free of errors, because patent documents are legal classification, assignee, inventor and so on documents in which the details are recorded carefully § Accessibility, electronic availability of patent data (EPO/USPTO data on CD-ROM’s, DERWENT, Dialog, Datastar) 38 Biases towards the use of patent indicators § Firms differ in their propensities to patent (# patents per unit of expenditure on R&D or just # of patent applications) § Technology fields differ in their propensity to patent § Countries differ in their propensity to patent: size and geographical position give rise to different expectations of the returns from patent protection (combination with other input or output indicators is necessary) § Differences among national patent systems, arising from legal, geographical, economic and cultural factors (issue of ‘home advantage’) Many of the abovementioned advantages are in practice not very visible, and in some cases not visible at all, meaning that some advantages are disadvantages at the same time. However, to cite Grilliches (1990) once more: ‘In spite of all the difficulties, patents statistics remain a unique resource for the analysis of the process of technical change. Nothing else even comes close in the quantity of available data, accessibility, and the potential industrial, organisational, and technological detail.‘ (Grilliches, 1990, p.1702) When using patent statistics in order to answer questions about sources of economic growth, the rate of technological change or the competitive position of different firms and countries, certain other trivialities intrinsic to patents and the patenting system have to be kept in mind, besides the ones mentioned above. These trivialities are: § Rationale of patenting § Classification system of patents § Value of a patent We will discuss these trivialities in the following three paragraphs. II.1. Rationale of patenting: patenting behaviour With ‘rationale of patenting’ we refer to the underlying motives and considerations of the decision to apply for a patent. As we have mentioned before, various reasons for not applying for a patent can exist, apart form the reasons behind the procedure for patenting. Patent protection is not the only way to reap market success from an innovation. Secrecy, rapid launching, low prices and so on can supplement or even replace patent protection. In areas of fast-moving development, patent protection may be of little value because inventions quickly become obsolete. This implies that: § Not all inventions are utilised and commercialised, and consequently do not lead necessarily to innovations; § Not all innovations are patentable and those that are patentable, are not necessarily patented; § Sometimes a company patents inventions that are not used. 39 The economic literature also pays a lot of attention to the strategic patenting behaviour of companies. Many empirical studies have pointed out that not all innovation-active companies engage in patenting and that companies that do engage in patenting do not apply for patents for all of their innovations. This has important implications for the utility of patents (the validity range) as a measure for innovativeness of companies and nations. Figure 4 – Validity range of patent indicators according to product groups PATENTABLE INVENTIONS ALL INVENTIONS PATENT APPLICATIONS INNOVATIONS Innovation relevant patent applications Patent applications without economic applications Innovationrelevant inventions without patent application Non-patented inventions without economic application Source: Grupp (1998) The relation between inventions, innovations and patents is summarised in figure 4. The figure shows that not all inventions are patentable and that not all patentable inventions are actually patented. Furthermore, not all patents or inventions are economically applicable. These differences between inventions and patents and between patents and final economic application are crucial in the interpretation of patent indicators or of other quantitative measurements with patents. Why do companies choose not to patent a certain invention? Various important reasons can be identified. First, the benefits of patenting an innovation may not outweigh the costs involved, implying the direct costs involved with the application process, and also the disadvantages experienced due to the patent conveying information to possible imitators, which in turn are able to innovate more quickly around the granted patent with less R&D investment. Scherrer (1996) mentions that in the Cohen survey, it was found that applications are only submitted for 52 percent of product inventions and for 33 percent of process inventions. Other reasons for not patenting are summarised in the table 1. Table 1 – Overview of motives for not patenting an invention Reasons for not patenting In % Too easy to invent around 65% Too hard to show novelty 55% Reluctance to disclose 47% Cost of application 37% Source: Scherrer (1996) 40 Scherrer (1996) concludes from these results that mostly the unimportant inventions are not patented. In 1993 MERIT in The Netherlands and SESSI in France conducted a survey on the innovative activities of Europe’s largest industrial firms in collaboration with INSEE5. The propensity rate was assumed to be equal to the percentage of innovations for which a patent application is made. The analysis revealed that the average propensity rate for product innovations is 35.9%, varying between 8.1% in textiles and 79.2% in pharmaceuticals. The average for process innovations is 24.8%, varying from 8.1% in textiles to 46.8% for precision instruments. Only four sectors have patent propensity rates that exceed 50%, for both product and process propensities (Arundel & Kabla, 1998). Furthermore, the effect of secrecy as a reason for not patenting was examined. It appeared that firms that find secrecy to be an important protection method for product innovations are less likely to patent, but that secrecy has little effect on the propensity to patent process innovations. The R&D intensity of the firms has no effect on patent propensity rates for both product and process innovations (ibid., 1998). Mansfield (1991) pointed out that protection by means of a patent has little effect on the speed and cost associated with imitating a particular innovation. On average, copying patented product innovations appeared to cost no more than 11% more than copying non-patented innovations. Moreover, companies estimate the cost of imitation being a mere 6% higher, if they had patented the products that are currently not patented. Whereas some innovations are not patented, others lead to a plenitude of patents. This depends on the nature of the innovation. In chemistry, for instance, an entire family of molecules can be used to make a product with specific characteristics. The industrial application, however, will probably only use one of those molecules, but the innovator is not effectively protected against imitation until he has a patent for the entire family of these molecules. In addition, strategic patenting behaviour of firms can cause a profusion of patents. A patent portfolio strengthens a company's bargaining position in technology agreements, and it can also be used to keep new companies out of the market. The environment in which companies operate, influences the cost-benefit analysis of protection by means of patenting. Important determining environmental factors are the intensity of technological competition and the ease with which technological innovations can be copied. This can explain why patenting behaviour appears to be very sector specific and innovation type specific (process or product), as also pointed out in the MERIT/SESSI survey (see Arundel & Kabla, 1998). Furthermore, the stage of the technological lifecycle in which the product is situated has an influence on the patenting behaviour. Pakes (1986) finds that patents are applied for at an early stage in the inventive process, a stage in which there is still substantial uncertainty concerning both the returns that will be earned from holding the patents, and the returns that will accrue to the patented ideas. Large companies (>100 employees) are inclined to consider both strategic and legal protection mechanisms, whereas small companies, in view of their financial possibilities, apply less quickly for patenting protection. 5 SESSI is the Statistical Service of the French Ministry of Industry. The survey was conducted in co-operation with the National Institute of Statistics and Economic Studies. 41 II.2. The Classification problem A second major issue around patents and the patenting system is the classification problem. To use patents in economic research of any kind, we must be able to relate them to meaningful economic categories of interest. However, the classification system is primarily based on technological and functional principles and is only rarely related to economists’ notions of products of well-defined industries (Grilliches, 1990). According to the OECD patent manual (1994a), ‘Inventions are classified by one or more symbols, so that patents belonging to a given technological field can be filed and retrieved’. Patent offices classify patents in classes and subclasses based on the need to ease the search for prior art. ‘Some system is necessary for accessing the technical information contained in published patent documents’ (ibid., 1994a). Thus, the classification scheme has been constructed from a technical point of view. First of all, we will briefly explain the International Patent Classification (IPC). Then we will discuss the issue of relating the classification scheme to meaningful economic categories of interest. II.2.1. The International Patent Classification (IPC) In view of the international dissemination of patent information, a single international system has proven itself necessary. This international system became the IPC, which is applied in a large number of countries and four international organisations. Its symbols are printed on published patents. The IPC is divided in sections, classes, subclasses, groups and subgroups. IPC is designed so that each technical object to which a patent relates can be classified as a whole. A patent may contain several technical objects and may therefore be allocated several classification symbols. An invention is normally classified according to its function or intrinsic nature, except when its application alone determines its technical characteristics. For example in IPC, subclass F02K is a product-oriented subclass that contains all jet propulsion plants. But subclass H03K is a functionoriented subclass covering the whole range of pulse techniques and contains many different products. According to the OECD patent manual (1994a), the IPC is a combined function/application classification system in which the function takes precedence. The IPC is revised and if necessary amended every five years, but not retroactively. This means that no re-indexation whatsoever takes place and that the whole classification is no longer very accurate (Grupp, 1994, Research Policy). This is one of the biggest disadvantages of the IPC. Due to the changing classification, comparison of the evolution of technology fields, by using the classification, becomes difficult. It is essential that the relevant versions of the classification should be used for a study covering number of years. II.2.2. Relating patents to economic categories Classification schemes evolve in time due to changes in the categories (add, remove, merge categories). Furthermore, if we want to make use of patenting information to examine economic performance of innovations, we are confronted with the problem that patent classifications (IPC) and industrial classifications (such as SIC for OESO data and NACE for Eurostat/NIS data) are not directly comparable. As we discussed, the patent classification system is based primarily on technological and 42 functional principles and is only rarely related to economists’ notions of products or well-defined industries. Grilliches (1990) notices that even before any classification is attempted, one has to take into account the inherent ambiguity of the task. When we want to assign a particular invention to an industry, we can either assign it: a. to the industry in which it was made (‘industry of origin’); b. to the industry that is likely to produce it; c. to the industry that will use the resulting product or process and whose productivity may benefit thereby (‘industry of destination’ or ‘industry of use’). In the last 15 years several attempts to overcome this problem have been undertaken. Grilliches (1984) and his colleagues at the National Bureau for Economic Research started from patent totals for particular firms and then grouped them into industries according to the company’s primary activity. This thus comes down to a classification by ‘origin’. However, firm data are often not the most appropriate to look at. This approach may be useful for the analysis of company level data, relating patents to R&D investments and the subsequent wealth of the companies in, which they originated. But, according to Grilliches (1990), it is much less useful for the analysis of industrial data, because particular patents may have an impact far beyond the boundaries of their industry of ‘origin’. Grouping of companies by industrial classification codes (SIC) also appears to be a rather unsatisfactory solution, because of the conglomerateness of many of the large corporations. A company may change its name and moreover, the patent office does not always use a consistent company code in its computer record. At the micro-level, Trajtenberg (1987) provides a more pragmatic, rather than an all-out solution for the classification problem. He proposes a more modest approach based upon the availability of powerful techniques for computerised search in large databases. Search techniques can make use of keywords in the title and/or in the abstract (possibly established with the help of experts) pertaining to the product in question that may appear, identifying a small set of relevant patents, their classification codes and any other relevant information. Those techniques should allow one to identify quite easily all the patents issued in predetermined economic categories and to retrieve them for further analysis. Nevertheless, Trajtenberg (1987) also acknowledges that there is no absolute certainty that this approach will deliver all the patents in a given field, and only those. Rather, the search process consists, at least at first, of trial and error, something that is also supported by Grupp (1998). Therefore, a distinguishing feature of Trajtenberg’s (1987) approach is that the units of analysis are narrowly defined product classes or technology fields, rather than firms or industries. Additionally, Grupp (1998) mentions that the so-called combined search strategies for patent documents, by using both a systematic search by means of classification symbols in combination with the relevant keywords have been shown to be rather successful (see for example Noyons et al. 1998; Meyer, 2000b). Over the years, various attempts were undertaken to develop ‘concordance’ tables for classifying patent data by economic sectors. In the mid-1970s, the US Office of Technology Assessment and Forecast (OTAF) tried to produce patent statistics at the three and two-and-a-half standard industrial classification (SIC) digit level. This was done by developing a ‘concordance’ table between the patent class and subclass classification system and the SIC industrial classification. If a subclass did not 43 obviously appear to belong to a single SIC industry, it was counted in all of the relevant ones, resulting in considerable double counting. This concordance table was heavily criticised, both because of the arbitrariness in the assignment of some of the subclasses and because of the misleading inferences that could arise from the double counting. These fundamental problems are inherent to concordance schemes. In 1994, Verspagen et al. (1994) developed the MERIT concordance table between the International Patent Classification (IPC) used by the European Patent Office (EPO) and the International Standard Industrial Classification of All Economic Activities (ISIC – rev.2) of the United Nations. The table makes it possible to assign the patent data by field of technology (IPC) to a classification by economic sector (ISIC). In this concordance table, if an IPC-class does not obviously belong to a single ISICindustry, it is assigned to all the relevant ISIC-classes for a percentage proportional to the share of the patents in that IPC-class. The percentages thus give the share of the patents in each IPC-class assigned to the accessory ISIC-category. It should be mentioned that this concordance is also arbitrary. According to Grupp (1998), the division of a functional relationship between markets and areas of technology that is not derived from a specific innovation project, but that is to be universally valid, would seem to be particularly unrealistic when a radical innovation is present. This is because the application of such an innovation is expected to be wide ranging with a great influence on several economic sectors. ‘The construction in each case represents an attempt of a pragmatic partial solution with a shake theoretical basis’ (ibid., 1998). However, Grupp (1998) considers the concordance problem soluble for individual innovation processes (microeconomic level), but not for universal purposes (macro- and meso-economic). II.3. The problem of the ‘value’ of patents A second major problem with patents concerns the value and valuing of a patent. Patents appear to vary tremendously in their technological and economic ‘importance’ or ‘value’. Only a very small number of patents comprise inventions of high technological or economic value, while the bulk of patents are only of marginal value. Grupp (1998) discusses this issue by questioning the existence of so-called ‘key patents’. These are patents serving as the basis for a number of other patents and which are thus extremely valuable. With regard to the latter, Grupp (1998) also mentions the ‘basic patent’ concept, which is often applied to genetic engineering and robotics, areas generating fundamental breakthroughs. The central question is: how can we differentiate between these very valuable and less valuable patents? By simply counting the number of patents, we give equal weight to all patents, implicitly considering them to be of equal importance. Because the economic significance of a patent is so variable, it is very difficult to estimate the average value of patent rights or the average value of the invention represented by a particular patent. Scherrer (1996) identifies three prolific avenues of research dealing with the problem of ‘importance’ of patented inventions: 1. The use of patent citation counts as an index of the importance of patents. 2. The use of data on the number of countries in which a particular patent is applied for. 3. The use of patent renewal data. 4. The stock market valuation of firms to investigate the ‘value’ of patents. 44 Grupp (1998) also mentions patent citation counts as a possibility for valuing a patent and making it comparable. But, furthermore he also mentions the use of patent claims as another way of making the economic value of patents comparable. In the next sections, we will briefly discuss each of the identified ways. The use of patent citation counts Trajtenberg (1990) found that the economic value of patents could better be measured with a so-called citation weight than with individual patent figures. He worked out how often a patent is cited in later patents (‘prior art’ search information) and argued that this frequency is related to the economic value of the patent. The higher the number of citations, the higher the economic value of the patent. Interesting at this point is to mention the study of Narin & Noma (1987), who examined the links between corporate patent and citation data, and several other indicators of corporate performance in 17 US pharmaceutical companies. One of their conclusions was that the number of patents correlates with expert opinions, budgets and publication data but not with financial performance, whereas citations per patent do correlate with financial performance. This confirms that patent citations relate to the economic value of a patent. q This line of research can be traced directly to the widespread use of citations appearing in the scientific literature (more about this subject in chapter 8). Unlike in scientific publications, when applying for a patent, it is the (legal) responsibility of the applicant to describe fully the ‘prior art’ (United States). The applicant, assuming that he or she is also the inventor, must set out the background in such a way as to show how the claimed invention relates to, but is innovatively different from what was already public knowledge. The applicant's citations therefore identify work that is related to the new invention, whereas the examiner's citations serve to define the area in which the invention is truly original and therefore merits the granting of a patent. The use of data on the number of countries in which a particular patent is applied for Here, the ‘value’ of a particular patent is assumed to be proportional to the number of countries in which it has been applied for, also referred to as ‘geographical scope’ of the patent requested protection (Debackere et al., 2000). The reasoning is that for the more ‘important’ inventions, patents are applied for in relatively more countries than for inventions that are less ‘important’ in the judgement of the applying company. q The original application and its duplicates (in other countries) are termed a ‘patent family’. When a patented invention is applied for in three countries, the ‘patent family’ would thus consist of three members. If it would have no external/foreign applications then it would only have one ‘family member’. As already mentioned, the reasoning would be that inventions with a large patent family should be considered as more important than a patent that is only applied for domestically. Grupp (1998) however, objects that the patent's contribution to progress − i.e. what makes it a more or less ‘important’ patent − cannot be correctly estimated from the size of the family, because taking out a patent in other countries than the domestic one is often subject to marketing and other corporate strategic considerations. Moreover, he mentions, one has to take into account specific economic geographic conditions as well. It is generally known that, for example, European companies, due to their geographical situation and high volume trade with other European countries, apply for patents in many more external (European) markets than for example U.S. companies. 45 The use of patent renewal data In this approach, the ‘value’ of a patent is determined by the patent renewal information. In most countries, holders of patents must pay an annual renewal fee in order to keep their patents in force. If the renewal fee is not paid in any single year, the patent is permanently cancelled. It is assumed that renewal decisions are based on economic criteria. This means that agents will only renew their patents if the value of holding them for an additional year exceeds the cost of such renewal. Moreover, patents for which patent protection is more valuable will tend to be protected by payment of renewal fees for longer periods of time. q By using a learning model for the early years of a patent's life, Pakes (1986) finds that patents are applied for at an early stage in the inventive process. At that moment, often, there is still a substantial uncertainty about the returns that will be earned from holding the patents. Gradually, the applicants uncover more information about the actual value of their patents. Most of them turn out to be of little value, but the rare ‘winner’ justifies the investments that were made in developing them. By using these results, Schankerman and Pakes (1985) estimated the average value of a patent right. It appeared that the average value of a patent right is quite small, about $7.000 in the population of patent applications in France and the U.K. and $17.000 in Germany. However, the distribution of these patent right values appeared to be very dispersed and skewed. One percent of patent applications in France and the UK appeared to have values of $70.000 or more, while in Germany one percent of the patents granted had values of $120.000 or more. They also estimated that annual returns to patent protection decay rather quickly over time, with rates of obsolescence of around 10 to 20 percent per year. Stock market value This line of research uses data on the stock market values as indicators of the success of inventive activity and as the driving force behind the investments in it. According to Grilliches, Pakes and Hall (1987), the use of stock market values as an ‘output’ indicator of the research efforts has one major advantage: under simplifying circumstances, a change in these research efforts would be reflected immediately in the stock market value of the firm. A significant disadvantage of this type of measurement is the large volatility in stock market measures. The results of Pakes' (1986) study on the relationship between patents, R&D and the stock market rate of return indicate that about 5 percent of the variance in the stock market rate of return is caused by the events that change both R&D and patent applications. q Finally, we mention the suggested approach by Grupp (1998): the use of patent claims in valuing a patent. The number of claims that are in the patent can represent its economic value. For instance, at the American Patent Office, Japanese patents on average have fewer claims than German patents. A big disadvantage of this method is that the patent claim indicator cannot be detected in databanks and has to be constructed manually. Before turning to the actual measurement of patents, let us summarise a number of findings that will have to be kept in mind when measuring with patents, also in our study. In short, not all inventions are patented (for different strategic considerations) and not all patent applications have an economic value. The value of patents differs greatly depending on the used approach. Measuring with patents is not without trivialities. However, understanding them makes their use easier. 46 III. The use of patent statistics for technology measurement In this paragraph we will look at the quantitative aspects of measuring technological progress through patents. Patent data can, as already mentioned, be aggregated and analysed in a number of ways, including (OECD, 1994a): 1. Patenting by type of inventor, by firms or groups of firms 2. Filings in one or more fields of technology 3. The patenting activity of a country or a region 4. Patenting patterns over time It is clear that patent indicators can be constructed on macro- (country or region), meso- (industry) and micro level (the single firm or institute). The aggregation level differs, depending on the study under review. Our study will focus mainly on the macro level (for more details on the indicators that will be used in this study, we refer to part III). Besides the simple counts of patents of an actor (country, region, firm etc.) in a certain field of technology, it is also possible to relate patent counts with indicators such as R&D spending, indicators for innovation etc. In the next sections, we will discuss a number of indicators and aspects of using patents in measuring technological progress, that are relevant for our study. III.1. Measuring and comparing with simple output indicators; ‘foreign’ patenting The simplest type of patent indicator is derived by counting the number of patents that comply with one or more criteria (technology field, application year, certain inventor etc.). Comparing the number of patents between countries in a certain technological field can provide a basic insight in the differences in technological performance. In a clearly defined macroscopic domain, comparisons based on simple counts (such as between countries) are feasible and reliable. If more than one inventor makes a joint application for a patent, there are mainly two possibilities for assigning this patent to their countries of origin (OECD, 1994a). The first one suggests ‘sharing’ the patent among the various countries concerned, each country receiving an equal share. The second possibility suggests appointing a full count (1 patent) to each country concerned (cfr. section 8.IV.3.3.) For large aggregations of patents, both methods provide equally reliable findings. However, national patenting activity depends on institutional factors, on the nature of the legal system and on many domestic factors, including the size of the population and the economy, the size of its R&D and research community and the technological infrastructure. In order to take these differences into consideration, we can relate patent counts to demographic (# inhabitants), economic and research variables (GDP, R&D expenses). This will provide patent indicators that are independent of the size and characteristics of the countries and that enable a more ‘equal base’ comparison. This process is frequently referred to as ‘normalisation process’. An often-constructed measure is the ‘propensity to patent’. This contains the number of patents per dollar (or other monetary unit) invested in R&D or per R&D staff member. The differences in the propensity to patent, as pointed out by Caniëls (1997) and others, can be caused by inefficiencies in the innovation process and/or by the fact that R&D investment is used merely for imitation. In other words, 47 it measures the extent to which R&D inputs (in terms of expenditures or personnel) are translated into patents, and it can therefore be regarded as an (imperfect) measure of R&D output. When making international comparisons of patenting activity, one has to take into account possible biases in the degree to which countries (or regions) patent their inventions. According to Soete & Wyatt (1983), international comparisons raise the question of whether there are any specific biases in the degree to which countries patent their inventions. In the 1970s, it was widely known that the Japanese patented significantly and systematically more across all industries, whereas socialist countries hardly made use of patents. International comparisons of patents are hampered, furthermore, by international differences in the legal conditions surrounding the granting of a patent, above referred to as institutional factors, domestic factors and the legal system. The ratio of patent grants to patent applications can vary significantly between countries. McCulloch (1980), Soete & Wyatt (1983), OECD (1994a) suggest that in order to avoid such differences, and again compare countries on a ‘equal basis’, one should make use of the relative penetration of third country markets. For example, when comparing U.S. and Japan, one could calculate the fraction of total United Kingdom patents issued to each country. In this way, national (or regional) performance is compared in the face of a common set of legal and economic conditions, such as a similar screening procedure, etc. However, Soete & Wyatt (1983) explicitly mention that firm- and industry-specific patent propensity biases will not be corrected. For a further discussion of a number of studies about foreign patenting, we refer to Pavitt (1985). Foreign patenting is an approach that will also be used in our study. When comparing the patenting activity of countries, we will analyse their patenting activity in a third country or, in this case, a relatively impartial patenting office (USPTO or EPO). III.2. Specialisation indices ‘Specialisation indices’ can be used to answer questions such as ‘where does the country (or region) under study stand in various technology domains compared to other countries (or regions)?’. The ‘Revealed Technological Advantage (RTA)’, developed by Soete and Wyatt (1983), is the most frequently used specialisation index and will be discussed in the next section. § Revealed Technological Advantage (RTA) To understand the structural factors underlying the relative technological positions of the major economic zones, insight is needed into their respective technological strengths and weaknesses. The RTA-index or ‘specialisation index’ or ‘activity index’ or ‘relative specialisation index’, frequently used in patent analyses, is an index that can provide this kind of insight. The RTA-index consists of the ratio of the number of patents of a country in a particular technological sub-domain, divided by the total number of patents in this sub-domain, and number of patents of the country under study in the whole field, divided by the total number of patents in the field. In other words, the RTA-index compares the share of a particular country’s patents for a particular technological sub-domain with the share of other countries in the same domain. As such, it is a relative indicator of technological specialisation (strength). If country X has a share that appears to be bigger than that of 48 other countries, we can say that country X has a ‘revealed technological advantage’ for that specific technological domain. The definition of the RTA index is as follows: åP RTA = åP åP Pij ij i ij j Pij ij ij = Number of patents of country i in sub-domain j åP = Number of patents of all countries in sub-domain j åP = Number of patents of country i in the whole field åP = Number of patents of all countries in the whole field ij i ij j ij ij The value of the RTA-index varies from 0 to +∞. A value lower than 1 reflects that country i has a relative disadvantage in category j. A value of 1 corresponds to a neutral position, whereas a value exceeding 1 signifies a relative advantage. The index6 will be used to compare and to reflect possible changes in time in relative positions for the relevant technological sub-domains. Evolutions in specialisation levels for a particular country or group of countries will also be reflected. According to Soete & Wyatt (1983), the evaluation of the technological performance of a country over time has to be done by splitting the considered time-span of evaluation into two (or more) time frames and calculating RTA-indexes for both of them. The indexes for both time frames can be plotted in a diagram that makes further comparison of the performance possible (see figure 5). RTA 1st period Figure 5 – RTA-index diagram for comparing countries 3 I II IV III 2 1 0 1 2 3 RTA 2nd period Source: Soete & Wyatt (1983) The vertical axis represents the RTA-indices for the first time frame, whereas the horizontal axis does the same for the second time period. As a result, in the upper left quadrant (I), technological sub6 A major disadvantage of the RTA-index is its skewed distribution (it is asymmetrical and unconstrained on one side), which makes it less suited for "distance" measurements between countries for various technology classes. For distance measurements a symmetrical version is available, the RPA-index. 49 domains can be found where the country under study had a relative advantage in the first time frame, but had lost it in the second. The upper right quadrant (II) then represents sub-domains in which the country had a technological advantage in both periods. The lower right quadrant (III) contains subdomains in which the country developed a technological advantage during the second period, from a position of disadvantage in the first period. Finally, technological sub-domains for which the country concerned had a relative disadvantage in both periods can be found in the lower left quadrant (IV). § Revealed Patent Advantage (RPA) In order to make the RTA-index suitable for distance measurements, the RPA-index is advisable. The RPA is an adjusted version of the RTA-indicator. The method of calculation is given below: RPA = 100 ln RTA = 100 x (RTA2p1) / (RTA2+1) The value of this index varies between −100 and +100, with 0 as a neutral value. It can appropriately be used for distance measurements. Interpreting the RTA-index should be done with great caution, especially in the case of a low number of patents in the fields under review. The RTA is actually a combination of a specialisation index and an activity index. Accordingly, the only correct interpretation of the RTA-index is that it reveals specialisation of a certain country in a certain domain and corrects this specialisation by the countries patenting activity. Indeed, results for countries with low numbers of patents should be interpreted with care. Furthermore, we would like to mention a less known patent indicator introduced by Grupp (1998), the ‘International Technology Production (ITP). This indicator yields an adjustment to control for the domestic advantage. This ‘domestic advantage’ H (Home country) for domestic applications from country X can be calculated with the aid of external patent applications in another country Y. The ITP results in the actual number of patent documents applied for externally (or which were applied for on domestic territory if no domestic advantages existed). A normalisation is achieved for the size of market, the attractiveness of the patent office and the legal peculiarities. 50 III.3. Impact indicators § Attractivity Index (AII), Citation Performance Index (CPI) In order to analyse the quality or value of patents (cfr. section 7.II.3.) of a certain country, and/or blocks of countries, the impact of patents (per domain) can be measured through their citation intensity. For example: what is the impact of German patents in Europe? Or what is the impact of German patents in a certain technological field? The added value of this type of analysis is to add a qualitative aspect to the pure quantitative analysis. In order to address impact issues of patents, the Attractivity Index (AII) can be calculated. This indicator is a variant on the RTA index, except now citations are used instead of patents (for calculation, see RTA-index). Another indicator for measuring the impact of patents, developed by CHI Research which is a United States company specialising in science and technology indicators, is the ‘Citation Performance Index’. Its procedure is to compare the most highly cited 10 percent of patents for a country to those of the world. A ration of 1.0 means the country’s citation performance exactly matches that of the world. This indicator also measures the impact or ‘quality’ of the patents of a certain country. The formula for the indicator is: Pi Pt Pi = the percentage of country i’s patents appearing among the most cited 10 per cent Pt = the same percentage for the worlds patents III.4. Maps of technology Finally, we should mention the use of information contained in patents to construct ‘maps of technologies’. For this purpose, besides information on the innovating company and the specific characteristics of the invention itself, information is gathered on the references contained in each patent application both to previous relevant patents and to research papers reporting results on which the invention is based. ‘Maps’ of various technological (sub-) domains can then be constructed by examining the interrelation between frequently cited patents. Co-citation, co-classification or co-word analyses are possible. Moreover, within each (sub-) domain, an assessment can be made of the relative position of different actors ranging from companies, research institutes to entire countries or regions. We shall discuss the technique of mapping extensively in the next section. As Pavitt (1985) mentions: “Carpenter, Narin and their colleagues, have carried over into the analysis of patent statistics many of the bibliometric techniques developed for the analyses of scientific papers”. That is why we prefer to discuss these items at their origin: bibliometrics. 51 CHAPTER 8 - BIBLIOMETRIC INDICATORS AND ANALYSIS OF RESEARCH SYSTEMS I. Introduction to ‘bibliometrics’ Just like patents constitute an (imperfect) output-indicator of inventive activity, scientific publications constitute an (imperfect) output-indicator of scientific activity. Whereas patents indicate a transfer of ‘knowledge’ to industrial innovation and a transformation into something of commercial and social value, scientific research is the production of ‘knowledge’. Publication has three objectives: to spread scientific findings, protect intellectual property and to gain fame. These three objectives reveal that scientific publications can be seen as a means of examining the knowledge production function and specifically analysing the R&D results in regard of scientific performance. Scientific journals − sometimes referred to as the ‘serial literature’ − obviously play a leading role in the communication of research findings in many scientific fields (Moed, 1989). The area of research utilising the information contained in research publications to get a clearer view on an actor's scientific output is labelled ‘bibliometrics’. In 1969, Pritchard coined a new term – ‘bibliometrics – for a type of study that had been in existence for half a century and was at the time known s ‘statistical bibliography’. In literature, in relation to bibliometrics we can find the terms ‘scientometrics’ and ‘informetrics’, which in some cases are being used as synonyms but actually have a different scope. Several definitions of bibliometrics can be found in literature. Pritchard defines bibliometrics as: ‘… the application of mathematics and statistical methods to books and other media of communication.’ (Pritchard, 1969, pp. 348 – 349) ‘Informetrics’ deals with the quantitative aspects of information in broader sense whereas ‘scientometrics’ refers to the application of bibliometric techniques to science and or technology measurement, according to Van Raan (1997). Bibliometrics thus can be seen as a branch of scientometrics that focuses principally on the quantitative study of scientific publications for statistical purposes. In view of the aims of our study we shall continue with referring to the measurement and evaluation of science as ‘bibliometrics’. It should also be noticed that in our view bibliometrics are also suitable, and frequently used, for monitoring science and technology. Derek de Solla Price, one of the pioneers in scientometrics and author of the milestone work Little science, Big science (1963), indicated that the quantification and mathematical treatment of science can be extremely valuable because of their power to bring order in the world of observation. However, he notifies, quantification will surely not solve all problems of science. In addition, he recognised that measuring ‘quality’ of scientific research in a quantitative way is by no means an easy topic to deal with. 52 ‘Bibliometrics’ then indicates the collection, handling and analysis of quantitative bibliographic data, derived from scientific publications. Bibliometric methods serve three main functions: description, evaluation and scientifical and technological monitoring. As a descriptive tool bibliometrics provide an account of publishing activities at micro-, meso- and macro level. Secondly, bibliometrics can be used for assessing the performance of research institutes or even countries. Finally, and that is the third function, bibliometrics can be used for monitoring developments on S&T since longitudinal studies are possible. The use of bibliometric data made it possible to study science unobtrusively, that is, independently of information obtained from the scientists − the object of study − themselves (Tijssen, 1992). Just like patents appeared to be an important, yet imperfect indicator of technological activity, publications in scientific journals play the same role with respect to the measurement of scientific activity. Still, although a large part of the communication of ideas takes place in the form of spoken language (e.g. personal communications or lectures), and although not all written communication (e.g. reports, memo's, working papers) appear as articles in scientific journals, one can assume that eventually all relevant research findings are reported in the serial literature (Moed, 1989). However, several studies (Moed et al, 1985; Moed & Van Raan, 1988) have indicated that the role of the serial literature is not equally important among all sub-fields within science. II. Bibliometric information: publications and data sources II.1. Publications as data The principle objects of measurement in bibliometrics are scientific publications. A scientific publication can be defined as any kind of written material, either in a physical format, or as its electronic equivalent in a computerised database, containing information with respect to scientific research activities. Research reports, books, conference proceedings and articles in scientific journals are considered to be the most important types, whereas research notes and letters, for example are considered to be less important (Tijssen, 1992). As Martin (1996) points out that scientific publications are a reasonable measure of scientific production. Furthermore, he points out that most publications make only a very modest incremental addition to knowledge, while only a very few make a major contribution. Publication counts, just like patent counts, are only a partial indicator of contributions to knowledge. It is a variable reflecting (a) the level of scientific progress made by an individual or group and (b) a number of other factors such as social and political pressures (ibid., 1996). A scientific publication usually contains a title, the names of the authors and their institutional affiliation. Within the text, frequently, citations are given to other publications of which the content, in the mind of the authors, is related in some way or another to their own article. All citations given in the text are usually reproduced at the end of the article, in the reference list. In order to make them easily retrievable, every publication is uniquely represented by the bibliographic information it contains: author(s) name(s), journal title, publication year, volume number and starting page number. Finally, specific key-words, terms or classification codes, added by the author or the journal's reviewers, can be included in order to define the content of the article adequately (Moed, 1989). However bibliometric information contains a lot of errors. As Glänzel (1996) argues there is still a long way to go on the subject of standardisation in bibliometric research and certainly on the issue of publication information like author names, addresses etc. This topic will also prove to be very important in our study. 53 According to Tijssen (1992), scientific publications contain a multitude of quantifiable elements related to three main aspects of scientific activity: (1) The ‘size’ of scientific activities, as reflected in the output of research publications (the ‘products’) (2) The transfer of knowledge (the ‘process’). Scientific publications are considered to be the major communication channel for presenting original scientific findings and dissemination of scientific knowledge, and the citation process reflects communication of knowledge in the scientific community. (3) The social and cognitive networks of science (the ‘structure’). Relational aspects of science can be studied by means of information on the authors of co-authored publications, their addresses, reference lists, and keywords taken from abstracts or the text. Very similar to these elements of scientific publications are the categories that Martin (1996) distinguishes: scientific activity, scientific production and scientific progress. Quantitative measures in general give the impression of being more objective, more precise, and more reliable than information from a qualitative nature. However, validity, reliability and relevance of measurements should be checked when trying to quantify science (Tijssen, 1992). Besides, just like with patents, science measurement and assessment should not be solely based on qualitative or on quantitative measures. Reliability refers to the problem of stability or instability in measurement respectively leading to similar or not similar outcomes. Any bibliometric measurement incorporates a certain amount of unsystematic (‘random’) error, such as e.g. misspellings of author names. However, also systematic errors, which represent a serious problem, may occur, e.g. when not all-important researchers in a field are covered. Validity pertains to the issue whether measurements really measure what they purport to measure. In the field of bibliometrics especially the validity of bibliometric variables as indicators of ‘quality’ of the scientific research performance is questioned. Finally, relevance is concerned with limitations in the range of application. This definitely also applies to bibliometrics, since it is founded upon the assumption that scientific publication represent in a consistent way the total outcome of scientific research activity. The issue of relevance can pose a problem in those (sub)-fields of scientific activity that are characterised by local orientation of research activities, non-journal publishing practices, and insufficient reliance on references to acknowledge intellectual debts (Nederhof, 1991). Since insufficient validity, reliability, and coverage are for some part inherent to bibliometric data, Tijssen (1992) stresses that one always has to take into account that any bibliometric measure will only remain a proxy for a qualitative parameter in the complex system of knowledge production. II.2. Scientific literature databases The source for bibliometrics is always a database (OECD, 1997). Relevant bibliographic pieces of information are extracted from scientific publications and gathered in large scientific literature databases. The content of these databases is usually available in a hard copy format, but nowadays most databases are also available in computer readable form and even through the Internet. The producers of these databases extract bibliographic information from scientific publications, and usually also apply some kind of indexing system by adding keywords or classification codes to the articles. In 54 parallel with the growth of science, the number of scientific publications has been doubling about every 15 years in the past 300 years (Tijssen, 1992). To keep up with this pace, new ways of cataloguing and indexing the literature and making it retrievable have been developed. Probably the most important database for bibliometric analyses is the Science Citation Index (SCI), produced by the Institute for Scientific Information (ISI) in Philadelphia, USA. Several studies have argued that at the moment the SCI is the most suitable source for detailed international bibliometric analyses on macro level. The ‘inventor’ of the SCI is Eugene Garfield. The Science Citation Index is a multi-disciplinary database covering more than 5700 major journals across 164 scientific disciplines in the natural and life sciences. The SCI limits the scope of coverage to world-class scientific journals, representing the ‘core’ scientific output in specific fields and eliminating research that not presented in the ‘mainstream’. Unlike most other databases that cover at most one discipline − e.g. Chemical Abstracts, that covers the field of (bio) chemistry, or INSPEC, a world-wide database on Physics, Electronics and Computing − the SCI is a multi-disciplinary database that claims to cover the most important journals in the natural and life sciences. The subject coverage has been expanded from the initial SCI to include the Social Sciences Citation Index (SSCI) and the Arts and Humanities Citation Index (A&HCI). The SCI Expanded provides access to the SCI from 1973 forward and allows for searching full-length, Englishlanguage author abstracts for 70% of the articles in the database. Some of the disciplines covered by the SCI Expanded include agriculture, astronomy, biochemistry, biology, biotechnology, chemistry, computer science, materials science, mathematics, medicine, neuroscience, oncology, paediatrics, pharmacology, physics, plant sciences, psychiatry, surgery, veterinary science and zoology. The Social Sciences Citation Index is also a multi-disciplinary database covering the journal literature of the social sciences. It covers more than 1725 journals across 50 disciplines, as well as individually selected, relevant items from over 3300 of the world's leading scientific and technical journals. Some of the disciplines included in the Social Sciences Citation Index are anthropology, history, industrial relations, information science and library science, law, linguistics, philosophy, psychology, psychiatry, political science, public health, social issues, social work, sociology, substance abuse, urban studies and women's studies. The Arts & Humanities Citation Index is the third multi-disciplinary database indexing 1144 of the world's leading journals in the fields of arts and humanities, as well as a selection of relevant items from over 6800 major science and social science journals. Some of the disciplines covered by the Arts & Humanities Citation Index include archaeology, architecture, art, Asian studies, classics, dance, folklore, history, language, linguistics, literary reviews, literature, music, philosophy, poetry, radio, television and film, religion, and theatre. There are two important sections to this Science Citation Index: the Source Index and the Citation Index. The Source Index contains bibliographic information, by author, of all articles published in the journals or books processed for the SCI. Bibliographic information enclosed includes all co-authors, the full title of the publication, the journal title, volume, issue, page, year, type of item and number of references in the bibliography (Egghe & Rousseau, 1990). An important aspect is the classification system of the SCI covered journals. Journals and not the individual publications are classified in so-called ‘subject categories’ (over 160). This classification is based on the content and citation relations between the journals. If necessary the classification is being 55 altered yearly. The subject categories are partly comparable with scientific sub-disciplines. Between the journals and the subject categories there is no one-to-one relationship. This means that a journal can be assigned to more that one subject category. Luwel (1999a) grouped the categories in scientific disciplines based on a classification scheme used in ‘Science et Technologie 1994, Rapport de l’Observatoire des Sciences et de Techniques’. Of all scientific literature databases, the Science Citation Index probably is used most frequently in bibliometric analyses. However, other scientific literature databases exist. We already mentioned some field-specific databases as Chemical Abstracts and INSPEC. Other databases are Compendex, a specialist engineering and technology database, Embase, a specialist physical science database and Pascal, also a general database covering several fields. Furthermore, a number of bibliographic databases, varying in scope and size, are available via host-computers of vendor agencies such as Dialog, DIMDI, ESA and STN. These databases provide easy access to an immense body of readily available catalogued bibliographic data on scientific research. As a consequence, a quantitative assessment of scientific performance can be performed in a relatively easy and timesaving way. However, a number of limitations have to be kept in mind. First of all there are the limitations of each database for compiling bibliometric indicators. There is a suitable database for each study. Each database has it own characteristics like structure, coverage, reliability, classification etc. Secondly, when working with citations to publications, we have to be aware of the trivialities around citations. Examples are ‘self-citations’, ‘negative citations’, ‘language factors’ etc. A final remark around bibliometric databases is that beside the advantages that commercial databases provide in retrieval and use of information, there are also several disadvantages that occur with the commercial exploitation of this kind of data. According to Moed (1989), and also suggested by Glänzel & Schoepflin (1992), the dominance of commercial firms in the computerised bibliometric analysis has been one of the impediments for progress in the field of bibliometrics. First of all, expertise was often kept ‘in house’, and often few details were given on the methods that were used. Second, analyses performed by these firms were often very expensive. Moreover, due to the specific licence agreements of Computer Horizons Incorporated (CHI) and CRP with ISI-Thomson Scientific, these firms are only allowed to present ‘final results’ to their customers. The data underlying these ‘final results’ could not/and cannot be disclosed. III. Limitations to the use of bibliometric data As already mentioned before, the concept of reliability may be defined as the extent to which the results are independent of its technical effectuation (Moed, 1989). This section deals with the specific problems that exist with respect to collecting, handling and interpreting bibliometric data. Roughly three main issues related to reliability of bibliometric indicators can be distinguished (issues around citation analysis will be discussed in the next paragraph): (1) Completeness of bibliometric data; (2) Coverage of scientific literature databases; (3) The problem of statistics; Each of these issues will be discussed in more detail in the following sections. 56 III.1 Completeness of bibliometric data The first problem relates to the completeness of the data. Several researchers (among which Moed et al., 1985) mention that it is very difficult to obtain complete publication and citation data. Almost always, several omissions are detected in the ‘raw’ data, and have to be completed by manual checking. Moed et al. (1985) mention that the number of missing data can easily amount to about 10 percent of the total number of data involved. Especially if the units of analysis are quite small by nature, e.g. when the research group is taken as a unit of analysis, this can pose a problem. Let us take research groups as an example. Research groups, containing on average two to ten researchers, normally produce a relative low number of publications and citations. Consequently, bibliometric indicators are constructed, based on small numbers, and obviously even small errors or a small number of omissions can lead to dramatic differences in results and interpretations. Moreover, the distribution of citations among articles published by a research group is found to be highly skewed. Consequently, omission of even one publication can have great implications if the omitted publication happens to be a highly cited one. Of course, if like in our study a higher level of aggregation is used, thus operating on larger numbers, completeness becomes less a problem. This is, for example, the case when large departments, clusters of departments, (sub) faculties, or entire countries or regions are taken as units of analysis. III.2. Coverage of scientific literature databases The second reliability issue is related to the coverage of scientific literature databases. Let us take a closer look at the Science Citation Index in particular. The question is whether this database covers the scientific literature in a sufficient way, both as to the different (sub-) fields (of natural and life sciences) and to all countries. As can be seen, this question relates to two equally important aspects: (1) coverage of all countries, and (2) coverage of all (sub-) fields of science. As for the coverage of the scientific literature of the various countries, it is found that the ISI-databases (the SCI, SSCI and A&HCI) show a bias towards publications from the USA (Egghe & Rousseau, 1990). Moreover, significant differences appear to exist between the SCI and other sources of national coverage of fields with a more dispersed literature. This seems to be especially so when it concerns journals from countries with non-Roman alphabets, such as Russia and Japan (Moed, 1989). Problems of inadequate coverage appear to arise particularly in scientific fields in which researchers do not publish primarily in a limited number of international oriented, and consequently mainly Englishlanguage, top journals, but rather in a wide range of journals with a national scope, which may be nonetheless of high quality. The latter will often be non English-language journals. Moed et al. (1995) found that such journals from non English-speaking countries indeed are less likely to be included in the ISI-databases. This obviously points to a problem of a possible Anglo-Saxon (mainly US) bias. Nevertheless, in a recent study by Luwel (1999a) about the possible US-bias of the SCI, it appeared that for the disciplines covered by the SCI the average fraction per journal of publications from the EU and the US is roughly equal. Furthermore, for the basic and applied sciences, the SCI seems to cover equally well the US and the EU research works and as such ‘no US-bias’ was observed. 57 As for the coverage of the various (sub-) fields within the Natural and Life sciences, the question is what the coverage is of the SCI relative to what researchers list in their research reports, annual surveys and so on (Moed, 1989). In order to obtain an insight into the appropriateness of SCI source journals and SCI source books as an adequate tool for bibliometric output and impact analysis, Moed et al. (1985) studied the publications for all research groups from the faculties of Medicine, Mathematics and Natural Sciences at the University of Leiden. They determined the percentage of total publications that was covered in the Science Citation Index. Summarising, the coverage for chemistry, physics and astronomy, and pharmacy/pharmacology appeared to be sufficiently high (more than 70%). However, for biology and in particular for mathematics, this percentage was found to be lower (54% and 26% respectively). The lack of coverage in these cases is caused by two factors: (1) the output in Dutch is not covered by the SCI source journals or books, and (2) articles and research reports (in English) that are published in non-SCI journals and media such as books, reports, proceedings, and so on, are of course not covered at all. Moreover, Moed (1989) mentions that several interviewed scientists from the fields of biology and mathematics judged the SCI coverage as inadequate, and that it does not constitute a representative sample of the research output of groups from these fields. Moed et al. (1985) conclude that, possibly, the role of the scientific journal literature for communicating research findings may differ from discipline to discipline; a point we will take up in the next section as well. Finally, another disturbing factor relates to trend analysis over the years based on SCI data. The problem relates to the large changes in coverage of the SCI database during the time period considered (Moed, 1989). Indeed, when using the SCI or another ISI database, one has to take into account that considerable disturbances can occur resulting from the fact that the ISI has included more source journals during the 1980s and that, as from 1977, non-journal material, such as books, are also included (Moed et al., 1985). Consequently, one should be very careful in interpreting trends in the numerical values of bibliometric indicators, when trying to relate them to actual performance of e.g. a research group. III.3. The problem of ‘statistics’ This problem, identified by Moed et al. (1985), pertains to the question when a given difference between two bibliometric scores should be considered as ‘significant’ with a certain probability, and when it should be ascribed to mere chance. This topic has been discussed frequently. It has been proposed to consider citing as a stochastic process, in which publications are assumed to attract citations due to their impact, but also due to many accidental factors. However, small effects in trends or small differences between bibliometric scores should be interpreted with great care (Moed et al., 1985). 58 IV. Citation analysis IV.1. Introduction Citation analysis constitutes an important device in the quantitative study of science and technology. The primary objective of the scientific community is the generation, processing, diffusion and utilisation of scientific knowledge. Science as a social and cognitive system of knowledge is characterised by a set of widely accepted and utilised norms, one of which is the practice of acknowledging the various sources of information which have been consulted and/or used in a scientific research activity (Tijssen, 1992). The latter becomes manifest through the list of references. In this list of references, authors of scientific publications refer to earlier publications of which the author has made use in doing his research activity or that support, provide precedent for, illustrate, or elaborate on what the author has to say (Garfield, 1979). The author should indicate a used publication uniquely so that retrieval of the original source is possible (Moed & Vriens, 1989). The terms ‘reference’ and ‘citation’ are used interchangeably, in the strict sense the first is the acknowledgement that a document gives to another, whereas the latter is the acknowledgement that a document receives from another (respectively ‘citing’ and ‘cited’). Citation analysis is the area of bibliometrics that deals with the study of the relationships between citing and cited. Citations may be considered a measure of the impact of the articles cited, as well of their timeliness and utility. It is presumed that a paper must have a certain quality in order to have an impact on the scientific community, a presumption that can be disputed. Most criticism pertains to the problem of the unknown motivations of the citing author (Egghe & Rousseau, 1990). Citations do represent a relation between the cited and the citing publication. However, the precise nature of this relationship is often somewhat unclear. Various motivations for citing are identified by Garfield (Smith, 1981). The most important will presented below and discussed in the next sections. 1. Paying homage to pioneers; 2. Giving credit for related work (homage to peers); 3. Providing background reading; 4. Criticising and/or correcting previous work; 5. Identifying original publications in which an idea or concept was discussed; 6. Disclaiming work, ideas or claims of others (negative claims and homage). It is very likely that norms for citing vary from discipline to discipline. However, one also has to take into account that, just as there are a number of reasons why citations are enclosed, there may on the other hand also be a number of reasons why a particular author has not established a link to another document. In other words, many documents that should have been cited are omitted, while many other documents that the author does cite are only slightly relevant (Smith, 1981). Over half (55%) of the articles published in the scientific journals covered by the SCI are not cited a single time in the five years following their publication (OECD, 1997). Nevertheless, citations are objective measures that can be gathered without the co-operation of any respondent (Smith, 1981). The practice of citing the work of other authors has led to the construction of so-called ‘citation indexes’. Citation indexing is based on the idea that an author's references to previous publications identify much of the earlier work relevant to the subject of his own article (Egghe & Rousseau, 1990). Citation indexing, of which the Science Citation Index (SCI) is by far the most important example, 59 offered an additional way for retrieving relevant publications by examining the citation linkages drawn by authors in their publications (Garfield, 1979). IV.2. The use of citations as ‘information’ The main subject of this section is how to use citations as items of information. The easiest way to use citations is by citation counts. In this approach the number of citations is counted that have been received by a given document or set of documents over a certain period of time from a particular set of citing documents (Smith, 1981). If the unit of analysis is on the level of single articles appearing in a particular journal, calculating e.g. the ‘impact’ factor of a publication (cf. infra) can refine this measure. However, over the last three decades, new techniques and measures using citations were introduced and new tools exploited. Two heavily used citation-based techniques devised to identify documents likely to be closely related are bibliographic coupling and co-citation analysis. Two documents are said to be bibliographically coupled if their reference lists contain one or more of the same cited documents (Smith, 1981). Their strength of bibliographic coupling depends on the number of references they have in common (Egghe & Rousseau, 1990). However, bibliographic coupling may not be a valid measure of relatedness between two publications. Just because two papers have a certain reference in common does not guarantee that they are both referring to the same piece of information in the cited article. In recent times, co-citation analysis has become more popular. Two documents are said to be co-cited when they are jointly cited in one or more subsequently published documents (Smith, 1981). Again the strength of the relationship is defined as the frequency with which the two documents are cited together (Egghe & Rousseau, 1990). Thus, while bibliographic coupling focuses on groups of papers that cite a common source document (later documents become linked because they cite the same earlier documents), co-citation analysis groups papers that frequently are cited in pairs (documents are later cited together). Since co-citation analysis is of particular interest in the domain of the mapping of scientific specialities, this technique will be dealt with in more detail in the section on mapping of science. IV.3. Basic assumption and problems around citation analysis IV.3.1 A general overview In this section we shall discuss a number of basic assumptions underlying citation analysis, and a number of problems or pitfalls around citation data. Before going into more detail on these aspects in the next sections we first present a general overview in the table below. 60 Table 2 – General overview of assumptions and problems/pitfalls related to citation analysis Assumptions underlying ‘’Citation Analysis (Smith, 1981) (a) Citation of a document implies use of that document by the citing author (b) Citation of a document reflects the merit, in terms of quality, significance or impact, of that document (c) Citations are made to the best possible works (d) A cited document is related in content to the citing document (e) All citations are equal Possible problems/pitfalls related to ‘Citation Analysis’ (Smith, 1981) (a) The problem of multiple authorship (co-authorship) (b) The problem of self-citations (c) The problem of unique identification of authors (d) Implicit citations (e) Fluctuations in time (f) Variations among (sub-)fields (g) Errors IV.3.2. Detailed discussion: basic assumptions underlying citation analysis (a) Citation of a document implies use of that document by the citing author Smith (1981) indicates that this first assumption actually consists of two parts: (a) the assumption that the author refers to all, or at least to the most important, documents used in the preparation of his work; and (b) the assumption that all documents that are listed in the reference list reflect actual usage, i.e. a reference to a particular document is only made if that document actually contributed to his work. If these assumptions do not appear to hold, certain documents will consequently be underrated since not all sources used were cited, while other documents will be overrated since not all sources cited were actually used. (b) Citation of a document reflects the merit, in terms of quality, significance or impact, of that document There is a high positive correlation between the number of citations a particular document receives and the quality of that document. However, we can already mention here that the use of citation counts as quality indicators have generated a great deal of discussion. For now, we can already mention that this issue was investigated in several studies and that results produced support for this assumption most of the times, although results of citation counts should always be compared with alternative quality indicators, such as for example expert opinions. This is also a remark we made previously when discussing that assessment should be based both on qualitative and quantitative indicators. (c) Citations are made to the best possible works Only if one assumes that authors, in describing their research efforts, sift through all possible documents that could be cited in order to select only those references judged best, this assumption can be expected to hold. However, studies of science information use have suggested that accessibility probably may be as important as quality as a factor in the selection of an information source. In short, an article might well have been cited because it happened to be on the writer’s desk rather than because it was the ideal paper to cite. 61 Accessibility of a document, Smith (1981) argues, is a function of its form, place of origin, age and language. However, just like a paper may be more or less accessible, a researcher may be more or less visible or even famous. It may well be that anything that enhances a researcher’s visibility is likely to increase his citation rate, irrespective of the intrinsic quality of his work. (d) A cited document is related in content to the citing document The use of citation indexes to retrieve relevant documents supports this assumption. The results of an experiment by Barlup, mentioned in Smith (1981), provide additional support for this statement. Authors, asked to assess the degree of relatedness of citations to their own work, judged 72% of the citations to be definitely related, and only 5% to be definitely not related. However, when it is extended to other uses of citation measures, this assumption becomes more problematic. In the case of bibliographic coupling, the assumption does not hold, since it is not sure that two documents citing a third are citing the same unit of information in it. Consequently, Smith (1981) argues that bibliographic coupling merely indicates the existence of the probability (possibly zero) of a relationship in the content of two documents. Analogously, the fact that two papers are cocited is no guarantee that a relationship exists in the content of both papers. (e) All citations are equal Studies using citation counts generally assume that all citations (with the possible exception of selfcitations) can be weighted equally. However, many researchers have looked for ways to refine citation analysis in the sense that all citations to a particular article would not be treated necessarily as equivalent. Smith (1981) distinguishes between two types of refinements: mechanical (e.g. looking for multiple occurrence in an article) and intellectual (content analysis). The differences between the two categories being that mechanical refinement requires no judgement or inference, whereas intellectual refinements do need human analysis and are, consequently, rather time consuming. IV.3.3. Possible problems/pitfalls related to ‘Citation Analysis’ Besides the rather subjective assumption underlying citation analysis, one must also be aware of problems associated with the sources of citation data or with the use of secondary sources, such as citation indexes. A range of problems is associated with the use of citation data. These problems are often very down-to-earth and rather technical in nature; still they can have a big impact on the results, if one fails to take them into account. A brief overview of the various pitfalls and problem issues shall be presented. Once again we base ourselves on the excellent overview given by Smith (1981). (a) The problem of multiple authorship (co-authorship) Several authors publish articles of which they are only co-author rather than the first author. This poses a problem because the ISI citation indexes list the cited references only by first author. As a consequence, by only counting citations found by looking in the citation index under the name of a particular author, one misses all citations to publications where he was a co-author rather than a first author. Studies on how reliable ‘first-author’ citations are in predicting ‘all-author’ citations show that the two measures are correlated highly, but that nevertheless large discrepancies arise between the authors in the top of the two rankings (Moed, 1989). 62 In the case of multi-authored works, there is also the problem of allocating credit to the various authors. In other words, which weighting procedure should be applied in allocating credits to authors of multiauthored works? A similar problem is encountered when dealing with patent: how do we count and attribute a patent to the different inventor or applicant countries. In general, there are three ways to deal with this problem: (1) straight count; (2) normal count; and (3) adjusted count (Lindsey, 1980; Egghe & Rousseau, 1990). In the case of straight counts, multi-authored works are treated the same as single-authored works, in the sense that the first author receives all the credit. This is by far the simplest approach and, in addition, it greatly reduces the work required to collect the data. Note that the use of straight counts can be considered as a sampling strategy that, as such, should be examined in terms of its representativeness. In this rationale, when using the straight count procedure, it is assumed that the set of publications in which an author's name occurs first constitutes a representative sample of all of this author's publications. It is concluded that no strong empirical evidence, nor theoretical rationale is found to support the assumption underlying the use of straight counts (Egghe & Rousseau, 1990). By using normal counts the problem of distributing credit for multi-authored publications is solved by giving full credit to all contributors. The problem with this approach, however, is that it tends to inflate the publication or citation scores of those researchers who produce many multi-authored papers. Moreover, problems can occur due to the fact that, in the normal count approach, the sum of the number of publications of all authors becomes larger than the actual number of papers under study. For scientists and politicians who use science indicators, normal counting is far more comprehensible and easy to interpret? ‘A share of 10% of country X means in this sense that 10 out of every 100 papers in the world have at least one contributor from country X’. Furthermore, it seems that the ‘meaning’ of a 10% share in the adjusted count (or fraction scale) is far more difficult to explain (Braun et al., 1991). It is safe to say that the best way to handle the problem of multi-authorship is by assigning credit proportionally. In the case of adjusted counts, every co-author is assigned a fraction of the authorship. Thus, if two authors write an article, each would be assigned half a credit; in the case of three authors, each would get a third; and so on. (b) The problem of self-citations As indicated by Egghe & Rousseau (1990), there are very few articles that do not include any selfcitation. In addition, authors are inclined to refer to their own work to a greater extent than to the work of any other single author. The problem with the inclusion of self-citations into citation analysis is that it, obviously, tends to inflate citation scores for those researchers who cite their own work abundantly. As a result, in almost all studies, self-citations are excluded. This can easily be achieved for singleauthor papers. When citations are used for science policy purposes or if a particular research group or research institute is taken as the unit of analysis, self-citations are often interpreted as ‘in-house citations’. The latter comprise citations by members of a research group or a research institution to the work of their own group or institution (e.g. Nederhof & Van Raan, 1993). In this case information has to be collected to identify all members of the research group or institution under study. (c) The problem of unique identification of authors This problem pertains both to the issue of ‘homographs’, resulting in failure to differentiate between researchers with the same name, and ‘synonyms’, resulting in failure to group references to an author's work due to the lack of a standard form for the author's name. Because it is possible that more than one researcher with the same name and initials (a case of homographs’) is publishing in the same field, 63 incorrect attributions of citations to authors can be the result. To differentiate between them, additional information on the authors is needed, such as for example their institutional affiliation. Another problem indicated by Smith (1981) pertains to the so-called ‘synonyms’, which are not recognised as such. She mentions that, unless some standard form for an author's name is established, citations to the work of many authors will be scattered. Examples of ‘synonyms’ in the context of citation indexes include an author's name with a variable number of initials, a woman's maiden and married names, different transliterations of foreign names (especially in the case of Russian, Chinese or Japanese names), and misspellings. Also relevant for our study are the synonyms or misspellings for one and the same journal title. Luwel (1999a) signals and studies these problems extensively. For example in the SCI, in the so-called target articles, journal titles are written fully, whereas in the cited references, abbreviated journal titles are used. This problem influences the possible matching approach, a significant step in the ‘linkage’ methodology (for more details see part III). In addition to variations in titles, or their abbreviated forms, journals also merge, split into new ones, change titles and are translated (Egghe & Rousseau, 1990). (d) Implicit citations Because most citation indexes include only explicit citations, these are the only ones considered in citation analyses. The one exception is the Arts and Humanities Citation Index (A&HCI), which also includes implicit citations when a paper refers to, though not by means of an explicit citation, and substantially discusses a scientific work. Smith (1981) mentions that implicit citations often also occur in the form of eponyms (an author's name plus an idea or concept associated with this person; e.g. Murphy's law). Furthermore, once an idea becomes widely accepted, a citation to the original source is often omitted. (e) Fluctuations in time Smith (1981) indicates that large variations in citation counts may occur from one year to another. Therefore, she recommends that citation data should not be too restricted in time. This also makes clear that seemingly large variations, often in relatively small numbers, should be interpreted with care. In addition, Moed et al. (1985) mention that, within fields, citation practices can change in time. The results of their analysis of the SCI database between 1970 and 1980, for example, show that, on average, a journal article contains an increasing number of citations to 0 to 2 year-old articles during the decade. (f) Variations among (sub-) fields This problem relates to differences both in publication practices and in citation practices among (sub-) fields or science disciplines. We already mentioned before that the scientific journal literature might play a different role in communicating results of research activities among different (sub-) fields. As a consequence, also citation counts strongly depend on the internal characteristics of the research fields. For example, it is found that the average chemistry or physics article contains about twenty references, while a mathematical article contains, on average, less than ten. This of course leads to difficulties in cross-discipline comparisons (Egghe & Rousseau, 1990). Furthermore, the coverage of a publication in terms of how many people study the publications in that area, is also relevant for the number of citations these publication will get. In short, due to differences in publication and citation practices between different scientific (sub-) fields, direct comparison of, for example, research performance of groups from different disciplines is inappropriate. 64 (g) Errors It goes without saying that citation analyses, whether they are based on citation indexes or not, can be no more accurate than the raw material used. A general problem with the use of citations is that authors, in their list of references, give bibliographic descriptions that do not exactly correspond to the bibliographic data of the article they intend to cite. Errors can include discrepancies in cited author names, journal title, page, volume, and year. As for the use of citation indexes, one can assume that, by processing citations for inclusion in citation indexes, additional errors are introduced, while others are eliminated. Incorrect citing of sources appears to be far from uncommon. In their data set taken from SCISEARCH, Moed & Vriens (1989) find that the number of citations showing a discrepancy in at least one data field (full author name, journal title, publication year, starting page number and volume number), relative to the number of citations not showing any discrepancy, amounts to 9.4%. This means that for every 100 citations showing no discrepancy, 10 citations show a variation or error in at least one data field. They also find that the number of citations not showing any discrepancy constitutes 91% of the total number of citations, while citations with discrepancies in only one data field account for almost 5%. Moreover, their results also suggest that copying references from other articles may be a cause of the observed multiplication of errors in cited references. V. Use of bibliometric data in a science policy context Bibliometric data is by no means exclusively used in science policy. Various disciplines make use of publication and citation data for different purposes. Indeed, ‘bibliometric information may serve different masters simultaneously’ (Tijssen, 1992, p. 14). Bibliometrics provides an account for publishing activities at the level of countries, provinces, cities or institutions, and is used for comparative analysis of productivity. Bibliometric data are also used as a benchmark for monitoring of science and technology, since longitudinal studies of scientific output help identify areas of research that are developing or regressing. This last aspect is of particular interest for out study. Although different users will have different reasons to make use of such quantitative information on science, they all rely upon numerical data on research publications to obtain a more objective assessment of different aspects of science and its participants. As an illustration of different uses by different groups of people and for different purposes, we see that, for example, information scientists focus on publications in the context of information retrieval systems, while sociologists of science use them to study the professional behaviour of scientists. Scientists, on the other hand, rely on bibliometric data for measuring individual (or group) scientific performance. In science policy, bibliometric data is mainly used in the light of accountability and justification of research funding allocations, on the one hand, and comparison of scientific input and output, on the other (Tijssen, 1992). Bibliometric data are not always the most suitable way for evaluating scientific performances. Bain and Rinhaldini (1989) pointed out that, due to the nature of bibliometric evaluation, a rather competitive model of scientific activity (number of publications, number of received citations), bibliometric indicators are wholly misleading when applied to the evaluation of co-operative research. 65 Nevertheless, by means of bibliometric data, such as publications and citations, a company or a research institute can address questions, such as: § What countries and/or organisations are most active in a specific (sub-) field? § How did patterns of activity change over time? § What is the impact of a particular research group or institute's work? § Which organisations were involved in the crucial breakthrough(s) in a particular (sub-) field? (Hicks, Martin & Irvine, 1986). Within the field of bibliometrics, we can differentiate between ‘bibliometric research performance analysis’ (also referred to as evaluative bibliometrics) and ‘mapping of science’, that is the analysis of the more structural features of science (Engelsman & Van Raan, 1991). Another frequently used distinction is the one between ‘descriptive indicators’ and ‘relational indicators’. The primer is of a more quantitative nature whereas the latter implies a more qualitative way of analysis by looking into the interactions and relations between science and technology. Furthermore, methodologically we can distinguish between one-dimensional and two-dimensional techniques. The performance indicators are mainly constructed with one-dimensional techniques usually based on ‘direct’ counts of elements such as publications, patents and citations. The relational indicators, and specifically ‘mapping’, are based on two-dimensional techniques constructed from co-occurrences of specific information elements (Engelsman & Van Raan, 1991). In ‘bibliometric research performance analysis’ publication output and received citations (volume and impact) can be used to assess the research performance of various actors at various levels. This may be an author, an institution (Noyons et al., 1999; Martin & Irvine, 1983), a sector of activity covering several institutions, or even a geographic area like a country or a region (European Commission, 1997; Tijssen & Van Wijk, 1999; Katz & Hicks, 1996). The use of bibliometric data for the assessment of research performance will be dealt with extensively in the next section. The second stream of research relevant to science policy-makers is concerned with the ‘mapping’ of scientific (sub-) fields. Mapping is a way to monitor the production of researcher in a particular S&T field. Maps of science can be created with different techniques mainly based on co-occurrences of information elements (cf. infra). The aim of mapping science is mainly concerned with understanding both the structure and the evolution of scientific (sub-) fields displaying both the structural and dynamic aspects of science and technology (Korevaar & Van Raan, 1992). Mapping of science will be discussed in section 8.6.6. VI. Bibliometric performance indicators VI.1. Indicators of publication output and productivity Output indicators are essentially based on the numbers of articles published by the unit of analysis under study − e.g. a research group, research institute, or a country − in the international scientific literature (Moed et al. 1985). Numbers of scientific publications, indeed, can be considered to be a reasonable measure of scientific production (output) − i.e. the extent to which a consumption of the inputs to research creates a body of scientific results (Martin, 1996). According to the OECD (1997) the benefits of science – or outputs – encompass the carriers of new knowledge (patents, publications, application etc.). However, counts of publications appear to be a much less adequate indicator of 66 contributions to scientific progress − i.e. the extent to which scientific activity results in substantive contributions to scientific knowledge (ibid., 1996). As we discussed above, most publications only contribute to the addition of knowledge in a rather modest way, while only very few make a major contribution. The mere counting of publications, however, does not capture this. Therefore, publication counts can only be considered, at best, as a partial indicator of contributions to knowledge since, aside from reflecting the level of scientific progress made by an individual or a group, they also reflect a number of other factors not related to scientific progress, such as publication practices of the institution, use of publications to obtain grants etc. As a solution, citations are used to account for the diversity in contributions made by various publications. This practice will be discussed in more detail in the section on impact indicators. To summarise, output can be measured in a satisfactory way by means of publication counts. However, productivity of a research group, a field, or an entire country is often obtained by normalising the output with various input indicators, in order to allow for differing size of research activity (cfr. chapter 7). Productivity measures thus can be seen as size-adjusted output indicators. In the evaluation of research groups, productivity is often defined as the number of publications per full-time equivalent spent on research. Other productivity measures are publications per person or per unit of funding. Just comparing the technological performance of countries by using patents normalisation is necessary for creating an ‘equal base’ of comparison, comparing the scientific output should also be normalised in order to correct for size differences (see for example Luwel, 1999b; European Commission, 1997). VI.2. Impact indicators VI.2.1. The relationship between ‘quality’ and ‘impact’ As already mentioned in the previous section, citations are used to take into account the different contributions to knowledge of different publications. Correspondingly, we can introduce the notion of ‘quality’ of scientific research, a concept virtually impossible to operationalise, since it may refer to a variety of values. With regard to scientific research, Moed et al. (1985) distinguish between three aspects of quality. Cognitive quality relates to the importance of the specific content of scientific ideas. Methodological quality, on the other hand, is concerned with the accuracy with which methods and techniques are applied. Finally, aesthetic quality is related to the attractiveness of mathematical formulations, models, and so on. Needless to say that the assessment of especially this last type of quality can be seen as a highly subjective enterprise. Moed et al. (1985) consider these three aspects of quality to be related to, what they call, ‘basic quality’. Since an assessment of such ‘basic quality’ is based on criteria intrinsic to scientific research, only peers can make a truthful judgement as for the basic quality of research projects. Next to this they also recognise a ‘scientific quality’. Scientific quality necessarily comprises basic quality but, in addition, it also relates to the extent in which researchers are active in presenting their research findings to colleague researchers. As such, only if the ‘scientific quality’ of a particular research programme is sufficiently high - that is, if the research is of a certain basic quality, and if colleague researchers are made sufficiently aware, it becomes possible that this research does have a significant impact. In this respect, Martin (1996) makes a distinction between quality, importance and impact of publications. He defines ‘quality’ as follows: 67 ‘ It describes how well the research has been done, whether it is free from obvious 'error', how aesthetically appealing the mathematical formulations are, how original the conclusions are, and so on.’ (Martin, 1996, p. 348) The ‘importance’ of a publication, then, is seen as: ‘ … its potential influence on surrounding research activities − that is, the influence on the advance of scientific knowledge it would have if there were perfect communication in science.’ (Martin, 1996, p. 349) Finally, the ‘impact’ of a publication is described as: ‘ … its actual influence on surrounding research activities at a given time. While this will depend partly on its importance, it may also be affected by such factors as the location of the author, and the prestige, language and availability of the publishing journal. ‘ (Martin, 1996, p.349) If we come back to the notion of contribution to scientific knowledge, it is clear that, of these three concepts, it is the third, scientific ‘impact’ that is the relevant one. As such, citation counts should be regarded more as an indicator of impact, rather than of quality or importance (Martin, 1996). VI.2.2. Short-term vs. long-term impact If it is assumed that scientific publications in a particular field are reflections of what happens at the research front in that field, then insight into a particular group's impact at the research front can be gained by looking at the number of times a research group's publications are cited. Moed et al. (1985) distinguish between ‘long-term impact’ and ‘short-term impact’. By looking at the long-term impact of a research group's work, one can make inferences on the degree to which a group has made a more permanent contribution to scientific advance. Short-term impact, on the other hand, relates to the impact of researchers' work only a few years after the publication of research results. As such, short-term impact, operationalised by means of citation counts, is mainly concerned with factors such as the extent to which a research group forms part of the research community, and the extent to which research results are widely known by colleague researchers. Consequently, short-term impact should be related more to the visibility of research groups (how famous are they), rather than with the extent to which a group has made a permanent contribution to the field, which is more related to the long-term impact. The latter is the more permanent contribution to scientific knowledge. Of course, a high short-term impact will not guarantee a high long-term impact, since many theories will be rejected and, as a result, will not acquire any long-term impact. However, if a group's work succeeds to acquire a high short-term impact during a relatively long period of time, older publications of that group may have a high long-term impact as well (Moed et al., 1985). As for the relevance of short-term and long-term impact for scientific research evaluation purposes, it seems plausible that research groups can hardly be required to produce work with a high long-term 68 impact. However, they can be required to take part in scientific discussions and engage in activities at the research front of their field. As such, indicators of short-term impact, rather than of long-term impact, are of primary importance, when it comes to the evaluation of research performance of research groups, but also of research institutes or entire nations (Moed et al., 1985). VI.2.3. Actual vs. expected impact When assessing research performance of different research groups, institutes or countries, some kind of ‘benchmark’ is needed in order to give those absolute citation counts a meaning. Direct one-to-one comparison between different research groups or countries is not possible. For example, as pointed out by Narin & Hamilton (1996), a high-quality paper in a lightly citing field may receive fewer citations than an average-quality paper in a heavily citing field. As such, it should be clear that citation counts obtained in one of the lightly citing fields might not be directly compared to ones obtained in highly citing fields. Therefore, data should be properly normalised and adjusted for field and sub-field differences (Narin & Hamilton, 1996). For this purpose, a comparison is often made between the actual versus expected impact of a research group's work (e.g. Nederhof & Van Raan, 1993; Moed et al., 1985; Moed et al., 1995; Noyons et al., 1998; European Commission, 1997). One type of normalisation often employed is to adjust citation counts to the impact of the journal in which the paper was published. In this approach, citation counts to a research group's publications are compared with the ‘expected’ number of citations; the latter being the average number of citation counts to all publications in the journals in which the group has published (Moed et al., 1985). This procedure, thus, entails counting the average number of citations a particular set of publications receives in, for example, the third year after publication (often the year in which a publication is cited most frequently), and comparing this to the average number of citations of an article in the same journal (Nederhof & Van Raan, 1993). In order to do this, first of all, for every journal a Journal Citation Score (JCS) is calculated (Moed et al., 1985). This JCS represents the number of times, on average, a typical article, published in a particular journal, is cited by other journal articles in the third year after its publication, often the year in which a publication is cited most frequently. The JCS can be calculated in the following way (an example of the calculation is presented in the textbox below). Total number of times articles, published during year t in journal A, are cited in year t+3 by other journal articles JCSA = The number of articles published during year t in journal A By comparing the actual number of citations per publication with the expected number, operationalised by the JCSm value, an indication can be obtained on whether a group's articles are cited frequently or not, relative to the citation rate of all papers published in the same journal set. This approach, however, has one major drawback: the level or impact of the journals in which the research group publishes is not taken into account. To give an example, let us assume that research group X publishes in prestigious, high-impact journals, while group Y publishes in rather low-impact journals. Then, the citation rate of papers published by both groups may obtain equal results relative to the average citation rate of their respective journal sets. However, in this example, group X surely can be considered to have gained a 69 higher impact than group Y (Moed et al., 1995). As a result, it may be clear that this type of analysis can only be used as a partial indicator for the actual impact of a research group's work. Assume that 212 articles were published in Journal A during 1997, and they received 482 citations in 1999, then the JCS of Journal A would amount to 482/212 = 2.27. Now, a research group's articles are usually published in different journals with different JCS values. To account for this, a weighted-average JCS value (JCSm) of the journal set in which the group's articles are published is calculated, with weighting factors equal to the number of publications in the different journals. Now, assume that a particular research group X published 32 articles in 1997, of which 18 were published in journal A, with a JCS of 2.27, and 14 in journal B, with a JCS of 1.85. The weighted-average Journal Citation Score then equals: JCSm = (18 × 2.27) + (14 × 1.85) 18 + 14 = 2.09 This value represents the average number of citations received by the 32 articles published in Journal A and Journal B in the third year after they were published. It is this JCS indicator that is used by Moed et al. (1985) as an indicator of ‘expected impact’ based on the journal set in which a research group publishes. A much more appropriate way to normalise citation counts is on a sub-field basis (Moed et al., 1995; Narin & Hamilton, 1996). Instead of comparing the citation rate of papers, published by a group, to the average citation rate of articles published in the same journal set, the former is now compared with a world citation average for all papers published in the sub-fields in which the group is active. In other words, instead of using journal citation rates, now sub-field citation rates are used as a reference level (Moed et al., 1995). The calculation of this mean citation rate of the sub-fields in which a group is active (FCSm, the mean Field Citation Score) is done in exactly the same way as with the JCSm, only with journals replaced by sub-fields. The definition of fields (or sub-fields) is usually based on a classification of journals into categories developed by the ISI, although this classification is far from perfect (Moed et al., 1995). Fields and sub-fields, thus, are operationalised by means of journal categories. A consideration that applies to the use of the FCSm is whether the performance of multidisciplinary oriented groups can be expressed in an adequate way in one single index. VI.2.4. Overview of impact indicators By way of summary, we will give a brief overview of the various impact indicators that are usually calculated in order to make an assessment of the research performance of a particular research group (Moed et al., 1995), research institute or an entire country or region (Noyons et al., 1998). The various indicators are illustrated in the table below. 70 Table 3 – Overview of various impact indicators Description Indicator Number of publications in SCI P Total number of citations received Average number of citations per publication C CPP CPP excluding self-citations Percentage of self-citations CPPex %Self-Cits Average journal impact factor JCSm World field citation average FCSm Citations per publication, compared to journal impact factor CPP/JCSm Citations per publication, compared to world field citation average CPP/FCSm Journal impact factor, compared to world field citation average JCSm/FCSm Source: based on Noyons et al. (1998) and Moed et al. (1995) The first indicator is the total number of papers published by the research group, institute or country under study (P). The next two rows consist of respectively the total number of citations received (C), and the average number of citations per publication (CPP). The fourth indicator gives the average number of citations per publication if so-called self-citations are excluded (CPPex). Depending on the unit of analysis − research group, institute, or country − a different definition for self-citations are employed. Usually, also the percentage of self-citations relative to the total number of citations received is given (%Self-Cit). Next, the mean citation rate of the journal set in which is published is given (JCSm, the mean Journal Citation Score). In the seventh row, the mean citation rate of the (sub-) fields in which the research group, institute or country is active is presented (FCSm, the mean Field Citation Score). As already mentioned before, the definition of (sub-) fields is usually based on a classification of journals into journal categories, as developed by the ISI. The FCSm can then be considered to reflect a world average in a particular (sub-) field or combination of (sub-) fields. In the eighth and ninth row the average number of citations is normalised both by the journal mean citation scores (CPP/JCSm), and by the world average in the (sub-) fields (CPP/FCSm). If the ratio CPP/FCSm is above 1.0, the work of the unit under study received more citations than an ‘average’ publication in the (sub-) field(s) in which the group is active. In this way it becomes possible to obtain an indication of the international position of the unit under study, in terms of its impact compared to a world average. This ‘world’ average usually is calculated for the entire population of articles published in ISI journals assigned to a specific sub-field or journal category (Moed et al., 1995). Usually, also the ratio JCSm/FCSm is calculated. If the mean citation score of the set of journals in which the unit under study publishes exceeds the mean citation score of all papers published in the subfield(s) to which the journals belong, that is, if the ratio JCSm/FCSm is above 1.0, the conclusion is that the unit publishes in journals with a high impact, i.e. journals with an impact above the world average in the field. 71 VI.2.5. Graphic representation of impact analyses Although all relevant information is incorporated in the values of the various bibliometric indicators, a graphical presentation of the results often facilitates interpretation, especially with respect to comparative purposes. The graphical depiction most frequently used to summarise research results is one in which the average citation rate of the unit of analysis is compared to the world citation average in the sub-fields in which it is active (e.g. Van den Berghe et al., 1998; Luwel et al., 1999). For illustrative purposes a typical outcome of a study on the productivity and impact of a number of research departments is depicted in figure 6. Figure 6 − Graphical presentation of a productivity and impact analysis 2 1 I m p 0 a c t -1 -2 0 100 200 300 400 Number of publications Source: adapted from Van den Berghe et al. (1998) and Luwel et al. (1999) Each circle represents a research department. The total number of articles published in SCI journals is depicted on the horizontal axis. The vertical axis represents the log CPP/FCSm, which gives the impact of a department during a given period of time, compared to the world citation average in the (sub-) fields in which it is active. Circles above the horizontal reference line represent departments for which the impact (CPP) is higher than the world citation average in the (sub-) fields in which they are active (FCSm). The impact of a faculty's research activities is represented in such a way that the performance of its constituent departments is still visible. As such, this figure facilitates interpretation, since it constitutes a useful synthesis between an analysis at the micro-level, the departments, and at the meso-level the faculty (Van den Berghe et al., 1998; Luwel et al., 1999). A slightly different, and equally useful graphic representation of an impact analysis, illustrated in figure 7, is presented by Noyons et al. (1998) in their paper on the assessment of Flemish R&D in the field of information technology. The horizontal axis represents the world citation average (FCSm), while the vertical axis gives the average citations per publication (impact). As a result, if a sub-domain (depicted as a data point) is situated above the diagonal, its impact is above world average. 72 Figure 7 − Alternative graphical presentation of an impact analysis 8 Citations per Publication (Impact) 7 6 5 4 3 2 1 0 0 1 2 3 4 5 6 7 8 World Citation Average Source: Noyons et al. (1998) In addition, the size of the circles, which represent the data points, indicate the proportional numbers of papers published by the unit under study, in this case Flemish researchers, in each sub-field, relative to the total number of an actor's papers (here, again Flemish papers) in the whole field. Finally, different colours are used to indicate sub-domains with high versus low JCSm/FCSm ratios. Noyons et al. (1998), for example, used dark grey to indicate sub-domains with a JECS/FCSm > 1.2; light grey for subdomains with a JCSm/FCSm between 0.8 and 1.2; and white for sub-domains with JCSm/FCSm < 0.8. If for a particular sub-domain the ratio JECS/FCSm > 1.2 (indicated by dark grey), this means the impact of the journals used by the researchers is above the impact of the field. VI.3. Indicators of collaboration Another indicator that could shed some light on the research performance of, in particular, research groups is an indicator measuring the extent to which research groups engage in collaboration. On the basis of addresses of the authors, Moed et al. (1995) adapted an approach in which a group's papers were assigned into three different categories reflecting three different types of collaboration. Papers were assigned to the ‘no collaboration’ category, in the case they were single-authored or published by more than one author from the same research group. Remaining papers were assigned to the category ‘collaboration type within the Netherlands’, in case the co-author(s) participated from other groups within the Netherlands, or ‘collaboration type International’, when scientists from groups outside the Netherlands were involved. In an excellent study by Katz & Martin (1997) the whole concept of collaboration is being discussed extensively. They point out that collaboration can take various forms ranging from offering technical advice to active participation in the research. Based on a definition of collaboration they argue that collaboration should be distinguished from co-authorship that at a certain point may be one of the 73 results of collaboration. This leads to the major conclusion that co-authorship is a rather approximate partial indicator of collaboration and that it should be used with the necessary precaution. The purpose of the indicator, staying in the context of Moed et al. (1995), is to show how often a particular research group has co-published papers with other groups. Moreover it can give an indication on the impact of co-published articles resulting from national or international collaboration, compared to the impact of papers published by researchers from a single research group. As such, impact of publications is analysed as a function of the type of collaboration. Moed et al. (1995) find that all three ‘relative’ impact indicators − i.e. both the average impact compared to the impact of the journal set (CPP/JCSm) or to the world citation average in the sub-fields the group is active (CPP/FCSm), as well as the impact of the journal set relative to the world citation average (JCSm/FCSm) − point in the same direction. All three indicators show the highest values for publications resulting from international collaboration, and the lowest for publications in which no collaboration with other groups was involved. VI.4. Indicators of scientific activity To compare research performance between different countries the calculation of the Revealed Literature Advantage (RLA) is particularly popular. The construction of this RLA-index is exactly the same as for the Revealed Technological Advantage (RTA) discussed in the chapter on patents. The RLA is often being referred to as a specialisation index. For more details on this we refer to the discussion of the RTA in the previous chapter. The formula for calculating the RLA is shown below. åL RLA = åL åL Lij ij i ij j Lij ij ij = number of publications of country i in sub-domain j åL = number of publications of all countries in sub-domain j åL = number of publications of country i in the whole field åL = number of publications of all countries in the whole field ij i ij j ij ij This RLA-index indicates the country's share in the world's publication output in a particular sub-field j, compared to the country's share in the world's publication output in the whole field. A value higher than unity reflects a relative specialisation (in terms of publications) of country i in sub-field j. A value of one signifies a neutral position, while a value smaller than one indicates a relative disadvantage of country i in sub-field j. As such, the indicator reflects whether a particular country engages in a particular sub-field to an extent above or below average, relative to its other publication activities. Note that, this RLA-index does not measure the absolute ‘strength’ of a country in a particular sub-field, but that it merely expresses the 74 relative amount of activity in a particular sub-field of a country that is engaged in several sub-fields. As such, it should be interpreted as an ‘activity index’ (Hinze, 1997; Egghe & Rousseau, 1990). Egghe & Rousseau (1990) also mention the existence of, what they call an Attractivity Index (AAI). Its calculation is identical to that for the RLA-index, except that now citations are used instead of publications. The Attractivity Index should characterise the relative impact of a country's publications in a given sub-field as reflected in the citations they attract. As such it reflects a particular country's share in citations attracted by publications in a particular sub-field j, compared to that country's share in citations attracted by publications in the whole field. Again, a value higher (lower) than unity would reflect a higher (lower) than average impact in the given sub-field, relative to the world's average. VI.5. Intermezzo: one-dimensional vs. two-dimensional measurement and assessment Before continuing our discussion with the second stream within bibliometrics focussing on the more structural aspects of science and science fields, we will first briefly mention several considerations about measurement and assessment. In Martin and Irvine's (1983) methodology of converging partial indicators it is debated that all quantitative measures of research performance are, at best, only partial indicators. It is stated that these indicators will only partly reflect what they are purported the measure, according to Martin and Irvine (1983), contribution to scientific progress. However, a network of other factors will also influence these indicators. Their methodology of converging partial indicators then assumes that, although each indicator in itself can only be considered, at best, to be a partial indicator, through the combined use of multiple indicators, the influence of other, disturbing factors can be minimised (Moed et al., 1985; Tijssen, 1992). Then, in occasions where all, or a majority of the indicators point in the same direction − i.e. if convergent results are obtained −, confidence may be attached to the results. Or, as Martin (1996) himself, puts it: ‘ … a result based on the convergence of several indicators (preferably including extensive peer evaluation) is likely to be more reliable than one based on a single bibliometric indicator or on peer review alone (Martin, 1996, p. 352). Although their approach has been criticised on several occasions (e.g. Moed et al. 1985; Moed, 1989), it is widely accepted that an assessment based on multiple indicators − bibliometric and non-bibliometric, should be more reliable than one based on a single indicator. Since any single indicator can at best only capture one aspect of performance, it is argued that using a range of performance indicators, preferably both quantitative and qualitative, is always better than relying on just one or two. So, rather than attempting to measure research performance on a one-dimensional scale, it is much more adequate to make an assessment based on a multidimensional set of ratings or some sort of profile (Martin, 1996). 75 VI.6. Mapping of science: techniques and utility for science policy purposes VI.6.1. Mapping of science: an introduction The enormous and still increasing amount of information on technology and science necessitates a systematic and careful approach to achieve a sensible data reduction. Large numbers of complex tables are mostly not very useful in this respect. New and additional ways of representing the data may reveal these underlying and until then ‘hidden’ features (Engelsman & Van Raan, 1991). Based on the analysis of information from the scientific literature, quantitative techniques are used to display both structural and dynamic aspects of scientific research. Its main purpose often is to display the foci of interest and attention that prevail in a particular scientific area (field or sub-field) in a certain period of time (Braam, 1991). Cartography of technology or science reformats the data into a specific graphical representation and also reduces the data while retaining only the essential information. Maps of science are constructed to display relational aspects in science. Contrary to indicators in the form of frequency lists, ranks or tables, maps can easily provide information on links between scientific entities (Tijssen, 1992). Maps, in this sense, are particularly helpful for visualising the pattern of such a large and/or complex structure inherent to the data. In his work ‘Mapping of Science: Foci of Intellectual Interest in Scientific Literature’, Braam (1991) identifies four steps in the bibliometric mapping of science: (a) the selection of (a set of) relevant scientific documents covering an area of scientific research and the subsequent collection of bibliographic data, derived from these scientific texts; (b) the construction of a separate data set in which the data are structured in a way appropriate for analysis; (c) statistical analysis of the bibliometric data; and (d) some guiding theoretical framework for the interpretation of the results. The mapping of science is performed by means of relational, two-dimensional indicators which are based on the analysis of the number of times different information items, such as author names, keywords, classification counts or citations, occur together (co-occurrence) (Hinze, 1997). By investigating connections through the ‘co-occurrence’ of references, words and/or classification codes, it becomes possible to unravel the immense network of interrelated pieces of knowledge, and to uncover major ‘hidden patterns’ in the vast amount of information carried by the scientific literature (Van Raan, 1997). The result of such an exercise is often represented in a two-dimensional way, a ‘map’, in which information items (and the publications containing these items) are structured according to their links, as uncovered in the ‘co-occurrence’ analysis (Luwel et al., 1999). As such, mapping techniques make a systematic use of the information carried by scientific articles possible (Hinze, 1997). Engelsman & Van Raan (1991) distinguish three main types of bibliometric maps. The first one is based on co-citations, the number of times two particular articles are cited together. A second type of bibliometric mapping is based on co-word analysis, or more broadly speaking, based on co-occurrence. Word co-occurrences reflect the network of conceptual relations from the viewpoint of the scientist in the field concerned. A third type of bibliometric maps, somehow artificial in our view, is the coclassification map, for example based on the co-occurrence of different classification codes. 76 Different techniques, making use of different information items included in scientific articles, are used in order to map scientific fields, as well as their development over time. It should be pointed out that it is not only possible to map scientific fields but also, by applying the same techniques, creating maps of technology (Engelsman & Van Raan, 1991; Korevaar & Van Raan, 1992). The specific techniques will be discussed briefly in the next section. IV.6.2. Several techniques Within the domain of the mapping of science, four principal types of bibliometric maps can be distinguished: (1) journal-to-journal citation maps; (2) co-classification maps; (3) co-citation maps; and (4) co-word maps. Mapping analyses using co-citation data or co-word data are by far the most popular (Tijssen, 1992). Especially the co-classification maps may prove of importance for analysing the internal dynamics of the science and technology link. (a) Mapping by means of data on journals In this approach maps of science are derived from existing inter-journal networks. By examining the links between journals as created by citations given to and received from other journals, it is believed that the macro-level structure of scientific activities can be pictured. The assumption underlying this approach is that inter-journal citation-frequencies reflect the magnitude of subject relation between journals (Tijssen, 1992). (b) Co-classification maps The structure of scientific fields may be studied by representing the relations between sub-fields as described by classification codes. As a rule, different classification codes are assigned to an individual publication. As such, the co-occurrence of classification codes (or subject-classification terms) can be studied. The number of times different classification codes occur together is taken as a measure for similarity. By means of multivariate statistical methods this similarity can be represented in a ‘map of science’, in which the structure of the analysed scientific areas becomes visible. Examples of studies based on co-occurrence analysis of classification codes are: Engelsman & Van Raan (1991), Van Raan and Peters (1989). The main advantage of this approach consists in the relative ‘straightforwardness’ of the method. Since all classification codes have well-defined meanings, the interpretation of the resulting map should not pose much problems. However, this relative ‘straightforwardness’ in the same time also illustrates the main drawback of this method. Since this kind of analysis is based on an existing classification system, it can never reflect the development of new scientific (sub-) fields in an adequate way, simply because classification systems are not updated on a continuing basis (Hinze, 1997). (c) Co-citation maps The mapping of the structure of scientific research can also be done on the basis of citations given by authors in their publications. This is done in co-citation analysis, initiated by Small at the Institute for Scientific Information (ISI) in Philadelphia (Small, 1973), and nowadays one of the major quantitative techniques used to map both the structure and the dynamics of scientific (sub-) fields. In co-citation analysis information on how often pairs of articles are cited together in other papers is used to construct maps of scientific research (Hinze, 1997). The resulting frequency of co-citation is used to measure the degree of association between two documents. The clustering of documents in co-citation analysis thus is based on existing co-citation relations and, as such, this approach is considered to be an alternative to existing classifications of scientific research activities (Braam, 1987). 77 (d) The technique of co-citation analysis The first step in co-citation analysis usually concerns the selection of documents that are cited more than a specified number of times, from the reference lists of set of (citing) publications published in a particular year. Only publications exceeding this ‘citation threshold’ are included in the analysis. The use of such a ‘citation threshold’ is used both to separate signal from noise, and to save computer time (Braam, 1987). Next pairs of publications are selected that co-occur relatively frequently in the citing publications reference lists. Two papers being cited together are considered to be linked. The number of times two publications are co-cited is a measure of the strength of this linkage. By specifying an integer ‘cocitation threshold’ or by using a specific index and defining some threshold value for this index, a distinction is made between significant and insignificant pairs of co-cited items, that is pairs with a relative strong versus a weak link. Pairs of publications exceeding this specified ‘co-citation threshold’ are included in the mapping exercise, while pairs not reaching this threshold are discarded. Several indices can be used to measure this co-citation strength. By far the most popular ones are Salton's formula and the Jaccard Index (Sneath and Sokal, 1973). Finally, the pairs of co-cited publications that reach the above mentioned ‘co-citation threshold’ participate in a clustering routine. An often used clustering routine is ‘single linkage clustering’ (Braam, 1987), in which clusters of documents are aggregated by sequentially linking together all selected pairs of documents having at least one document in common. Clustering thus brings together document pairs with high co-citation strength relative to the number of citations received by the individual articles. For more details on cocitation analysis and the construction of maps based on co-citation analysis we refer to Braam (1987); Mombers et al. (1985) and Tijssen (1992). At this point we would like to mention a number of limitations of maps based on co-citation analysis. Co-citation analysis has been criticised on several occasions. Hicks (1987) identifies five major limitations of co-citation analysis as a tool for science policy: 1. Delayed inception of clusters In general, a time lag can be perceived between the emergence of new specialities and their appearance on a co-citation map in the form of a new cluster. Hicks (1987) indicates that in some fields the number of researchers may grow rather quickly, resulting in a relatively short time lag before their work appears on a co-citation map. However, in other fields where it may take some years before a critical mass is reached, this time lag may be more substantial. 2. Inconsistent definition of the (sub-)field Although the identification and demarcation of ‘specialities’ is extremely difficult, strict boundaries have to be constructed for the purpose of analysis. First of all, definitions of fields, sub-fields and specialities of scientific researchers, on the one hand, and the bibliometric analyst, on the other, should agree in broad outline. Moreover, in order that observed trends should reflect actual changes in national research effort, definitions should be consistent over time. 3. Effects of errors in citations As already mentioned before, obviously co-citation clusters are also affected by errors made by citing authors. 78 4. Under-representation of experimental papers On several occasions, it is found that papers with an experimental nature are relatively underrepresented in co-citation clusters and theoretical papers are relatively over-represented. 5. Subjectivity in co-citation analysis Finally, the subjectivity inherent in the setting of both the ‘citation threshold’ as the ‘co-citation threshold’ has been a point of critique. This is especially so because the choice of these threshold levels can strongly affect both the size and the content of resulting clusters. (e) Co-word maps The last technique that shall discuss is the technique of co-word mapping. In the field of ‘mapping of science’, co-word analysis has been developed as a rival technique to co-citation analysis by Michel Callon and his colleagues at the Centre de Sociology de l'Innovation of the École des Mines de Paris, in co-operation with British and Dutch scientists (Callon et al., 1983; Callon et al., 1986). Whereas cocitation analysis builds on co-occurrences of pairs of cited documents in publications, co-word analysis focuses on the co-occurrences of pairs of content words related to these publications. These words can consist of manually or automatically established terms (free terms or controlled terms), words appearing in the title, in abstracts, or from the full text (Braam et al., 1989). Assuming that words designate specific loci of interest, the amount of publications associated with a given word then provides an indication of the number of people involved and time invested in research activities focusing on that particular ‘locus of interest’. The degree of overlap between distinct loci of interest can they be measured by the amount of times words occur together in a set of publications. If a set of words appears to co-occur relatively frequently, this can be interpreted as constituting a broader ‘problem area’ or ‘research theme’. Interrelations between distinct problem areas or research themes are then indicated by co-occurrences of words from these different areas (Braam et al., 1989). In order to map the structure of science, co-word analysis can be based on indexing terms, ‘controlled terms’, which are externally supplied by journal editors or professional indexers at documentation services, or on key-words, ‘uncontrolled terms’, terms supplied by scientists themselves (Tijssen, 1992). The use of ‘controlled terms’ in a mapping exercise resembles the before-mentioned co-classification analysis, since either some pre-specified terms or some pre-specified classification code is assigned to the publication. As a result, like in co-classification analysis, when performing a mapping exercise by using ‘controlled terms’ most recent developments will most likely not be reflected (Hinze, 1997). Moreover, when using ‘controlled terms’ or classification codes, the result of the mapping exercise may be flawed by what is called an ‘indexer effect’ (Healey et al., 1986). Since both ‘controlled terms’ and classification codes are assigned to publications by professional indexers, their view about a particular scientific paper may influence the results of the analysis in cases where the assigned terms or classification may not correspond with the authors view. Therefore, co-word analysis based on ‘uncontrolled terms’, key-words provided by the authors themselves, rather than terms given by the database producer should be used by preference (Hinze, 1997). The resulting network of co-occurrences between different words, collected from a set of publications, can then provide a detailed insight in the structure of the publication contents. By comparing publications with respect to the occurrence of similar word-pairs, co-word maps provide a direct quantitative way of linking the conceptual contents of publications. As such, through the relationships between various research themes, as reflected in the occurrence of word-pairs, research activities within 79 a particular scientific domain can be depicted (Tijssen, 1992). A computer program, LEXIMAPPE, was developed for obtaining graphical representations of co-word maps (Callon et al., 1983). Compared to co-citation maps, co-word maps are found to be more inclusive and more up to date, in that the emerging specialities are included relatively more quickly in co-word maps. The main drawback of co-word analysis is that words, or at least their meanings, are not always unambiguous and are not seldom context dependent (Hinze, 1997; Van Raan, 1993). However, Braam (1991) mentions that co-citation and co-word analysis should not so much be seen as competing techniques, but rather as complementary, since they both draw on fundamentally different sources of data. Indeed, whereas co-word analysis draws on data directly related to the content of current research activities, co-citation analysis draws on the content ascribed to the intellectual base, as reflected in earlier literature, on which this current research is based. As such, the extent to which coword and co-citation maps converge can give an indication on the relation between a shared intellectual focus on particular topics of research − as reflected in the co-word map −, and the intellectual focus on the relevance and importance of earlier work on these topics − as reflected in the co-citation map (Braam et al., 1989; Braam, 1991). Braam et al. (1989) stress that the combined use of the co-word and the co-citation technique for one and the same bibliometric data set offers an excellent tool to investigate the relationship that exists between two different types of intellectual focus in scientific research: shared attention to a common body of base literature, on the one hand, and shared attention to research topics, on the other. Besides the combined use of different techniques for the mapping of science, Tijssen (1992) also stresses that such maps should preferably be used in combination with other information sources, such as opinions of independent experts in the scientific (sub-) fields under study. However, this use of bibliometric maps in combination with an expert's opinion may lead to a dilemma in the use of science maps: " … if the expert's 'mental map' and the bibliometric map look alike, it is often argued that the map adds nothing to the knowledge; if, on the other hand, the structures differ considerably, the validity of the map of science is questioned. In both cases the map may be discarded as useless. On the other hand, if maps visualise features of the structure or development of a field that is surprising to experts in the field it sometimes triggers criticism or disbelief. "(Tijssen, 1992, p. 35) Moreover, maps of science are, by definition, only artificial ‘snapshots’ of abstract structures within the science system, which in addition are in a constant state of ongoing development (Tijssen, 1992). IV.6.3. Utility of mapping exercises for science policy purposes Let us assume that a mapping exercise by means of co-citation and/or co-word analysis indeed makes it possible to display the cognitive structure of scientific (sub-) fields. In that case it would prove very interesting, from a science policy point of view, to explore the research activity of a particular country or region in the various (sub-) fields identified in the resulting map of science. Investigating the distribution of publications, research institutes, or countries over these research areas can do this. 80 Mombers et al. (1985) employed co-citation analysis to assess Dutch participation in science. In particular, analyses were performed by identifying the Netherlands contribution7 to both the cores − i.e. the cited − and the citing literature. Useful inferences could be made from these analyses. For example, if a particular cluster showed a strong Dutch participation in the core, this indicated a strong interest in the past. If at the same time one also observed a substantial Dutch share in the citing literature, this indicated a continuous strong interest. If a significant Dutch participation was observed only in the citing literature, the interest in that particular (sub-) field was either marginal or had only started recently. An alternative explication for the latter case could be that older Dutch publications on the subject under study had little impact and consequently had not been cited enough (Mombers et al., 1985). A similar approach was adopted by Noyons et al. (1999) in a combined performance/mapping study used to evaluate the activities of IMEC, a Belgian research institute in microelectronics. First, a mapping exercise based on classification codes was performed. As a result of this exercise, the cognitive structure of microelectronics, as defined by the publications of seven of the major institutes active in this field, was displayed, with clusters representing the various sub-domains of the field. Subsequently, the relative activity of the research institute in terms of number of publications and their impact was assessed, compared to both the world average and the performance of selected benchmark institutes. As such, this combined procedure can be used for monitoring research performance on a micro-level, while at the same time taking into account recent developments in the field. In this way, an assessment of the performance of any actor, being either a research institute, a university department, or a country, can be made, relative to its peers and from a dynamic perspective at once (Noyons et al., 1999). Engelsman & Van Raan (1991) presented a study focussing on knowledge diffusion in fields of technology using mapping techniques. They constructed maps on three levels. The first level, the macro level (co-classification - and co-word analysis offered some interesting insights on the role of Japan in chemical, mechanical and electronical industry. At the meso level, maps were used to visualise structures of activity (performed for the emerging ‘crossroad’ technology of Optomechatronics). At the micro level specific fields were analysed revealing interesting insights on innovations and technological improvements. They also introduced the concept of ‘technological peripheries’, identifying the most closely linked fields. Finally, some comments on the usability of maps. As Noyons & Van Raan (1998) mention, for a long time maps were perceived as interesting overviews but difficult to interpret, or just to read. Furthermore, it seemed that the richness of information on the maps was not sufficient, in the sense that traditional overviews would also provide the same information. The conclusion based on these shortcomings is that mapping, as a policy support tool, had not yet reached a mature state. However, according to Noyons & Van Raan (1998) mapping techniques and thus maps, especially those developed at CWTS (Centre for Science and Technology Studies), have been improved. They mention the following improvements: multi level mapping, ‘map external’ information included in the maps, graphical user interface to browse through maps and added information, and animated representation of positioning of sub-domains in main map of fields. Their applicability has been improved. 7 A Netherlands contribution was defined as an article originating from a research institute, company or university situated in the Netherlands. 81 Part III: Science and Technology - Examination of the interaction CHAPTER 9 - LINKING SCIENCE TO TECHNOLOGY I. General introduction In the previous two chapters we discussed the most important possibilities and drawbacks for measuring science and technology in view of their close relation to research and industrial development. Many scholars have pointed out that in several technological areas, it is becoming increasingly difficult to differentiate between science and technology because of the multi- and interdisciplinary nature of knowledge production and the increased number of technology fields linked to science development (Meyer-Krahmer, 2000). In the literature, the approximation of scientific development towards technological evolution is coined by terms such as: “Science base of technologies” (Meyer-Krahmer & Schmoch, 1998; Van Vianen et al., 1990), “Scientification of technologies” (Narin et al., 1997) or “Science-Driven Market” (Grupp, 1998). It thus becomes more important to focus on the interaction, and not solely on the two systems independently. With De Solla Price (1965) and Rosenberg & Birdzell (1990), there has been quite some qualitative understanding of the linkage between science and technology. Yet until the ’90s there has been very little quantitative data to specifically characterise this relationship or to pinpoint the subject, national, international and temporal aspects of the coupling between science and technology. The increasing number of constraints and developments that governments are facing - constraints on public expenditure, pressure on the peer review groups, emergence of new scientific and technological areas etc. – more than ever before, lead to the necessity of ex-ante prioritisation and ex-post evaluation in science and technology. As such, a tool that enables science interactions and science bases of technologies would surely contribute to facing this challenge. The central role, but also the changing nature of ‘knowledge’ in technological progress has already been discussed extensively in part I. Patents and scientific publications are in their function of ‘carriers of relevant knowledge’, useful ‘proxies’ of scientific and technological activities (for more details, see part II). Due to the information contained in these documents, it becomes possible to model the relational aspects of science and technology and to delve deeper in to the social and cognitive aspects of the interrelation. Obviously, the presence of cross citation between both spheres is the vehicle towards this integrated analysis. However, a number of trivialities have to be kept in mind and adequately dealt with in respect of the purposes for using them. II. Background on the science and technology interaction II.1. Importance of the S&T interaction Many scholars have put forward the importance of scientific advancement for technological development. The interface between both is the nursery of research and development activities and is undoubtedly a major driving force for the economic development of our world. Mansfield (1991) has added a quantitative underpinning to this; from his survey data of 76 large US companies, he estimates that 11% of industrial innovations and 9% of the process innovations would not have been developed in time without the contribution of academic research. 82 In the early 1960s, Toynbee (1963) and De Solla Price (1965) already extensively discussed the science – technology interaction. Toynbee’s well-know metaphor of this interaction as a pair of dancers typifies very well the assumed importance of their simultaneity and co-development. Using this metaphor two decades later, Narin et al. (1989) analyse the relation between patents in biotechnology and papers in bioscience. They point out that not only ‘both partners’ dance together in the same rhythm, but that they are also locked in an embrace from which it is virtually impossible to separate them. At the same time they argue that, based on the increased number of articles cited on the front-page of patents in the last years, technologies have become increasingly ‘scientized’. A refinement has been made in this proposition in 1997, when Schmoch (1997), but also Narin et al. (1997) and others, have indicated that especially modern technological areas are often highly scientific. The (increased) importance of science for technological development has also been reflected in the categorization of manufacturing industries. It was Pavitt (1984) who introduced the science based sector in 1984, obviously implying that also from an economic classification point of view, a broad category of technological activities around numerous companies exist, in which innovations are directly linked to technological advances made possible by scientific progress. Furthermore, innovation-oriented activities of the relevant companies are usually formalised in R&D laboratories in which investments in research and development appear to be disproportionally high. According to Pavitt (ibid.) the electronic industry, large parts of organic chemistry industry, the pharmaceutical and biotechnology industries, air and space travel and the military technology industry can be classified in to the science-based sector. Due to the science – technology interface, market economies of western nations have achieved unprecedented prosperity. As such, science and technology are of vital importance for the performance of a national economy, as numerous discussions concerning the international competitiveness of national economies currently show (cfr. part I). The need for a better understanding of the interaction is obvious, in view of the dynamic of the science and technology interaction, the growing complexity of the interrelation (cfr. infra), the importance and necessity for more transparency in science funding and return on funding. Internationally, within the last few years, a growing interest has been developed in trying to foresee and understand future developments. II.2. Science and technology interrelation: who leads and who follows? A. The changes in the knowledge production and diffusion system Continuing the discussion on knowledge creation and diffusion touched upon in part I, but now extended by the context of the more general science and technology interaction, we shall look into the factors that determine the direction in which both Toynbee’s (1963) ‘dancers’ move. Understanding this mechanism is of great importance if one attempts to ‘intervene’ in the interaction process (numerous actions are thinkable). Most of the studies dealing with the subject are based on the so-called linear model of knowledge production, diffusion and utilization (cfr. section 3.IV.). It is assumed that scientific progress is directly followed by a technological application. According to Meyer (2000a), it appears that the work of Narin and his colleagues is primarily based on the ‘linear’ understanding of the innovation process. This is also reflected in the explanatory approach of science and technology interaction that they apply (based 83 on paper citation analysis), which implies that a scientific paper cited in a patent has a causal implication, specifically that science directly leads to technological development. The state of the art on the knowledge production and diffusion paradigm has to be aligned with the view on science and technology interactions, and with the analytical approach of studying that interaction. Meyer-Krahmer (2000) identifies two phenomena (‘new paradigms’) that indicate a structural change in both the production and the diffusion of knowledge. The first one is the ‘increasing number of technology fields with close linkages between basic research and industrial application’. The second phenomenon is the ‘growing importance of multi- and interdisciplinarity, which is a reflection of the fact that separation of technologies becomes more and more difficult and that the overlapping areas are often highly dynamic. Both paradigms will be discussed briefly. The increasing number of technology fields with close linkages between basic research and industrial application According to Schmoch et al. (1993), since the sixties a growing number of retrospective studies have tried to evaluate the dependency of past innovations on scientific work. The first of these large-scale studies was the so-called “Project Hindsight” of the US Department of Defence. The study assessed the contribution of science and technology to the development of 20 weapon systems. One of the conclusions was that basic research seemed to be rather unimportant, whereas applied research had the highest payback. As a reaction to these negative findings, the National Science Foundation launched a second study under the acronym “TRACES”. The findings of this study provided evidence to support the ‘science-pull’ model of innovations. § § Increasing inter- and transdisciplinarity A second major observation is the growing inter- and transdisciplinarity between S&T, but also within each of the two systems. Many large national studies have been performed in order to monitor and identify key future technologies related to economic potentials. Examples are the Delphi inquiry “Future Technology in Japan”, carried out by the Science and Technology Agency (STA), the study of the German government “Technology at the Threshold of the 21st Century” and the study performed by the French Ministry of Industry (1995) “The Key Technologies for the French Industry”. Al these studies aimed at identifying future relevant developments in the existing science and technology landscape. In the study performed for the German government (Grupp & Schmoch, 1992b), it was stressed out that technology at the beginning of the next century couldn’t be separated according to conventional disciplines (see table 4). Whatever the paths of technology emergence, they are all linked together. Table 4 - characteristics of traditional and modern production of scientific and technological knowledge Traditional systems Discipline based Internally driven Individually dominated Modern systems Transdisciplinary Practically oriented Network dominated Source: Gibbons et al. (1994) According to Gibbons et al. (1994), the mode of organisation of science and technology in the most advanced sectors, is leading to the above-illustrated transition. In view of the social embeddedness of science, the linear model of knowledge production and diffusion is inadequate for capturing the science and technology interaction in all its aspects. In a study performed by Grupp (1998), the interplay 84 between science and technology shows that scientific activities, at least in the area of laser technology, do not precede technical endeavours but almost simultaneously turn to the new paradigm, confirming, what is also argued by Meyer (2000a), that the S&T relation is reciprocal instead of linear. Figure 8 illustrates the difference between the ‘linear interpretation’ (model A) of the science and technology relation and the network model (cfr. section 4.IV.) or ‘two-branched model’ (model B), a term coined by Rip (1992). The two-branched model is based on two kinds of activities: (1) Exploitation understood as technological development, pilot processes and feedback. (2) Exploration to increase understanding. The figure illustrates that basic science research, or science, produces a number of scientific research papers, indicated here as copyright symbols (ã). Technology at a certain point in time materialised in patents illustrated by trademark symbols (â). The linear approach is being represented in A and the network model or two-way model in B. Figure 8 - (A) Linear interpretation (B) Network model or Two-way interpretation A B Science C Science C C C C C C C C C C C Highly mediated R R R R R R R R R R Technology R R R R R Technology Source: Meyer (2000a) B. Discussion of the science – technology interaction in the light of cross-citation analysis The model of Rip (1992), and its discussion by Meyer (2000a), clearly illustrates how the S&T interrelation can be grasped and discussed by patent citation analysis. Figure A describes one way of interpreting the data of Narin and of many other researchers by looking into the S&T relationship in a rather ‘direct’ way (assuming that science and technology are represented by paper and patents, and that the cross-citations represent the interaction). The linear approach as such implies a continuum that stretches from scientific research to technological application. On the contrary, figure B shows that the S&T interrelation is highly mediated, even though one might find occasionally direct relationships between a patent and the scientific paper it cites Meyer (2000a), and Grupp & Schmoch (1992a) have also looked into the relation between paper references cited in patents and the interpretation in terms of science – technology interaction. Meyer (ibid) studied this issue in 10 cases (patents) by looking into the motives of inventors for citing nonpatent literature (cfr. section 9.IV.1.1.). The cases concerned USPTO patents in the young and emerging field of Nanotechnology. Researchers seem to integrate scientific and technological activities increasingly by working on one subject matter and generating scientific papers as well as technological output. Grupp & Schmoch (ibid) also concluded that there are several motives for a patent examiner to cite scientific papers in a patent as prior art, and that science cannot always be assumed to trigger technological development (see also Tijssen & Buter, 1998). Several writers in the literature on innovation and technical change have emphasised that information contained only in scientific papers 85 will not suffice to implement the technology in question (see for e.g. Pavitt, 1987; Rosenberg, 1990). This suggests that there must be more mechanisms in the linkage between science and technology, and that the interplay is not so straightforward. The early discovery of food preservation in tin-coated steel by Nicholas Appert in 1810, and the explanation of this process much later in 1873 with the discovery of the role of micro-organisms in food spoilage (i.e. the birth of the science of bacteriology) provides an early example of science lagging behind technological development (technology-pull). III. Implications from a policy perspective Keeping in mind the importance of the interaction and cross-fertilization between science and technology, and the different aspects of this interaction, Meyer-Krahmer (2000) discerns several implications for science and technology policy making. Before turning to these implications, let us first briefly look into the context in which the policy interest becomes manifest. As Steinmueller (2000) pointed out, in the past two decades, wide-ranging socio-economic and technological transformations have caused European governments to reformulate their policies concerning government-supported scientific activity. This reformulation has been accompanied by shifts and even by complete turnarounds in research funding across fields of inquiry as well as in the type of research conducted (basic versus applied). The present constraints on public expenditures in general, the enormous investments involved in sustaining the econo-techno-scientific complex, and the actual debate on the effectiveness of government supported scientific research, all augment the need for more accountability and effectiveness in the area of publicly funded research (Hanusch, 1999; Ziman, 1994; Moed, 1989). The dependence of technological development on scientific progress has been subject to much examination and speculation (see for instance the analyses and the discussions reported in the European Science and Technology Indicators reports by the EC (European Commission, 1997)), or even the ongoing discussion on benchmarking Industry – Science relations in Europe). Let us now return to what Meyer-Krahmer (2000) refers to as “ implications for science and technology policy making”. A first implication is the need to promote university/industry relationships. A long tradition exists of different institutional settings – such as personal contact and co-operative research institutes and network approaches – and financial incentives – such as subsidies or tax incentives for extramural R&D or R&D co-operation projects –. It seems that transferring individuals is an efficient way of technology transfer. Increasing the mobility of R&D personnel between industries, universities, research institutes and government should therefore be one of the top priorities. Both types of promotion of technology transfer are mainly elements of a technology-push approach. Demand however is also a very important determinant of industrial innovation and technology development. In areas where comparative advantages or high national or international demand is expected, a concentration of R&D activities is needed. Indeed, national technology should not be restricted to its own national R&D activities, but should also respond to international demand, which consist a second implication. Even if a country does not have comparative technological advantages, it can act as an imitator, using effective ways of technological development and avoiding first-mover strategy with high learning costs. A third consequence is that new ways of linkages between basic and applied research need to be developed. Also, the facilitation of linkages between disciplines on the one hand and of trans- and interdisciplinarity on the other, are crucial. In order to do this, special attention must be paid to: 86 § § § § § Organisation of research: problem orientation in the case of well defined social or industrialtechnical problems, requiring new ways of project organisation and management. A better integration of the long-term application-oriented basic research in order to respond to future needs in a better way. Team research: besides the currently predominant orientation of academic research towards individualised research setting, team research must be strengthened. Improved intra- and inter-sectoral mobility of researchers: on an international level and also between science and industry. Increased flexibility of research structures: more rapid take-up of new developments through: - Flexibilization of the present rigid public service rules and budget laws - Deregulation of the academic administration - Networking of research institutions for a limited time, especially in an international framework A fourth consequence relates to university training. The increasing link between science and technology, as well as trans- and interdisciplinarity, lead to an increasing demand for Ph.Ds in engineering and natural sciences. Firms will invest in countries in which this academic human potential is available. A last remark that has to be kept in mind is that, in the past, government policy emphasised basic and public R&D, leaving industrial and applied R&D over to firms (cfr. chapter 3). However, due to the increasing interdependence between public and industrial R&D, industrial R&D may not be able to increase without an increase of public R&D. It seems that the traditional institutional delineations have to be reconsidered. IV. Exploration of the science – technology interaction In this section, we present a number of available approaches for investigating the interrelation between science and technology (S&T). Besides the discussion of the different approaches, we shall also discuss the main advantages and disadvantages. The focus however lies on S&T interaction analysis based on non-patent references – cross citations from technology to science. With respect to the quantitative description of science, both input-related and output-related indicators can be used (cfr. figure 2). Frequently used input indicators include the volume of research budgets and the number of scientists involved. However, since indicators of research budgets and personnel are usually only available on a highly aggregated level, the quantitative description of science usually occurs in terms of scientific output. Scientific output is generally measured through the number of scientific articles published by a particular researcher or research group. As described elaborately in the section on bibliometrics (cfr. chapter 8), a multitude of derived indicators can be constructed on the basis of publication output measures. Examples of derived indicators include productivity and impact measures. Analogously, technology can be analysed in terms of input indicators, such as the amount of research expenditures or the size of the research staff, and output indicators, such as the number of patents and other derived indicators (Schmoch, 1997). Grupp (1998) states that it is one of the most difficult tasks to establish a measurement specification for the S&T interaction. He states that: “In view of the poorly developed state of research, it is hardly surprising that, unlike the case of the patent statistics treatment of technological spillover effects, there 87 are hardly any economic expressions to identify the science base of technology” (ibid, p. 331). One of the exceptions, however, is the situation in which the patent examination leads to references to scientific literature indicating and delineating the so-called ‘prior art’. In order to describe the relationship between science and technology in a quantitative way, we can either focus on the existing relationships between some of the aforementioned indicators, or we can construct relevant derived indicators ourselves. However, since the main point in the analysis of the science – technology interaction is the demarcation of the knowledge movements between both spheres, one should aim to identify those proxies that best represent science and technology. Moreover, input indicators such as research budgets and personnel are usually only available on a relatively highly aggregated level, and hence are not suitable for analysis of particular technology areas, mainly output indicators based on patents and publications are used for this purpose (Schmoch, 1997). Patents and publications, due to their knowledge (information) carrying character, are best suitable for identifying the cognitive and social networks underlying the science and technology interaction. However, by using these proxies only a certain type of science – technology interaction can be grasped, namely the intacit and profound variant of the interaction present in patents and publications. In order to grasp the more tacit forms of science – technology interaction, other approaches need to be applied. As a consequence, we distinguish between direct, explicit forms of S&T interactions, and indirect, implicit forms. As explained, the first type implies that the science relation is a clear and direct one (Noyons & Van Raan, 1994). This is however not always the case, implying that other approaches that can reveal more implicit forms of interaction need to be applied. These approaches, “…do not identify links between science and technology which are actually present, but rather links which could be there, or even should be there” (ibid). Based on this distinction, we can discern between several approaches for investigating the link between science and technology (Schmoch, 1997; Noyons & Van Raan, 1994). The table below present the different analytical approaches. Table 5 – Overview of the different approaches for analysing the S&T interrelation Direct/explicit S&T interrelation q Indirect/implicit S&T interrelation Citations to scientific publications in patents q Patents of scientific institutions q Publications of industrial enterprises q Co-activities scientific (joint activities) institutions and between industrial enterprises q Parallel observation of patents and publications q Cartographical occurrences of approach based publication and on co- patent keywords In the following sections, these approaches will be discussed more in detail. The direct S&T linkage approach will be discussed first, followed by the more indirect linkage approaches. 88 I. Direct/explicit S&T interrelations IV.1. Citations to scientific publications in patent documents In the section on bibliometrics (cfr. section 8.IV.) the concept of citation analysis was already introduced in a purely scientific context. We mentioned that every scientific publication contains references to other publications of which the content in the mind of the authors is related in some way or another to their own article. These references - or citations - can then be used for a variety of purposes, of which the assessment of research performance of particular actors and the mapping of scientific (sub-)fields are two of the most common approaches. In a science-technology interface context, however, citation analysis involves the study of citations to scientific publications in patents. In principle, the logic applied in the field of bibliometrics - where a citation is perceived as an information item linking two scientific papers - is simply transposed to a science-technology context - where a citation is perceived as an information item linking a particular patent with a particular scientific article. The simple "translation" of this logic from a science context to a science-technology context however, does not occur without a flaw (cfr. section 9.IV.1.). Assuming that patents represent technology and publications science, the references to non-patent literature (NPRs) in patents search reports can be regarded as an indicator of the science base of technology (Schmoch et al. 1993). According to Collins & Wyatt (1988), Schmoch et al. (1993), Narin et al. (1997), Meyer (2000a) and Meyer-Krahmer (2000), studies of patent-to-journal citations are inherently interesting because of what they reveal about the linkage between science and technology. Furthermore, Grupp (1998) points out that patent citations are more objective than citations from scientists or technologists in their own publications. Various studies, among which the above mentioned, verify and also demonstrate that references to scientific publications are an appropriate indicator for describing the S&T relation. Before investigating this issue in more detail, a few introductory remarks on patents and patent citations seem necessary. First of all, a patent document consists of three elementary parts: (1) the title page or front page with bibliographic information, (2) the text, which includes a description of the invention, preferred examples in detail as well as drawings, diagrams, and flow charts, and (3) the claims. For citation studies, most of the relevant information is found on the front page of the patent document (Meyer, 2000a). It includes the following information: title and number of the patent, name and address of the inventor(s), name and address of the assignee(s), date of application, issue date, details of searches carried out by the examiner, abstract and drawing (when applicable) of the invention, classification code(s) and bibliographic information (Collins & Wyatt, 1988; Meyer, 2000a). For citation studies, the bibliographic information available on the front page of the patent document is most important. In particular, we can distinguish between two types of references in a patent document: (a) patent references, and (b) non-patent references. Obviously, the second type of references is particularly interesting if one wants to investigate the link between science and technology. In the following sections, we will subsequently discuss the concept of NPRs and the related citation motives, the nature of the citation link established by NPRs, a number of indicators that can be constructed, and the usefulness of this approach from a policy perspective. 89 IV.1.1. Non-patent references (NPRs) Non-patent references (NPRs) are a mixed set of references to scientific journal papers, meetings, books, and many non-scientific sources such as industrial standards, technical disclosures, engineering manuals, and every other conceivable king of published material (Perko & Narin, 1997). In particular, the references to scientific literature provide empirical evidence that a technical invention is in some way related - or perhaps initiated and/or stimulated - by research activities (Tijssen, 2001). The number of NPRs that can be found in patents differs strongly. This can be partly due to the characteristics of the patenting offices (external factor) or due to the characteristics of certain fields of technology (internal factor) - cfr. infra. According to Meyer (2000a), only a minority of patents contain references to non-patent literature. He cites a Norwegian study of the knowledge base of certain technologies (Iversen, 1998; op. cit. Meyer 2000a) showing that only 30% of Norwegian-oriented US patents contained NPRs. One explanation for this is the fact that the stock of prior art references of the Patent Offices mainly consists of patent literature. For example, the German Patent Office has about 10% of non-patent literature to refer to. For patents to cite substantial numbers of publications in journals, they must be situated in a field that is young, developing rapidly and with high scientific content. Furthermore, enough patents have to be granted in a short space of time so that the citations in them are sufficiently numerous to bear statistical analysis (Collins & Wyatt, 1988). According to these authors, this means that in practice the number of patents granted within a space of about five years should produce several thousand citations to the journal literature. A. Academic citation versus patent citations Referencing, as one of the widely accepted and utilised norms, confirms and illustrates the social character of the knowledge creation and diffusion process. Citing occurs not only within the scientific community but also within the technologic community. As we have seen, there is a profound ‘cross’ citation practice between both communities, mainly from technology to science. Comparison of the motives between academic citations (cfr. section 9.IV.1.) and patent citations may provide relevant insights in the differences in citation behaviour between the technology and the science system and also the value of patent citations, still a subject of discussion. At first sight, the role of citations in patent documents is intrinsically different from academic citations. Primarily, this is due to the legal function and character of a patent document and the social process of patenting, implying the involvement of several parties (examiner, applicant, patent attorneys) during the patenting process. A patent is a property right based on a sealed claim. For the property right to be recognised by other competitor companies, all property right details have to be public (Grupp, 1998). The description of the so-called ‘prior art’ leads to a set of one or more claims. All relevant documents (patents and other documents) that are known to the inventor, and that present those aspects in which the product or process has to be considered innovative, have to be mentioned. The role of the examiner versus that of the applicant differs per patenting system (more about this in the next section). The patent examiner’s task is to ensure the originality of the claimed invention and to identify the limits to that originality. His or her citations aim at defining the area in which the invention is truly original, by assessing the state of the art, and therefore fulfil the criteria for granting a patent. The examiner refers to the claims of the patent application. Thus, one of the major motives to cite a specific document 90 results directly from his/her responsibility and is related to the academic citation motive: ‘Disclaiming work, ideas or claims of others’, negative claims and homage. Two other citation motives within the scientific community also seem to play a role for the patent examiner to cite scientific literature (and patent documents): ‘Providing background reading’, in view of the information function of a patent (Grupp, 1998), and ‘Identifying original publications in which an idea or concept was discussed’, again in view of the examiners obligation to search for novelty and originality of the patent applied for. As a consequence, documents considered relevant to the examiner, are cited. There seem to be no other similarities between the citations in both spheres (cfr. section 9.IV.1.). B. Examiner versus inventor-given references With respect to the source of NPRs in patent documents, one can distinguish between inventor and examiner-given references. Nevertheless, both types of references stem from a different logic. Whereas the task of the examiner is to ensure the novelty of the invention claimed and to identify its limits, the inventor’s principle task is to identify work "either related to, but significantly different from, or else a useful step towards, the new invention or a use of the invention" (Collins and Wyatt, 1988, p. 66). However, the inventor’s task has to be regarded in light of the patenting procedure handling the application (cfr. infra). Examiner citations, as a result, usually complement, rather than duplicate the citations given by the inventor, although both examiner and inventor not seldom refer to the same publications (Collins & Wyatt, 1988; Tijssen, 2001; Meyer, 2000a). According to a survey by Schmoch (1993), about 8% of all examiner citations originate from the inventor. In a study performed by Tijssen et al. (2000) this percentage even amounted 35% (see also Narin et al., 1989). They also indicated that in another 35% of all cases, the inventors introduced most of the NPRs. As for the citation characteristic, applicants are more likely to cite the journal literature and examiners to cite reference books, conference proceedings and abstracting journals. Collins & Wyatt (1988) as well as Meyer (2000a and 2000b) argue that because citations in patents have a legal character, they are more carefully selected, and as a consequence less superfluous than citations in scientific publications. However, they are not free of problems either. Collins & Wyatt (1988) point to a number of these problems. The most important are: • Examiners tend to restrict their reading to a narrow range of specialties and to be relatively unfamiliar with the wider literature; they frequently cite in secondary form (abstracts), probably because of the availability of abstracts databases making the search for prior art easier. • The use of the same set of citations by one examiner in several different patents, suggests an occasional tendency to cite by rate, rather than by relevance. • Both examiners and applicants/inventors may be affected by national chauvinism in their citing practice. Furthermore, in view of the limited number of articles cited on the ‘front-page’ of patents, Collins & Wyatt (1988) question the competence of the patent examiners. They point out that inventors have a greater tendency to cite the scientific literature than the examiners. According to them, examiners do not only cite a few articles, but in addition, they may even cite these ‘badly’. C. Motives of the examiner to cite NPRs Whereas in the previous section we looked into the general tasks of the examiner (the legal responsibility) in comparison to the inventor’s tasks, in this section we will look more specifically into 91 the motives for an examiner to cite NPRs, and not patents for example. Based on interviews and technical discussions with patent examiners, lawyers, and others in the field, Grupp & Schmoch (1992a) infer six reasons for examiners to assign an NPR to the prior art description. Each reason influences the degree of science involvement in the invention process. These reasons, together with an indication of science involvement, are presented in the textbox below. Reasons for assigning NPRs to a patent Reflection Science Involvement I. Prior art in science and technology is still not covered by any patent document. Medium/High II. The patent examiner wants to cite non-patentable research results (e.g. formulae, High theories, discoveries, software development, and medical cures). He has to cite literature documents because patent documents are not available. III. The specialist area subject researched is developing so quickly that preceding patent Medium documents from foreign patent offices are still not accessible to examiners owing to the publication periods (but only their own documents even prior to publication). If the inventor has published scientific papers on similar subjects in non-patent literature, then these documents can be cited. IV. A company dispenses with a patent application and publishes the result of the Low development in the company’s own journal in order to safeguard the novelty claim visà-vis competitors even without a patent application. This document will be involved if it is relevant. V. The necessary references to Japanese language documents, incomprehensible to most None European and American examiners, may feature, via an English abstract service in the search report in the form of an NPL reference, whereas in reality they are patent citations. VI. References to very simple relationships that fall below the patentability threshold (e.g. None how a screw is tightened or a mechanical clock wound) are essential. Schoolbooks and encyclopaedia’s, therefore NPL sources, are mentioned. Adapted from Grupp & Schmoch (1992a) – Extended by author Some of these motives reflect an intensive science involvement, others not so unambiguously. Motives 5 and 6 are judged to be unimportant for science involvement. As a consequence, NPRs of such an origin should not be involved in the construction of indicators. Citations related to the 4th motive concern inventions that, through management decision, fall outside the commercial interests of the producing company. These need to be excluded from indicator construction as well. The 3rd motive pleads, though not necessarily, for the presence of a science interaction. The 2nd motive reflects a science connection, with a quite high level of probability. According to Grupp & Schmoch (ibid), the 1st motive reflects the classic situation in which - following Tijssen (2001) - examiners are probably more inclined to copy applicant-given NPRs when the prior art in science and technology is not yet, or only marginally, documented in the form of patents. This may particularly be the case in new technical fields or in young research-intensive fields where rapid developments have not yet reached the stage of patenting and are documented only in research papers and technical reports. Two elements are of interest here. First, according to Grupp & Schmoch (1992a), prior art in the form of patents is still not discovered. Second, according to Tijssen (2001), this leads to an increase in the number of applicantgiven citations included by the examiner. 92 D. Types of non-patent references In addition to the distinction that can be made between inventor and examiner citations, one can also in European search reports only - distinguish between different types of patent citations and as such obtain more insight into the relevance of a citation. As indicated by Schmoch (1993), the examiner has to look for earlier documents - primarily patents - that have the same or almost the same features as the patent application. Only if there are no other relevant documents questioning the novelty of the invention, will the patent application be accepted. In line with these responsibilities and tasks, we can find two types of patent citations (Schmoch, 1993): (1) documents of particular relevance, and (2) references concerning the general background. Documents of particular relevance restrict the claims of inventors. They are marked with the letter "X" when they, if taken alone, question the novelty or inventiveness of a patent claim. If a document is considered to question inventiveness of a particular claim, when taken in combination with another document, it is marked with the letter "Y". Citations documenting the technical background of the invention are marked with the letter "A" (Meyer, 2000a, 2000b). The EPO is the only patenting office that marks the different categories of references explicitly. Different degrees of linkage (or proximity) to the examined patents may be associated with the different types of cited references. The “A”-type references are general information for the applicant and are important, first of all, for the assessment of the so-called ‘inventive step’, which forms the second basic criterion for the approval of a patent. References of type “A” have low linkage proximity, whereas “X” references have a high degree of linkage. The linkage degree of “Y” references may vary since they are important only in combination with other references. When considering the “X” and “Y” type of references as of high-medium proximity, according to Schmoch (1993), then 29% of all citations can be considered “closely linked to direct knowledge sources”. On the other hand, this implies that almost 70% of all citations have no close linkage to the examined technological invention. Although only 29 % of all citations are closely related to the patents in question, all references can be considered relevant for the analysis of the S&T interrelation analysis, or “…patent citation analysis can be helpful for tracing the knowledge transfer without a distinction between the different types of citation” (ibid, p. 195). IV.1.2. Differences in citation intensity As Meyer (2000a, 2000b) pointed out, the patenting process is mainly a socially embedded process, during which many parties and considerations, and even interests (cfr. supra) play a part. The entrance of NPRs during this process is also influenced by this factor. Of particular interest is to look into the causes of varying numbers of NPRs between technology areas, countries etc. In other words: are there any plausible explanations for the fact that NPR-intensities vary? This is an important question, as its answer directly influences the interpretational possibilities, and the validity of this type of analyses. In view of the discussion carried out above, we can say that the occurrence and frequency of NPRs is a function of a) patenting procedures, b) the technology field under analysis, and c) the country of analysis. A fourth rather subjective factor can also be added, namely the decision making process of the patent examiner. We will discuss these factors more in detail below. A. Influence of patenting procedures Procedural differences between national patenting systems, and specifically between the European patent office and the United States patent office, also influence the availability of NPRs (Meyer, 93 2000b). It is in this context that Schmoch et al. (1993) identified another reason why EPO and USPTO comparisons are important: reliability. If the differences in the NPR structure between both patenting systems are major, then the reliability of the adopted approach needs to be called into question or a proper explanation needs to be found. Let us review a number of differences between the EPO and the USPTO (cfr. section 7.I.2). According to Schmoch et al. (1993), in science-intensive fields like biotechnology, the USPTO is very slow in granting because of an insufficient number of qualified staff in these areas. This implies that the number of EC-assigned patents at the EPO with equivalents at the USPTO is in reality much higher. Another interesting aspect is the difference in the number of citations between the EPO and the USPTO patent data. Again, we can find the answer by taking a closer look at the two examination procedures (Meyer, 2000a, 2000b). At the EPO, only the main claim is being examined. Most important is that the main claim meets the three criteria: novelty, non-obviousness and industrial applicability. In contrast, at the USPTO, all claims including dependent claims are investigated in detail. The most important reason for differences in the number of citations is the so-called “duty of disclosure” in the US. Meyer (ibid) explains: “In Europe, it is up to the applicant to introduce prior art known to him in the examination procedure or to refrain from doing so, whereas, US law stipulates that the applicant has to cite any prior documents known to him to the USPTO as long as the application is under examination” (p. 424). Not complying with this requirement is considered as fraud by the USPTO and can be used as a ground for invalidating the patent (Rainer Bertram, personal communication; in: Meyer, 2000b, p. 424). Table 6 provides an overview of the differences between the European and the US examination practices. Table 6 – Differences between the European and the US examination practices Issue EPO USPTO “Someone skilled in the art”, Specialist is well educated, so Specialist is less educated, so he the “average specialist”. for him, a less detailed needs a very detailed description description is sufficient. and many references to other documents. Searches, search reports It is said that, in many cases, the The quality of US researchers is EPO report is better, due to a broader access to relevant limited by their focus in English-language documents. material. US examiners are under time pressure. Education requirements of Generally higher than those of Generally lower than those of patent examiners their US counterparts. their European counterparts. Claims Focus on umbrella of claims. Many claims. References No duty of disclosure; Focus on Duty of disclosure: all relevant relevant citations. documents have to be indicated by the applicant party. Source: Meyer (2000b) According to Meyer (2000b), the differences between the EPO and the USPTO patenting system at the micro-level, might affect the macro interpretation of patent citations. As USPTO patent practice includes the examination of both major and other claims, a greater variety of linkage and a greater frequency should be expected. USPTO data seem to be more appropriate if the analysis aims at the widest possible cognitive web of interconnecting science and technology areas. 94 B. Field-specific influences Research has shown that differences in citation behaviour occur between technology domains. MeyerKrahmer and Schmoch (1997) observed, for example, that pharmaceutical patents cite scientific publications to a much bigger extent than mechanical and automobile patents. Meyer (2000a) indicates that electronics-related patents have a higher level of NPRs than materials-related patents. Narin and Olivastro (1992) also found significant variations in the number of NPRs present in patents belonging to different technology fields. High science-citing fields included "drugs and medicine" with an average of 3,19 NPRs per patent - and, to a lesser extent, "chemistry and chemical products" (0,77 NPRs per patent) and "professional and scientific instruments" (0,54 NPRs per patent). Low science-citing domains included "machinery" (0,12 NPRs per patent) and "transportation" (0,01 NPRs per patent) (Narin and Olivastro, 1992). According to Collins and Wyatt (1988), patents will cite substantial numbers of scientific papers if they are in a field that is relatively young, developing rapidly and with a strong scientific component. To quote Meyer (2000a, p. 426): “Different fields have a different nature of interaction”. In addition, one should also keep in mind that not all ‘technology’ is patented. A well-known example is software, which for reasons of market dynamics and actual exclusion from patenting, has consequently been a field with little patenting. As such, we can conclude that different fields have a different nature of interaction between science and technology. In some fields the intensity of the interrelation is stronger - indicated by a higher number of NPRs per patent - when compared to other fields. According to Meyer (ibid), this would also argue for sector-specific technology transfer policies. In addition to the fact that the number of NPRs appears to vary significantly according to the technology domain the patent belongs to, Narin et al. (1997) found that the degree of science linkage also is very subject specific. Their study shows that drugs and medicine patents invented in all countries cite almost exclusively to papers in the scientific fields of clinical medicine and biomedical research. Similarly, chemical patents appear to cite chemistry papers heavily, while electronics patents heavily cite to physics and engineering papers. C. Country specific influences In literature it has been frequently discussed whether the differences in citation patterns between countries, and specifically between the US and other countries, are due to actual country differences such as language. On the one hand, it has been pointed out that there are intrinsically national differences in the science bases of countries, but on the other hand, an investigation of the science intensity of the technological landscape of Germany, performed by Grupp and Schmoch (1992b), also discussed in Meyer-Krahmer (2000), showed that the link with science is an internal characteristic of the technological areas under review, and that there are hardly any differences between countries. It is also shown that a country must have a certain scientific level if it wants to ‘participate’ in a sciencelinked area. In addition, Narin et al. (1997) found a strong national component in the linkage between science and technology. They found that each country's inventors in the USPTO-system cite their own country's papers two to four times as often as expected, when adjusted for the size of the country's scientific publication rate. This strong domestic component that exists in the science-technology linkage shows that each country's inventors are preferentially building upon their own domestic science. 95 IV.1.3. Interpretation of the science – technology linkage based on citations In several studies – e. g. Grupp and Schmoch (1992a) – it has been indicated that NPRs can be seen as an indication of the “science relatedness” of a technology field. Meyer (2000a, 2000b) investigated the nature of the citation link established by an NPR, by looking into a sample of ten patents in detail. His findings clearly confirm the idea that there is no direct relationship (causality) between cited paper and citing patent. It appeared that, in the cases he analysed, the cited literature source rarely seemed to be the original source of the idea leading to the invention. Only in one of the ten cases, one can draw an “antecedent” direct cognitive link between a patent and a particular scientific article. Instead, the relationship can be more characterised as highly mediated. In a study performed by Tijssen & Buter (1998), slightly different outcomes were shown. In a survey involving 50 Dutch inventors, with USPTO granted patents, it appeared that in 94% of the patents, and thus of the inventors questioned, ‘in-house’ scientific research played a “very important” or “reasonably important” role. The conclusion that can be drawn is that a citation link (a patent citing a scientific paper) at least ‘connects’ a patent to a paper. As Meyer (2000a, p. 421) puts it: “It tells whether the invention touches an area where there has been no patent before, but a scientist has published something relevant”. What does this mean for the use of citation links for establishing the S&T interaction? NPRs provide an important tool for analysing the S&T interaction and for identifying and revealing cognitive webs of interactions between both spheres. The evidence is there. The studies performed by the CHI (Computer Horizons Incorporated) concluded that the technology areas whose patents cited to scientific papers were, in fact, rated by their peers as far more science dependent that areas of technology which did not relate to science (Narin & Olivastro, 1998). In 1993, Schmoch et al. (1993) confirmed these findings in a study performed for the European Commission focussing on the evaluation of EC programmes in the areas of agro related biotechnology and electronics & information technology. They identified similar science intensive technology fields as CHI. One of the conclusions in the above-mentioned study is that the operationalization of science intensity, through references to scientific publications, can be a starting point for many interesting investigations, but that, also in view of what has been presented in this report, a careful methodology is necessary to obtain valid results. According to Meyer (2000a), several writers in the literature on innovation and technical change have emphasised that information contained only in scientific papers will not suffice to implement the technology in question (see for e.g. Pavitt, 1987; Rosenberg, 1990). This suggests more mechanisms in the linkage between science and technology, and a not so straightforward interplay. Rabeharisoa (1992) states that there is no global, unequivocal answer to the question if articles, cited on the front-page of patents, trace links between different scientific and technical entities (cfr. section 9.IV.1.3.). However, the individual case rules: some examiners will cite scientific articles consistently and others will not (cfr. 9.IV.1.1.). The author continues to argue that the reliability and also the relevance of ‘front-page’ cited articles is a ‘local’ question and furthermore depends on the domain chosen and on the database used. In view of the previous discussion, relying upon journal citations becomes a delicate affair. Naturally, this has implications for the way in which S&T linkage results should be interpreted. As this discussion is related to the direction of the knowledge flows between science and technology, we also refer to chapter 7 for more details. 96 IV.1.4. Relevance of direct linkage approach from a policy perspective According to Meyer (2000a), the method of patent citation analysis is still valid as a policy tool. The method is still useful for policy makers in order to illustrate the science-relation of technological fields. More strongly expressed, even if the use of citation analysis for the illustration of the S&T relationship only reveals those science fields ‘touched’ by specific fields of technology, this can only be regarded as being of enormous value for the support of policy decisions in the S&T area, especially when combining the S&T link with other analytical tools. There are three major applications of patent citation analysis that we would like to discuss more in detail (Meyer, 2000c): following the general science orientation of fields over time (revealing a web of science and technology linkages); measuring the intensity of science and technology interaction; and tracking potential knowledge flows between scientific and technological fields. The first application allows for observing potential governance shifts in certain S&T areas. Areas in which instrumentation developed in early stages preceded and enabled scientific exploration, a development that gave the opportunity for further technological development (a so-called ‘Rosenberg pattern’). An example of such a field is nanoscience and –technology (Meyer & Persson, 1998; Braun et al., 1991). Identification of shifts from ‘applied’ to more ‘basic’ oriented technologies, or in the opposite direction, may have important policy implications concerning support and organisation structures. The second application, measuring the science – technology interaction, illustrated in the present study, measures the relevant interaction between science and technology fields. As already mentioned, patent citations can be used as a unit of analysis for establishing the link between both spheres. Science – technology interactions may reveal key science domains for certain technologies and provide an impression of how intensively scientific domains are related to technologic domains, and how this interaction co-develops. By broadening the approach, it also becomes possible to identify specific areas in which science and technology interactions could be promising but are not present; potential areas for future activity can be identified enabling policy makers to pinpoint them and intervene. The third application of patent citation analysis concerns the possibility of tracking and examining potential knowledge flows between specific (sub-) fields of science and technology. Analysis of patterns in which (sub-) fields interact, will increase the understanding of the science and technology interaction. The level of examination of the knowledge flows can vary from high (level of (sub-) fields of science and technology), medium (countries, regions or organisational entities such as universities, companies etc.) to low (identification of potential key actors). Patent citation analysis can give important clues as to the location of the knowledge flows and the occurrence or absence of knowledge flows in certain (sub-) fields, countries, and regions. All three applications will be utilised in our study, albeit in view of the necessary global focus. IV.1.5 Science – Technology linkage indicators In this section we shall discuss the different indicators – based on NPRs - and thec subsequent analyses that can be performed. Depending on the aspect of the linkage they focus on, we can distinguish between two major types of science-technology linkage indicators: those describing (1) the magnitude or intensity of the linkage; and those describing (2) the nature of the linkage. After having described both types, we shall elaborate on a number of drawbacks, or better put, issues on S&T linkage approach and linkage statistics. 97 A. Indicators describing the magnitude or the intensity of the S&T interaction This first type of indicators is based on simple counts of the number of non-patent references that occur in a given set of patents. Under the basic assumption that the degree of science intensity of a particular technology domain is reflected in the number of references to scientific publications given in patents, one can interpret such counts as a measure for the magnitude or intensity of the linkage between science and technology. In a next step, the average number of NPRs per patent (and specifically the citation to scientific papers) can be calculated in order to compare the magnitude of the science-technology linkage between various countries or regions. Narin et al. (1997), in their analysis of USPTO-patents over the period 1985-1995, observe a steady increase in science linkage across all countries and technologies. This is illustrated in Table 7 on a country-by-technology basis and in Figure 9 on a country-by-country basis. Table 7 − Evolution in the average number of science references in USPTO patents Technology area 1985 1990 1995 Chemical Patents 0.74 1.30 3.18 Drug and medicine patents 2.17 3.78 8.66 Electrical component patents 0.45 0.73 1.00 Scientific instruments patents 0.41 0.58 1.27 Source: adapted from Narin et al. (1997) Figure 9 − Increase in science-technology linkage as reflected in the number of NPRs in USPTO patents for five countries S cience References per US P T O P atent (sm oothed) 1,6 US 1,4 1,2 UK 1,0 0,8 France Germ any 0,6 Japan 0,4 0,2 0,0 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 T ime Source: Narin et al, 1997 Tijssen & Buter (1998), in a study on the importance of Dutch scientific research for technological innovations, concluded that Dutch science is increasingly used in technological innovations, based on the average number of NPRs per patent. The average number of citations increased with 42% (2.9 per patent). Practice has shown that the distribution of the number of NPRs per patent is very skewed. A 98 large number of patents contain no examiner-given NPRs, whereas only a few patents contain a lot of NPRs. As a result, a relatively high value for the average number of NPRs per patent could be caused by only a few patents containing a (very) high number of NPRs. In order to assess this problem, Van Vianen et al. (1990) have introduced two indices that provide us with information on the low-frequency end − patents containing not a single NPR − and the high-frequency end of the distribution − patents containing many NPRs. The first index, related to the low-frequency end of the distribution, is the proportion of patents containing zero references to scientific literature (P(0)). The second index, corresponding to the highfrequency end of the distribution, is the proportion of patents containing five or more references to scientific literature (P(5)). Grupp & Schmoch (1992b) introduced a more sophisticated indicator for measuring the magnitude or intensity of the linkage between science and technology. Their "mean NPL index" or "NPLM-index" is also based on simple counts of NPRs. For a particular technology domain j and country i the mean NPL index is calculated as follows (Grupp & Schmoch, 1992a; Meyer-Krahmer, 2000): NPLM ij (t ) = with Pij (t ) = NPL (Pij , t ) = NPL (Pij , t ) Pij (t ) the number of patents in year t for country i in technology domain j the number of NPRs in year t in patents from country i in technology domain j This NPLM index gives the mean value of the number of references to scientific literature in the patents from a particular country in a particular technology domain. One of the disadvantages of the mean index, according to Van Vianen et al. (1990), is that the mean is very sensitive, in a bad sense, for very ‘skewed distributions’ (i.e. in case that a lot of patents contain no NPRs and that only a few patents contain a high number of NPRs). For the visualisation of the results, this NPLM index can be transferred into a relative, standardised form (Schmoch, 1997): æ NPLM ij å Pij ö ÷ RNPLij = 100 × tanh ln ç ç å NPL (Pij ) ÷ è ø The RNPL index gives the deviation of individual countries and technology domains from the overall average. By taking the logarithm and the tangens hyperbolicus, the RNPL index becomes symmetric within the boundaries of +100 and −100, with 0 as neutral point. Positive values for the RNPL index correspond with science-technology linkage intensity above average, whereas a negative value reflects linkage intensity below average (Schmoch, 1997; Meyer-Krahmer, 2000). Schmoch (1997) used the RNPL index to investigate the science-technology linkage intensity for 30 different technology domains. Calculations were based on the total number of EPO-patents for the period 1989-1992. Figure 10 illustrates the outcome of this study. Schmoch (1997) encountered the highest science-technology linkage for the domain of biotechnology, the linkage intensity for domains such as chemistry, microelectronics, and information technology was also above average. The areas characterised by a relatively weak science-technology linkage − as reflected by the RNPL of a particular domain − are generally in the area of mechanical and civil engineering. 99 Figure 10 − Intensity of the science-technology linkage for 30 technology areas (EPO 1989-1992) 100 90 80 Sem iconductors (66) 70 Pharm aceutics (68) 60 O rganic Chem istry (61) Food (55) 50 Optics (39) 40 Telecom m unications (32) 30 Basic m aterials chem istry (21) 20 Surfaces (17) 10 Polym ers (5) 0 Environm ental technology (-17) M aterials processing (-28) Biotechnology (85) -10 -20 -30 -40 -50 Data Processing (42) M aterials (37) Audiovisual technology (27) Control technology (20) Nuclear technology (8) Electrical energy (-12) Chemical engineering (-2 6) M achine tools (-46) Engines (-57) Food processing (-61) Handling, printing (-66 ) -60 -70 Therm al processes (-64) M edical technology (-70) Transport (-80) -80 Consum er goods (-88) -90 Space technology (-78) M echanical elem ents (-85) -100 Civil engineering (-9 3) Source: Schmoch, 1997 B. Indicators describing the nature of the S&T interaction The second type of indicators - those describing the nature of the linkage - are not based on the simple counts of non-patent references that occur in a given set of patents, but on the characteristics of these non-patent references. Patents are then classified into groups on the basis of the nature and characteristics of the NPRs included in the patent. More insight into the nature of the S&T linkage can be obtained by studying several characteristics of the NPRs and the patents involved. This can, for instance, be achieved by investigating which technology domains refer to which science domains, resulting in a sort of "citation matrix". This higher level of analysis enables to model interaction between areas of technology and science. Other inferences on the nature of the science-technology linkage can be made by studying the age distribution of NPRs, the type of journals referred to, the author affiliation of the papers referred to, and so on. The most important aspects that can be investigated are described below. B.1. Modelling interrelations between science and technology areas First of all, the nature of the linkage is often investigated in terms of the science fields that are cited by the NPRs in the patents. As such, we can see which technology fields are linked with which scientific fields, thus arriving at a so-called "linkage scheme" or "citation matrix". This can be done for all patents in general, or for the patents belonging to specific technology domains in particular. Patents are assigned to a particular technology domain on the basis of their IPC classification codes. Scientific papers are assigned to a scientific discipline based on the journal they were published in. Journals are 100 assigned to one or more scientific disciplines based on the journal’s general contents. According to this procedure, cognitive links between citing patents and cited papers can be investigated. Tijssen (2001) for example examined the nature of the linkage between citing patents and Dutch cited papers at a rather aggregated level. Papers were classified into major scientific disciplines according to the ISI-defined journal sets ("subject categories") and the definitions used in the Science and Technology Indicators Report issued by the Dutch Observatory of Science and Technology. Citing patents were classified into one or more product fields, in accordance with the associated US-based SIC codes best corresponding to the origin of technical knowledge. Patents were further classified by aggregating those SIC groups into five generic fields, corresponding to broad industrial sectors. The distribution of citations to Dutch research papers on USPTO patents, by citing sector and cited scientific discipline (in percentage of total per sector) for patents granted in 1995-1996, as found by Tijssen (2001), is presented in table 8. Contributions with a percentage share of less than 5% in a sector were not shown. The results, according to Tijssen (2001), display an intricate web of interrelationships where each sector is characterized by its own distinctive multidisciplinary "science linkage" profile. This also illustrates that technological advances in the industrial sectors in question depend on a wide range of scientific disciplines. As such, we can see that the Pharmaceuticals and Medicines sector are primarily linked to scientific fields like Biomedical Research, Clinical Medicine and Biochemistry and Biology. The Instruments and Control sector, on the other hand, heavily relates to scientific fields such as Electrical Engineering, Physics, but also Clinical Medicine. Notice, however, that this "citation matrix" is based on citations to Dutch research papers only. 101 Table 8 − Distribution of citations to Dutch research papers on USPTO patents by citing technology domain and cited scientific discipline (USPTO 1995-1996) TECHNOLOGY DOMAINS Pharmaceuticals and medicines Chemicals and materials Biomedical Research 29% 17% Clinical Medicine 19% 17% Biochemistry and biology 28% Chemistry SCIENTIFIC DISCIPLINES Pharmacology 7% Instruments and control Electrical and electronics Machinery and transport 19% 7% 26% 6% 19% 20% 14% 17% 5% 23% 22% 9% Biotechnology 20% Chemical Engineering 6% Physics 18% 7% 9% Biomedical Technology 9% 5% 6% Electrical Engineering 24% 5% Mathematics and computer science 7% Source: adapted from Tijssen (2001) By incorporating the citing countries into the analysis, the linkage scheme can be analysed on a more detailed level. Narin and Olivastro (1992), for example, investigated the linkage between science and technology domains by determining the scientific fields that are cited in patents for a number of major product field categories. Figure 11, for example, shows the average number of NPRs to SCI journal papers, per cited science field and citing country for the technology domain of computing and communications. The data relate to USPTO patents issued between 1987 and 1988. 102 Figure 11 − Cited scientific fields and citing countries for the technology domain of computing and communications Citations per patent 0, 3 0,2 0, 1 Other En & Tech Phy sics Chem 0 France UK C ited Field C M & BR Germ any Japan US C oun try Legend: CM & BR = Clinical Medicine and Biomedical Research; Chem = Chemistry; En & Tech = Engineering and Technology Source: adapted from Narin and Olivastro, 1992 B.2.Investigating other aspects of the linkage between science and technology As we already mentioned, other aspects of the science-technology linkage can be investigated as well. This kind of analysis is always based on the classification of NPRs occurring in the patents under study, according to their intrinsic nature. Van Vianen et al. (1990), McMillan et al. (2000), and Schmoch et al. (1993) for example, investigated the nature of the linkage between science and technology by examining (a) the age distribution of NPRs, (b) the type of journals (basic versus applied) patents refer to, (c) the institutional breakdown of cited research, and (d) the country of origin for the cited papers. Several other analyses could be performed in a similar way. We will briefly describe and illustrate each of the aforementioned analyses. Age distribution of NPRs The nature of the linkage between science and technology can be described by examining the age distribution of references to scientific literature. In order to determine this age distribution, Vianen et al. (1990) calculated the age of all references, whereby "age" was defined as the difference between the year in which the patent was granted and the year of publication of the cited paper. Notice that the "age" of a NPR is not completely comparable to a reference in a scientific publication, since the time lag between applying for and granting a patent is on average between 18 and 24 months (in certain areas even more). Furthermore, when attempting to capture possible knowledge transfers like in the case of q 103 patent citation examination and ‘age’ of the citation, the time lag between submitting an article to a journal and its publication after a referee procedure has also to be taken into account (Schmoch et al., 1993). Van Vianen et al. (1990) examined the age distribution of non-patent references in the domain of chemical technology. The results are displayed in figure 12. The data relate to USPTO patents issued in 1985. Figure 12 − Age-distribution of non-patent references in the domain of chemical technology 10% 9% 8% % of total 7% 6% 5% 4% 3% 2% 1% 0% 0 5 10 15 20 25 30 age Source: Van Vianen et al, 1990 Van Vianen et al. (1990) found that the age distribution was very negatively skewed. Most of the cited papers were relatively recent ones, with a median age of four years. 50% of the non-patent references appeared to be less than 10 years old, while less than 10% was older than 30 years. They argue that the age distribution of NPRs gives an indication of the topicality of the technology domain represented by the patents. They argue that a so-called "high-tech" field is based on very recent science, whereas a more "established" technology domain would refer to somewhat older scientific publications (see also Narin & Noma, 1985). By comparing the age distributions of NPRs for different technology domains or different countries, one should get an insight in the "recentness" of the science part that is linked to a particular technology domain or a particular country's technological system. From a science and technology policy perspective, the authors argue, the question whether or not a country is involved in rather "high-tech" areas might be of interest, since these kinds of areas are generally supposed to present opportunities for economic growth. 104 The type of cited journals: the continuum “basic to applied” Another way of examining the nature of the linkage between science and technology is by investigating whether science that is cited in a set of patents tends to be more basic or more applied. In this respect, the National Science Foundation and Computer Horizons Inc. (CHI) jointly developed a classification scheme for the journals covered by the Science Citation Index (SCI). This classification scale ranges from 1 to 4, with a score of "1" indicating that a journal is "very applied" and a score of "4" corresponding with a "very basic" research journal. McMillan et al. (2000) analysed the nature of the linkage between science and technology in the domain of biotechnology. They also investigated the degree to which biotechnology firms rely on basic or applied research. They first selected a sample of 119 biotechnology companies. Up to 1997, these 119 companies acquired 2334 patents in total, which contained 23.286 NPRs on their front pages. These 23.286 NPRs were eventually traced back to 6884 different SCI-covered publications. According to the aforementioned classification, publications were classified into four categories on the basis of the journal they were published in. The result is displayed in table 9. q Table 9 – Research orientation of Biotechnology firms Level of journal Number of papers Percentage Unknown 1 (=very applied) 2 3 4 (= very basic) 12 150 541 1749 4432 0.3% 2.1% 7.9% 25.4% 64.3% Total 6884 100.0% Source: McMillan et al. (2000) The findings of McMillan et al.'s (2000) study clearly show the degree to which biotechnology firms rely on very basic research. This in turn indicates the need for biotechnology firms to develop and maintain a continued internal basic research ability in order to evaluate these external sources − see in this respect the notion of "absorptive capacity". q Institutional breakdown of the cited research: public versus private science McMillan et al. (2000) also investigated whether the domain of biotechnology is predominantly linked with public or private science. They found that 71.6% of NPRs originating from patents granted to the 119 biotechnology companies in their sample, were to papers written solely at public science institutions, such as universities, medical schools and research institutes. 11.9% of the NPRs were to papers being joint efforts of public and private institutions, while only 16.5% of the cited papers originated entirely from private companies. The results are displayed in table 10. Table 10 – Type of institutional origin of cited NPRs of biotechnology patens Author institution type Number of citations Percentage Public Private Public and private 8568 1971 1422 71.6% 16.5% 11.9% Total 11961 100.0% Source: McMillan et al. (2000) 105 These findings indicate that the biotechnology industry is primarily linked with science originating from public science institutions. Likewise, Tijssen's (2001) results on the study of Dutch papers cited in USPTO patents also illustrate the overwhelming importance of public science in the cited research literature. This is particularly the case in the pharmaceuticals and medicines sector, where he found that more than 70% of the papers originate from the public science sector. Finally, Narin et al. (1997) found similar results for the United States. Their results show that 70% to 80% of the NPRs were to publications (co-)authored by researchers working at universities or public sector research institutes. Institutional breakdown of the cited research: identification of knowledge producers Another frequently performed analysis is the one where papers, referred to by NPRs, are classified according to the institutional entity − rather than the type of institute − the author(s) of the cited papers belong to. In McMillan et al.'s (2000) study, the importance of public science to the domain of biotechnology was further demonstrated by the involvement of prestigious universities and laboratories in the cited papers. They identified the top 10 US author institutions for the sample of science references originating from the patents of 119 US biotechnology companies they investigated. Their results are displayed in table 11. q Table 11 – Institutional origin of NPRs in biotechnology patents Institution names Number of papers National Institutes of Health (Total) Harvard University National Cancer Institute UC-San Francisco Stanford University US Veterans Administration University of Washington Massachusetts Institute of Technology National Institutes of Health (unspecified) Massachusetts General Hospital 646 510 296 208 181 177 163 163 137 127 Source: McMillan et al. (2000) Country of origin An analogous analysis can be performed for the country of origin of the author(s) of the cited papers. On the basis of such an analysis, inferences can be made on the country specificity of the sciencetechnology linkage. It can, for example, be investigated whether there is a strong "national component" in the science-technology linkage for a particular country. This would mean that a country's technology is, to a higher than proportional extent, linked with research performed in the same country. q Narin et al. (1997) uncovered a very strong national component in the science-technology linkage. They found that each country's inventors in the USPTO system cite their own country's papers two to four times more often than expected, when adjusted for the size of a country's scientific publication rate. It appeared, for example, that although only 7% of all papers in the SCI were authored by researchers at German institutions, 17% of all NPRs on German-invented USPTO-patents referred to German scientific publications. In addition, this appeared to be a general phenomenon for all of the five countries − Japan, France, United Kingdom, Germany and United States − examined by Narin et al. (1997). Similarly, McMillan et al. (2000) found a very strong national bias in the US biotechnology industry. Likewise, Tijssen (2001) discovered a large domestic component in NPR linkages for Dutch science-technology linkages. 106 These results suggest that there is a strong domestic component in the science-technology linkage, showing that each country's inventors preferentially link with domestic science. An interesting observation is that the same national preference is apparent, not only in citation behaviour from patents to papers, but also from papers to papers and patents to patents. Narin et al. (1997) conclude from all these citing phenomena that there are very strong national ties between scientists within a country and inventors within a country, and between national inventors and scientists. This implies that a strong national scientific base and a strong national technology base go hand in hand, at least in areas where science and technology are tightly linked, for example in domains such as biotechnology, agricultural chemicals and plastics. B.3. Issues around S&T linkage approach and statistics Classification ambiguity in regard of S&T linkage A clearly classified observation structure is required for showing the connection between science and technology (Grupp & Schmoch, 1992a). Since no general valid classification of technology is available, it is an important advance that in the field of patent analysis there is a structured, uniform and internationally agreed upon classification system, the so-called International Patent Classification (IPC). Grilliches (1990) pointed out that the IPC is primarily based on technological and functional principles, in function of the patenting system, and that it is only rarely related to economists’ notions of products of well-defined industries. q In the case of economic research, quite some attempts have been made to link IPC-categories to economic categories of interest through concordance schemes (cfr. section 7.II.2.2.). However, the IPC classification does not always ally with a policy maker’s notion of a technology area. Therefore several attempts have been made in the past to converge the IPC classification towards broader technology areas. In a study of Grupp & Schmoch (1992a) a classification scheme of technology has been constructed based on IPC-codes (4th revision). The Fraunhofer (FhG-ISI) technology classification consists of 28 broad technological sectors, defined in 3-, and occasionally, in 4-digit IPC codes, being representative and homogeneous in terms of modern technology. The classification scheme has recently been updated/extended in co-operation with Observatoire des Sciences et des Technologies in France (OST) and the INPI. Also, for establishing the science content in terms of its intrinsic characteristics a type of classification is necessary. De Bruin & Moed (1993) comment on this task as follows: “An appropriate delimitation of scientific sub fields constitutes one of the key problems in bibliometrics”. According to the authors, there are several methods for classifying science sub fields. The most important ones are co-citation analysis, co-word analysis, the use of indexing systems produced by professional indexing or abstracting services, and the use of a classification of scientific journals into subject categories – nearing science sub fields. An example of a (meta-) classification used in several scientometric studies (see for example Collins & Wyatt, 1988; Narin & Olivastro, 1992; Malo & Geuna, 1999), is the classification produced by Computer Horizon Incorporated (CHI). This classification breaks several thousands of journals covered by the SCI down into four different levels, depending on the research type they contain, varying from ‘very’ basic to ‘very’ applied. 107 The journal classification of CHI has its difficulties and specifically in the field of physics and engineering (Collins & Wyatt, 1988, p. 66). According to Moed (1996) the SCI-ISI classification seems to be based to a large extent on information in the titles of the journals, while the above-mentioned CHI classification system is primarily based upon analyses of journal-to-journal citations. Furthermore, while the SCI classification is based on subject categories, which are and can be linked to science domains, the FhG-ISI classification is based on the type of science performed. This illustrates that classification systems can be intrinsically different, even when they are based on the same type of source data. Problems with linkage statistics Before turning to the indirect linkage approaches, we shall briefly elaborate on three issues or problems, inherent to the use of NPRs in the direct linkage approach. Some of these issues have been touched upon previously from a different point of view. The following will be discussed: 1) Database shortcomings: the need for unification and standardisation 2) Skewed distribution of NPRs 3) Complexity of the data q Each of these items will be discussed in detail below. 1) Database shortcomings: the need for unification and standardisation In chapters on patents and scientific publications (7 and 8), we have already dealt with specific problems around databases. In the context of the direct linkage approach and the use of NPRs, however, the heterogeneous format of NPRs and in many cases the incompleteness of the data (see also Schmoch et al. 1993) also play a significant role. As Grupp & Schmoch (1992a) mention, the patent databases usually have not been set up for statistical inquiries, but rather for patent lawyers, employees of patent offices and interested industries, who wish to clarify and research individual cases. When conducting statistical inquiries, double counting, incomplete data and spelling mistakes have to be taken into account. The question of reliability and in particular of reproducibility calls for a series of standardisations. In the table below we present a sample of NPRs originating from the USPTO and EPO data. Note the heterogeneity among the references. Table 12 – Examples NPRs Examples of USPTO journal references 1. 2. 3. Truelove, B. et al., Journal of Nematology, vol. 9, No. 4, Oct. 1977, pp. 326-330 Rodriguez-Kabana, R. et al., Nematropica, vol. 8, No. 1, (1978) pp. 26-31.’Ceramics In Surgery’ by R. P. Welsh et al., Journal Biomedical Materials mposium, vol. 2 (part 1), pp. 231-249, 1972 ‘The Experimental Use of Heterologous Umbilical Vein Grafts as Aortic Substitutes’, Singapore Medical Journal, vol. 3, No. 1, Mar. 1962 Examples of EPO journal references 1. 2. 3. 0203CHEMICAL ABSTRACTS, vol. 85, 1976, Columbus, Ohio, USA, BHATT S.B. et al. ‘A convenient method for the preparation of alfa-arylaminophenylacetic acids’, page 607, Abstract no. 123514h & Curr.Sci. 1976, 45(15), 547 0202JOURNAL OF THE CHEMICAL SOCIETY 1975, pages 1865-1868 0203L'ONDE ELECTRIQUE, vol. 55, no. 1, Jan. 1975 Paris J. BESSON, ‘Plaques photovoltaiques au sulfure de cadmium’, pages 21-24. * Page 21, column 1, paragraph 3 - page 21, column 2, paragraph 2; page 23, column 1, paragraph 2 - page 24, column 1, paragraph 2; figure 2 108 Collecting patent citations to scientific literature is a difficult task (Van Vianen et al., 1990). An important part of the data handling is ‘unification’ of the references: correction of journal abbreviations, page numbers, volumes, authors into a standard format (see also Narin et al. 1997). 2) Skewed distribution of NPRs Noma & Narin (1984) performed a study based on citation and referencing data from biotechnology patents and bioscience papers. They showed that there is a very skewed distribution of cited material in both patents and papers, with a relatively small number of highly cited patents and papers (see also Van Vianen et al., 1990; Schmoch et al. 1993; Schmoch, 1993). In other words, a large number of patents contain no references, whereas just a few patents contain many references to scientific literature. An important question when dealing with (very) skewed distributions is whether the value of the mean (average number of NPRs per patent) is a representative value. 3) Complexity of the data: multi-authorship The last problem deals with the data complexity. Narin et al. (1997) point out that the data used in linkage statistics are extremely complex. First of all, there are many different ways in which counts can be made (normal, adjusted, etc. – cfr. section 8.IV.3.3.). A scientific publication may be cited in several patents, and, on the other hand, a given patent may cite a number of different scientific publications. A paper often has authors from more than one institution, just like patents may have different inventors from different countries or institutions. The combination of the abovementioned factors gives rise to various complexities in counting and data-representation. 109 II. Indirect/implicit S&T interrelations The relation between science and technology cannot always be studied in a direct and straightforward manner. Scientific knowledge, used for technological application, is not always cited in patents. Noyons et al. (1994) found that the presence of fewer or no NPRs in patent is not necessarily an indicator of a lesser science intensity of the individual patents, but more an indicator of the technological nature of individual patents. As such, they conclude: “patents with no or only a few NPRs cannot be regarded as significantly less ‘science intensive’ than those with many NPRs”. In that regard, Meyer-Krahmer and Schmoch (1998) point out that a weak (direct) science linkage of a technology does not imply a low university – industry interaction. IV.2. Patents of scientific institutions and publications of industrial enterprises It is generally assumed that the main output of universities and research institutes consists of scientific publications, whereas the main output of industry consists of products and processes − represented by patents. However, academics can also take out patents and researchers in industrial enterprises can also publish scientific articles. This is illustrated in Figure 13. Figure 13 − Output versus Actor ACTOR OUTPUT University Industry Scientific publications MAINLY Secondarily Patents Secondarily MAINLY Source: adapted from Godin (1996) As a result of this general frame of mind, the number of studies dealing with articles published by industrial enterprises, on the one hand, and patents taken out by scientific institutions, on the other, has been limited in amount. In addition, the few studies that have examined this subject tend to be rather limited in scope, focusing on one technology or one industry − often pharmaceuticals − at a time (Godin, 1996). A first indicator of industry–university collaboration is the share of patents of scientific institutions (Meyer-Krahmer & Schmoch, 1998). A patent only makes sense for a scientific institution if it is interested in the commercial exploitation of a new finding and collaboration with an industrial partner (Meyer-Krahmer & Schmoch, 1998). According to Schmoch (1997), a high share of patents on the part of scientific institutions can be considered a good indicator for a close relationship of scientific and industrial laboratories in the technology field under analysis. He indicates that at the German Patent Office, the average share of patents held by scientific institutes is about 5%. Narrowly defined technology domains are often too small to perform any meaningful analysis on them. Nevertheless, this approach is easily applicable in countries where universities and research institutes appear as applicants and owners of patents, which is the case in the United States. 110 By simply generating applicant lists, one can proceed by identifying the patents of scientific institutions. In Germany or Switzerland for example, the professors can freely dispose of the intellectual property rights, and therefore appear themselves as applicant or sell their rights to firms. In some databases, they can be identified by the academic title “Professor” (Schmoch, 1997). Hicks (1995) has shown that companies publish scientific papers in order to signal scientific competence to academic research. Therefore publications of companies are appropriate indicators for the relationship between science and technology. A variant in the form of direct research co-operation between scientific and industrial institutions is documented by co-publications (in fact: co-authorship). It is also interesting to point out that, in the study of Meyer-Krahmer & Schmoch (1998), it appeared that academic researchers rank collaborative research distinctively higher than contract research, which they perceive as rather one-directional. The high ranking of informal contacts supports the relevance of knowledge exchange. Let us now consider the relation for the other angle: scientific publications (activities) of industry. As Godin (1996) mentions, for much of the last 35 years, basic science has been considered by economists as a "public good". Firms were thought to have relatively little incentive to do basic research, since research is expensive and risky and, once available, every firm can enjoy the results freely. As a corollary, firms are also recognised to publish relatively few scientific articles. Publications by firms are considered merely as a by-product of industrial activities and, if firms do publish, it is believed they mostly publish applied results, not basic ones (Godin, 1996). Recently, however, these assumptions have come under challenge. Since knowledge is complex involving "learning-by-doing" -, tacit - knowledge cannot be fully codified -, cumulative - it builds on what has already been learned - and firm specific, merely relying on extramural research is not an effective substitute for in-house research (Mowery, 1983; Godin, 1996). Rosenberg (1990) identified a number of reasons why firms perform basic research in-house. One of the most important reasons is that firms often need to do some amount of basic research, in order to understand better how and where to conduct research of a more applied nature. In addition, doing basic research in-house is indispensable in order to properly monitor and evaluate research being conducted elsewhere. Another important reason for conducting basic research in-house is that it serves as some kind of a ticket of admission to an information network. Indeed, the most effective way to stay connected with the scientific network is to be a participant in the research process (Godin, 1996). In accordance with these arguments, Nelson (1990) argued that firms have incentives to publish scientific articles: to (1) attract customers, (2) establish legal rights, (3) attract capital, (4) inform suppliers, and (5) gain reputation in order to relate to the scientific community and attract scientists and engineers. Although the number of scientific publications by industrial scientists is still relatively small compared to those from academics, the production of scientific articles by industry has increased by 50% in the last decades. In addition, some of these industrial publications have a very high impact on the research community. This is especially the case in some high technology industries, such as aerospace, chemicals, pharmaceuticals and electronics. Research has shown that companies primarily publish scientific papers in order to signal scientific competence to academic researchers (Hicks, 1995). Indeed, this appears to be the most effective way for getting involved in the communication of scientific communities and gaining access to the most recent research results. As a result, publications of industrial enterprises can be considered appropriate indicators for the relationship between science and technology. Indicators in this respect might be the relative importance of industrial publications in a particular scientific domain and its evolution over the time (Schmoch, 1997). 111 An enormous disadvantage of the mentioned approach is the lack of possibilities for automatic processing. Databases do not display special codes for universities, research institutions, and industrial patenting. Consequently, manual processing, which is time consuming and thus expensive, is necessary. In addition, an increasing number of the organisations that researchers work for no longer fit into the traditional university/industry dichotomy anymore. IV.3. Tendency to integrate scientific and technological activities Meyer (2000a, 2000b) indicates that there is a increasing tendency for researchers to integrate scientific and technological activities by working on one subject matter, but by generating scientific papers as well as technological outputs − in the form of patents. He describes a number of science-technology interactions he encountered when studying the front pages of a limited number of patents in the field of Nanotechnology. They are presented in Table 13. Table 13 - Types of science – technology interaction as reported in the cases studied Types of science – technology interaction as reported in cases Close personal science – technology linkage: individuals active both in academic research and industry. Doctoral candidate working in university-based corporate research institute. His work on the same subject matter has led to both scientific results and technological output. Public research institute is actively involved in both patenting and publishing. Not infrequently, patents and papers result from the same project. Invention patented was developed by a scientist who was active in a company and produced patents as well as scientific research papers, based on the same project. Data source: Meyer (2000a) As a result, investigating the publication as well as patenting activity of an actor that is involved in both kind of activities obviously is another way to explore the relationship between science and technology. In this respect, Schmoch (1997) suggests searches in patent databases with the target of identifying relevant inventors. In a second step, one could search publication databases for scientific articles published by these inventors. Noyons et al. (1994), for example, found that most inventors in the field of medical lasers are also actively engaged in publishing. They found that, at least for their sample, the number of publications related to the area of the invention significantly increased before the patent application. Rabeharisoa (1992) encountered similar results when analysing the field of fuel cells. She found that in times when the development of fuel cell technology was successful, inventors published articles describing their technical advances and the market prospects of the technology in question. In addition, stages of technical stagnation, reflected by a decreasing number of patents, were accompanied by a shift of the inventor-authors towards more fundamental research in electrochemistry (Schmoch, 1997). Obviously, the major drawback of this approach is the limited number of patents and publications originating from the same actors. IV.4. Co-activities (joint activities) between scientific institutions and industrial enterprises Direct co-operation between scientific and industrial institutions obviously is another indicator for a strong science-technology interface in a particular domain. Co-operation between institutions can take 112 the form of either scientific collaboration, which usually results in co-authorship, or technological collaboration, resulting in co-inventorship or co-ownership with respect to patents. By investigating the number of articles by academic researchers that are co-published by industrial colleagues on the one hand, and the number of patents by industrial enterprises that are either coinvented by or co-assigned to academic partners on the other hand, we should clearly get a better view on the relationship that exists between science and technology. Noyons et al. (1994) found in this respect that co-publications of universities and companies increased in the period before the patent registration and decreased afterwards. As such, it can be argued that the technology transfer between academic and industrial researchers is well reflected in publications linked to patents (Schmoch, 1997). Obviously, also here the major drawback pertains to the size of the sample involved. In addition, this approach is also very labour-intensive, since databases do not display particular codes for universities, research institutions and industrial enterprises. As a result, the records of the sample have to be edited in a manual way (Schmoch, 1997). In addition, an increasing number of the organisations that researchers work for no longer fit into the traditional university/industry dichotomy anymore. IV.5. Parallel observation of patents and publications The second approach, the parallel observation of patents and publications, compares the co-evolution of patents and publications in time (Schmoch, 1997). According to Rappa & Debackere (1992), the parallel development of publications and patents can be considered a strong indication for a close interaction between academic and industrial researchers. “The comparison of time series for longer observation periods supports the understanding of the interaction between industrial and academic research and thereby the process of technology generation” (Schmoch, 1997, p. 110). An example of parallel observation in the field of Neural Network is shown below. Number Figure 14 – Parallel observation of patents and publications in the domain of neural networks 5000 4500 4000 3500 3000 2500 2000 1500 1000 500 0 Publications Patents 80 81 82 83 84 85 86 87 88 89 90 91 Year of publication/application Source: Schmoch (1997) The simultaneous take-off of both graphs around 1987 is due to the (re-) discovery of a learning algorithm that had major implications for the performance of neural networks. In the last period, starting around 1990, we see that scientific publications start evolving differently than the patents. This 113 might imply a breach in the close S&T interaction in the field of neural networking. A methodological problem of this approach is the equivalent definition of a technology field in patent and publication databases. The problems around classification systems, as previously discussed (cfr. supra), are a limiting factor. As a variation of the comparative approach, Schmoch (1997) proposes to start the search in a patent database and to identify relevant inventors in the field. In a second step, publications of these inventors are searched for in publication databases. Rabeharisoa (1992) and Noyons et al. (1994) use a similar approach by examining the S&T relation through the identification of authors who are also inventors (inventor - author relationships). IV.6 Cartographical approach based on co-occurrences of publication and patent keywords The last indirect approach for analysing the S&T relationship that will be discussed in this report is mapping. Mapping techniques and the utility of maps for science and technology policy have already been discussed in the chapter on bibliometrics (chapter 7). In this section, we shall briefly discuss mapping as a technique for illustrating and analysing the S&T interaction. According to Noyons & Van Raan (1994) inventor - author relations occur in closely linked science and technology fields, which is not always the case. By using a cartographic approach, the link between science and technology is analysed on the basis of a cognitive overlap. This approach does not identify S&T links that are actually present, but rather links that could be there, or even should be there. In the study performed by Noyons & Van Raan (ibid.), a group of experts is used for the definition of the field of Optomechatronics, the field under study. The same definition has been used to identify relevant publications on the science side and also relevant patents on the technology side. From the set with selected publications a list of most frequently occurring controlled (classification terms of the database producer) and uncontrolled terms (terms given by the author reflecting the content) was generated. From the patent set, the top-100 of indexed words was generated. Two maps were constructed, by using co-occurrence frequencies, based on the same definition. It appeared that the cluster structure of the two maps is more or less identical, implying a relationship between the scientific and technological development. Korevaar & Van Raan (1992) also explored the possibility to map the interface between science and technology by combining bibliometric data on inventors and scientists. The findings represent the major features of the field of Catalysis well, both for the science side and for the technology side, as was validated by several internationally renowned scientists. It appears that mapping of the science and technology interaction results in interesting insights, offering an alternative for those situations where a direct linkage does not occur or where a direct linkage approach is not desirable. As no methodology is completely unambiguous, the choice for a certain methodology has to be made in regard of the specific aims and characteristics of each study. 114 - Sectoral breakdown and analysis, Number start-ups in the field, Institutional specification Normalisation of absolute cooccurrence values by using the ‘cosina’ formula, check on suitability of data Patent counts, Publication counts, # Organisations and citations per institution type Patent and Publication counts, Specialisation Indices (RLA) 2. 3. Controls (normalisation) Citation counts, Publication counts, Impact ratio’s Variables 1. Study § § § Type: indirect (cooccurrence) Type: direct (through NPRs) Type: - Linkage 115 § § § § § § § § § § § § § § § Data source(s): INSPEC T-field selection/class, keyword based search, IPC S-field selection/class, keyword based selection Data standardisation: General: two time periods, mapping of internal dynamics Data source(s): SCI T-field selection/class.: S-field selection/class.: construction of a classification based on journals (similar to SCI class.) Data standardisation: extensive (author name, publication year, journal title, volume, starting page number) General: extensive standardisation process, algorithm to identify citations to target articles, full counting method, correlation coefficients Data source(s): EPO T-field selection/class.: Keyword based selection S-field selection/class.: CHI ‘Level’ classification of journals Data standardisation: General: number of organisations and citations per institution type Characteristics Comparable analysis of S&T Explicit linkage to Science (NPR), Identification of major institutes, policy implications Standardisation approach,, matching approach, construction of science domains +/+ - - - -/- APPENDIX I: KEY CHARACTERISTICS OF SEVERAL EMPIRICAL STUDIES ON S&T ANALYSIS Correlation # NPRs per patent # NPRs per patent Patent counts, NPR counts, Publication counts, Inventors being Authors count Patent counts, NPR Counts, Citation counts, Age distribution Patent counts, NPR Counts, References, NPRs per patent 4. 5. 6. § § § Type: direct (through NPRs) Type: direct (through NPRs) 116 Type: indirect but based on NPRs (through inventor – author relationships) § § § § § § § § § § § § § § § Data source(s): USPTO T-field selection/class.: Keyword based selection S-field selection/class.: CHI ‘Level’ classification of journals Data standardisation: General: From – to citation analysis, fractional counting, comparison of examiner and applicant citations Data source(s): USPTO, EPO T-field selection/class.: S-field selection/class.: Data standardisation: complete standardisation of all relevant fields (more or less complete reference) General: matching to SCI, front page references are used, Data source(s): EPO, SCI T-field selection/class.: randomly selected patents – most closely linked publications, expert assessment, inventor authored publications S-field selection/class.: CHI ‘Level’ classification of journals Data standardisation: General: analysis of most closely related publications, expert analysis of the patents, introduction of the CICA indicator (coinventors vs. co-authors), division in subsets of patents based on # NPRs EPO – USPTO comparison in NPRs Field characteristics for good linkage results Expert validation, showed that less NPRs is not necessarily an indicator of a lesser science intensity of the individual patent - - - Two different mapping methods (Cosina – inclusion index, single linkage clustering – waverage) - # NPRs per patent Patent and publication counts Counts of ‘Other references’, Patent counts, Year Patent counts, NPR counts, Citation counts, Publication counts 7. 8. 9. Type: indirect (mapping; co-occurrence) Type: direct (through NPRs) 117 Type: direct (through NPRs) § (S&T linkage has been created based on inventor-author relations and thus creating a map) § § § § § § § § § § § § § § § § Data source(s): Chemical Abstracts, WPI/L, EPAT T-field selection/class.: IPC definition of field of Catalysis, Chemical Abstract class and keyword selection S-field selection/class.: Data standardisation: General: Bibliometric cartography, 2 time periods of analysis, Co-occurrence frequency (Cosina, MDS, inclusion index) Data source(s): USPTO, TECH-LINE, SCI T-field selection/class: Concordance, SIC product groups S-field selection/class.: CHI ‘Level’ classification of journals Data standardisation: General: unification of each NPRs and assignation to a specific category, adjusted count’ (partial), assigned patents, division between SCI and non-SCI reference Data source(s): SCI based SLID database, USPTO T-field selection/class.: S-field selection/class.: Data standardisation: divided in different categories, unification of format General: matching to SCI-SLID, patents issued, identification of author addresses - - Expert validation, Comparison of two mapping techniques, Internal dynamics insight enables policy decisions 24% unmatched references due to imperfectio ns - Interpretatio n of the maps Distribution of references to scientific literature (P0 and P5), Validation of the Activity Index (χsquare criterion) - # NPRs per patent Patent counts, NPRs, Field of technology, Patent class, Age distribution, ‘Prominence’ of the Journal in question, Activity Index, Science Intensity Index Publications counts, Occurrence counts, Year, Country Patent counts, NPRcounts, different variables around type and characteristics of NPR (counts, journal influences, time distribution etc.) 10. 11. 12. § § Type: direct (NPRs) Type: linkage between fields of science 118 Non patent references, Comparing age distribution in patent references Type: direct (through NPRs) § § § § § § § § § § § § § § § Data source(s): USPTO, EPO (through WPIL), SCI T-field selection/class.: IPC, keyword selection (combined) S-field selection/class.: CHI ‘Level’ classification of journals Data standardisation: unification of NPRs, (format, id, supplementation) General: - Data source(s): INSPEC (PACS codes) T-field selection/class.: Keyword selection of publications S-field selection/class.: Data standardisation: General: mapping, Co-classification (classification code), Co-word (descriptors in a publication), Activity index (normalised) Data source(s): USPTO, SCI T-field selection/class.: US classification, SIC product groups S-field selection/class.: CHI ‘Level’ classification of journals Data standardisation: format unification, parsing and identification of the specific fields General: analysis of the characteristics cited literature, examiner references, P(0) and P(5) in order to analyse the skewness of distribution Patent search technique, analysis of the citation characteristics, thorough analysis of the findings Insight in the development of the field Validation of the Activity Index, Alternative: inventor = scientific author, suggest an approach for validating the distribution of NPRs, language barrier - - - (Study mainly based on the results of other studies) NPR counts, Science Intensity Index, # of patents, 14. 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. § § Type: direct/indirect (university – industry interactions, NPRs) Type: direct (NPRs) § § § § § § § § § § Data source(s): USPTO, SCI T-field selection/class: SIC product groups, selection based on nationality and NPR S-field selection/class.: SCI classification of Journals is Science disciplines Data standardisation: General: survey with 50 random inventors on the reference motives Data source(s): EPO T-field selection/class: S-field selection/class.: Data standardisation: General: identification of patents of universities through keyword ‘Professor’, They argued that a weak science linkage of a technology does not imply a low univ. – industry relationship Analysis of the role of public research institutes, validation study - - 119 Luwel, M. (1999), Is the Science Citation Index US-biased Malo S. & A. Geuna (1999), Science-Technology Linkages in an Emerging Research Platform: The Case of Combinational Chemistry and Biology Hinze, S. (1997), Bibliometric Mapping of Microsystems Engineering and International Actors in the Field Noyons, E.C.M. et al. (1994), Exploring the science and technology interface: inventor-author relations in laser medicine research Collins, P. & S. Wyatt (1988), Citations in patents to the basic research literature Narin, F. & D. Olivastro (1997), Linkage between patents and papers: an interim EPO/US comparison Korevaar, J.C. & A.F.J. Van Raan (1992), Science Base of Technology: Bibliometric mapping as a tool for national science and technology policy Narin, F. & D. Olivastro (1992), Status report: Linkage between technology and science Narin, F., K.S.D. Hamilton & D. Olivastro (1997), The increasing linkage between US technology and public science Van Vianen, B.G., H.F. Moed & A.F.J. van Raan (1990), The Assessment of the Science Base of Recent Technology Noyons, E.C.M. & A.F.J. Van Raan (1995), Structuring the dynamics of Neural Network Research Schmoch, U. et al. (1993), Indicators of the Scientific Base of European Patents Tijssen, R.J.W & P.K. Butter (1998), Het Belang van Nederlands wetenschappelijk onderzoek voor technologische innovaties: kwantitatieve analyse van octrooien Meyer-Krahmer, F. & U. Schmoch (1998), Science-based technologies: university – industry interactions in four fields Corresponding references: #NPRs per patent, validation based on EPO patents NPR-counts, patent counts, publication counts 13. REFERENCES Arrow, K. (1994), “The production and distribution of knowledge”, In: G. Silverberg and L. Soete, (eds.), The Economics of Growth and Technical Change, 1994 Arundel, A., and I. Kabla (1998), “What percentage of innovations are patented? Empirical estimates for European firms”, Research Policy, 27, pp. 127 – 141 Bain, D. and C. Rinaldini (1989), “Pitfalls of indicator based evaluation”, Commission of the European Community Joint Research Centre, Ispra, Italy, In: Van Raan et al. (1989) Braam, R.R. (1991), “Mapping of Science: Foci of Intellectual Interest in Scientific Literature”, Doctoraatsdissertatie Rijksuniversiteit Leiden, DSWO Press. Braam, R.R., H.F. Moed and A.F.J. Van Raan (1989), “Comparison and Combination of Co-Citation and Co-Word Clustering”, In: Van Raan A.F.J., Nederhof A.J., Moed H.F., Eds. (1989) Braun, T., W. Glänzel, and A. Schuber (1997), “The Bibliometric Assessment of UK Scientific Performance: Some Comments on Martin’s ‘Reply’”, Scientometrics, 20, 2, pp. 359-362 Bud, R. (1994) “The Uses of Life: A History of Biotechnology,” Cambridge: Cambridge University Press. Callon, M., J. Law and A. Rip (1986), “Mapping the Dynamics of Science and Technology”, MacMillan Press Ltd., London Callon, M., J.P. Courtial, W.A. Turner and S. Braun (1983), “From Translations to Problematic Networks: an introduction to Co-word Analysis”, Social Science Information, 22, pp. 191-235 Caniels, M.C.J. (1997), “The Geographic Distribution of Patents and Value Added Across European Regions”, Working Paper, Maastricht Economic Research Institute on Innovation and Technology (MERIT), Maastricht Collins, P., and S. Wyatt (1988), “Citations in patents to the basic research literature”, Research Policy, 17, pp. 65-74 David. P. and D. Foray (1995), “Accessing and Expanding the Science and Technology Base”, STI Review, pp. 13-68, OECD, Paris Debackere, K., M. Luwel, R. Veugelers (2000), “Patent Data as a Tool to Monitor S&T Portfolio’s”, Research Report 0040, Department of Applied Economics, Catholic University of Leuven, Belgium 120 Deeds, L., DeCarolis, D. en Coombs, J. E. (1999), “Dynamic Capabilities and New Product Development in High-technology Ventures: An Empirical Analysis of New Biotechnology Fims”, Journal of Business Venturing, 15, pp. 211-229 Deeds, D. L., DeCarolis, D. en Coombs, J. E. (1997), “The Impact of Firm-specific Capabilities on the Amount of Capital raised in an Initial Public Offering; Evidence from the Biotechnology Industry”, Journal of Business Venturing, 12, pp. 31-46 De Bruin, R.E., and H.F. Moed (1993), “Delimination of scientific subfields using cognitive words from corporate addresses in scientific publications,” Scientometrics 26, pp. 65-80 De Solla Price, D. J. (1963), "Little Science, Big Science", Columbia University Press. De Solla Price, D. J. (1965), “Is Technology Historically Independent of Science? A study in Statistical Historiography”, Journal of Technology and Culture, 6, pp. 553 – 568 Dosi, G. and S. Fabiani (1994), “Convergence and Divergence in the Long-term Growth of Open Economics”, In: Silverberg & Soete (eds.) Drucker, P.F. (1993), “Post-Capitalist Society”, Oxford: Butterworth Heinemann Egghe, L., R. Rousseau (1990), “Introduction to Informetrics”, Quantitative Methods in Library, Documentation and Information Science, pp. 203-290, Elsevier, Amsterdam Engelsman, E.C., and A.F.J. Van Raan (1991), “Mapping of Technology, A first exploration of knowledge diffusion amongst fields of technology”, Centre for Science and Technology Studies (CWTS), Leyden University Engelsman, E.C., and A.F.J. Van Raan (1994), “A patent-based cartography of technology”, Research Policy, 23, pp. 1-26 European Commission (1997), “Second European Report on S&T Indicators”, Final Report, EUR 17 639 EN, Brussels/Luxemburg French Ministry of Industry (1995), “The Key Technologies for the French Industry”, Ministère de l’Industrie, de la Poste et des Télécommunications Garfield, E. (1979), “Citation Indexing. Its Theory and Application in Science, Technology and Humanities”, Wiley, New York Gauthier, E. (1998), “Bibliometric analysis of scientific and technological research: a user’s guide to the methodology”, Observatoire des Sciences et des Technologies, Science and Technology Redesign Project, Canada, ST-98-08 121 Gibbons, M., C. Limoges, H. Nowotny, S. Schwartzman, P. Scott and M. Trow (1994), “The New Production of Knowledge”, Sage, London Glänzel, W. (1996), “The Needs for Standards in Bibliometric Research and Technology”, Scientometrics, 35, p. 167-176 Glänzel, W. and U. Schoepflin (1992), “Little Scientometrics, Big Scientometrics … And Beyond?”, Scientometrics 30, pp. 375 – 384 Grilliches, Z. (1984), (ed.), "R&D, Patents and Productivity", University of Chicago Press, Chicago Grilliches, Z. (1990), “Patent Statistics as Economic Indicators: A Survey”, Journal of Economic Literature, 28, pp. 1661-1707 Grilliches, Z., A. Pakes and B.H. Hall (1987), "The value of patents as indicators of inventive activity.", In: Dasgupta and SP. Stoneman (eds.), "Economic Policy and Technological Performance", Cambridge University Press, pp.97-124 Grupp, H. (1994), “The Measurement of technical performance of innovations by technometrics and its impact on established technology indicators”, Fraunhofer Institute for Systems and Innovation Research (FhG-ISI), Karlsruhe, Germany, Research Policy, 23, pp. 175-193 Grupp, H. (1998), "Foundations of the Economics of Innovation", Edward Elgar, Cheltenham, UK; Northampton, MA, USA Grupp, H. and B. Schwitalla (1989), “Technometrics, Bibliometrics, Econometrics, and Patent Analysis: Towards a Correlated System of Science, Technology and Innovation Indicators”, In: Van Raan et al. (1989) Grupp, H. and U. Schmoch (1992a), “Perception of Scientification of Innovation as Measured by Referencing between Patents and Papers”, In: H. Grupp (ed.), Dynamics Science-based Innovations, Springer Publishers, Berlin/Heidelberg, pp. 73 – 128 Grupp, H. and U. Schmoch (1992b), “Wissenschaftsbindung der Technik. Panorama der internationallen Entwicklung und sektorales Tableau für Deutschland; Physica”, Heidelberg Healey, P., H. Rothman and P.K. Hoch (1986), “An experiment in science mapping for research planning”, Research Policy, 15, pp. 233 – 251 Hicks, D. (1987), “Limitations of Co-Citations Analysis as a Tool for Science Policy”, Social Studies of Science, 17, p. 295 – 316 122 Hicks, D. (1995), “Published Papers, Tacit Competencies and Corporate Management of the Public/Private Character of Knowledge”, Industrial and Corporate Change, No. 4, pp. 401424 Hicks, D., B.R. Martin and J. Irvine (1986), "Bibliometric Techniques for Monitoring Performance in Technologically Oriented Research: The Case of Integrated Optics", R&D Management, Vol.16, No.3, pp.211-223 Hinze, S. (1997), “Mapping of Structures in Science and Technology: Bibliometric Analyses for Policy Purposes”, Doctoraatdissertatie Rijksuniversiteit Leiden, DSWO Press Katz, J.S. and B.R. Martin (1997), “What is research collaboration?”, Research Policy, 26, pp. 1-18 Katz, J.S. and D.M. Hicks (1996), “A Systemic View of British Science”, Scientometrics, 35 (1), pp. 133 – 154 Korevaar, J.C., and A.F.J. Van Raan (1992), “Science base of technology. Bibliometric mapping as a tool for national science and technology policy”. Part I: Recent developments in catalysis and environmental chemistry, Research Report to The Netherlands Ministry of Education and Science, Zoetermeer. Report CWTS-92-08, Center for Science and Technology Studies (CWTS), Leiden, 73 pp. Laestadius, S. (1998), “The relevance of science and technology indicators: the case of pulp and paper”, Research Policy, 27, pp. 385 – 395 Luwel, M. (1999a), “Is The Science Citation Index US-Biased”, Scientometrics, Vol. 46, No. 3, pp. 549-562 Luwel, M. (1999b), “Bibliometrisch profiel van het Vlaams onderzoek”, Ministerie van de Vlaamse Gemeenschap: Administratie Wetenschap en Innovatie Luwel, M., E.C.M. Noyons and H.F. Moed (1999), “Bibliometric assessment of research performance in Flanders: Policy background and implications”, R & D Management 29 (2), pp. 133-141 Malo, S. and A. Geuna (1999), “Science-Technology Linkages in an Emerging Research Platform: The Case of Combinatorial Chemistry and Biology”, Scientometrics, 33, pp. 381422 Mansfield, E. (1969), “The Economics of Technological Change”, Norton & Company, London, Great Britain Mansfield, E. (1991), “Academic Research and Industrial Innovation”, Research Policy, 20 (1), pp. 1 - 20 123 Martin, B., A. Salter, D. Hicks, K. Pavitt, J. Senker, M. Sharp and N. von Tunzelmann (1996) “The Relationship Between Publicly Funded Basic Research and Economic Performance - A SPRU Review”, Report prepared for HM Treasury, SPRU, University of Sussex, Falmer, Brighton, UK Martin, B.R. (1996), “The Use of Multiple Indicators in the Assessment of Basic Research”, Scientometrics, 36 (3), pp. 343 – 362 Martin, B.R., Irvine, J. (1983), “Assessing Basic Research. Some Partial Indicators of Scientific Progress in Radio Astronomy”, Research Policy, 12, pp.61-90 McCulloch, R. (1980), “International Indicators of Science and Technology: How does the US compare?”, Scientometrics, 2, 5-6 McMillan, G.S., Narin, F. and Deeds, D.L. (2000), “An analysis of the critical role of public science in innovation: the case of biotechnology”, Research Policy, 29, pp. 1-8. Meyer, M. (2000a), “Does science push technology? Patents citing scientific literature”, Research Policy, 29 (3), pp. 409-434 Meyer, M. (2000b), “Patent citations in a novel field of technology – What can they tell about interactions between emerging communities of science and technology”, Scientometrics, 48 (1), pp. 151 – 178 Meyer, M. (2000c), “Patent Citation Analysis as a Policy Planning Tool”, Working Paper INN1E496, SPRU University, Sussex, UK Meyer, M. and O. Persson (1998), “Nanotechnology – interdisciplinarity, patterns of collaboration and differences in applications”, Scientometrics, 42, 2, pp. 195-205 Meyer-Krahmer, F. (2000), “Increasing role of Basic Research for Innovation: The Case of Science Based Technologies. Empirical Evidence and Institutional Consequences”, ISI Karlsruhe, Conference paper Meyer-Krahmer, F. and U. Schmoch (1997), “Science-based technologies: universityindustry interactions in four fields”, Research Policy, 27, pp. 835 – 851 Moed, H.F. (1989), "The Use of Bibliometric Indicators for the Assessment of Research Performance in the Natural and Life Sciences", Doctoraatsdissertatie Rijksuniversiteit Leiden, DSWO Press Moed, H.F. (1996), “Differences in the construction of SCI based bibliometric indicators among various producers: A first overview”, Scientometrics, 35, pp. 177-191 Moed, H.F. and M. Vriens (1989), “Possible Inaccuracies Occuring in Citation Analysis”, Journal of Information Science, 15, pp. 283-295 124 Moed, H.F., Burger, W.J.M., Frankfort, J.G., Van Raan, A.F.J. (1985), "The use of bibliometric data for the measurement of university research performance", Research Policy, 14, pp.131-149 Moed, H.F., R.E. De Bruin, and Th.N. Van Leeuwen (1995), “New bibliometric tools for the assessment of national research performance: database description, overview of indicators and first applications”, Scientometrics, 33, p. 381-422 Mohnen, P. (1992), “The relationship between R&D and productivity growth in Canada and other major industrialized countries”, Economic Council of Canada Mombers, C., A. Van Heeringen, Van Venetië, C. Le Pair (1985), "Displaying Strengths and Weaknesses in National R&D Performance through Document Co-citation", Scientometrics, 7 (3-6), pp. 341-355 Narin, F. and K.S. Hamilton (1996), “Bibliometric Performance Measures”, Scientometrics, 36, pp. 293 – 310 Narin, F., and D. Olivastro (1992), “Status Report: Linkage Between Technology and Science”, Research Policy, 21, pp. 237-330 Narin, F., and E. Noma (1985), “Is technology becoming science?”, Scientometrics, 7 (3-6), pp. 369-381 Narin, F. and E. Noma (1987), “Patents as indicators of corporate technological strength”, Research Policy, 16, pp. 143-155 Narin, F., K.S. Hamilton, and D. Olivastro (1997), “The Increasing Linkage Between US Technology and Science”, Research Policy, 26, pp. 317 – 330 Narin, F., M. Rosen and D. Olivastro (1989), “Patent citation analysis: new validation studies and linkage statistics”, In: Van Raan et al. (1989) Nederhof, A.J. (1991), “Delimitation of a medical research topic: interaction with experts in selecting a database and constructing a search strategy”, Research Evaluation 1, pp. 149-154 Nederhof, A.J., Van Raan, A.F.J. (1993), "A bibliometric analysis of six economics research groups: A comparison with peer review", Research Policy, 22, pp.353-368 Nelson, R.R. (1994), “What has been the Matter with Neoclassical Growth Theory?”, In: Silverberg & Soete (eds.) Nelson, R.R. and S.G. Winter (1974), “Neo-classical vs Evolutionary Theories of Economic Growth: Critique and Prospectus”, Economic Journal, pp. 886 – 905 125 Nonaka, I. and H. Takeuchi (1995), “The Knowledge-Creating Company. How Japanese Companies Create the Dynamics of Innovation”, New York, Oxford: Oxford University Press Noyons, E.C.M. and A.F.J. Van Raan (1998), “Advanced mapping of science and technology”, Scientometrics, 41 (1-2), pp. 61-67 Noyons, E.C.M., A.F.J. Van Raan, H. Grupp, and U. Schmoch (1994), “Exploring the science and technology interface: Inventor-author relations in laser medicine research”, Research Policy, 23, pp. 443-457 Noyons, E.C.M., and A.F.J. Van Raan (1994), “Bibliometric cartography of scientific and technological developments of an R&D field. The case of optomechatronics”, Scientometrics, 30, pp. 157-173 Noyons, E.C.M., M. Luwel and H.F. Moed (1998), “Assessment of Flemish R&D in the field of information technology; A bibliometric evaluation based on publication data and patent data, combined with OECD research input statistics”, Research Policy, 27, pp. 285-300 Noyons, E.C.M., M. Luwel, and H.F. Moed (1999), “Combining mapping and citation analysis for evaluative bibliometric purposes”, J. of the American Society for Information Science (JASIS), 50 (2), pp. 115-131 OECD (1992), “Technology and The Economy, The Key Relationships”, Paris OECD (1994a), “The Measurement of scientific and technological activities. Using Patent Data as Science and Technology Indicators”, Patent Manual 1994, Paris OECD (1994b), “Main Definitions and Conventions for the Measurement of Research and Experimental Development (R&D)”, A Summary of the Frascatti Manual 1993, Paris OECD (1997), “Bibliometric Indicators and Analysis of Research Systems: Methods and Examples”, STI working papers 1997/1 Pakes, A. (1986), "Patents as Options: some estimates of the value of holding European patent stocks", Econometrica, 54, pp.755-784 Pakes, A., Grilliches, Z. (1984), "Patents and R&D at the Firm Level: A First Look", In: Grilliches, Z. (Ed.), R&D, Patents and Productivity, Chicago, University of Chicago Press Pavitt, K. (1984), “Sectoral patterns of technical change: towards a taxonomy and a theory”, Research Policy, 13, pp. 343 – 373 Pavitt, K. (1985), “Patent Statistics as indicators of inventive activities: possibilities and problems”, Scientometrics, 7, pp. 77-99 126 Pavitt, K. (1987), “The Objectives of Technology Policy”, Science and Public Policy, 14, pp. 182 – 188 Pavitt, K. (1997), “Academic Research, Technical Change and Government Policy”, In Krige, J. and Pestre, D. (eds.), “Science in the Twentieth Century”, Amsterdam: Harwood Academic Publishers, pp. 143-158 Perko, J.S. and F. Narin (1997), “The Transfer of Public Science to Patented Technology: A Case Study in Agricultural Science”, Journal of Technology Transfer, 22, 3, pp. 65 – 72 Pritchard, A. (1969), "Statistical bibliography or bibliometrics?", Journal of Documentation, 25, pp. 348-349 Rabeharisoa, V. (1992), “A special mediation between science and technology. When inventors publish scientific articles in fuel cells research”, pp. 54 – 72, In: Grupp (1992a) Rappa, M.A. and K. Debackere (1992), “Technological communities and the diffusion of knowledge”, R&D Management, 22, pp. 209 – 211 Rip, A. (1992), “Science and technology as dancing partners”, In: Kroes, P., M. Bakker (Editors), Technological Development and Science in the Industrial Age, Kluwer, n.p., pp. 231-270 Rosenberg, N. (1990), “Why Do Firms Do Basic Research (With Their Own Money)?”, Research Policy, No. 19 (1990), pp. 165-174 Rosenberg, N. (1992), “Scientific Instrumentation and University Research”, Research Policy, No. 21, pp. 381-390 Rosenberg, N. and L.E. Birdzell Jr. (1990), “Science, technology and the Western miracle”, Scientific American, 263 (5), pp. 42-54 Rosenberg, N. (1982), “Inside the black box: technololgy and economics”, Cambridge University Press, Cambridge Salter, A. J. and B. R. Martin (1999), “The Economic Benefits of Publicly Funded Research: A Critical Review”, Science Policy Research Unit, University of Sussex, UK (forthcoming in Research Policy) Schankerman, M., Pakes, A. (1985), "The Rate of Obsolescence and the Distribution of Patent Values: Some Evidence from European Patent Renewals", Revue Econ., 36(5), pp.917941 Scherrer, F.M. (1996), “Patents: what do we know; what must we learn?”, November, Luxembourg 127 Schmoch, U. (1993), “Tracing the knowledge transfer from science to technology as reflected in patent indicators”, Scientometrics, 26 (1), pp. 193-211 Schmoch, U. (1997), “Indicators and the relations between Science and Technology”, Scientometrics, No 38, pp. 103-116 Schmoch, U., E. Strauss, H. Grupp and T. Reiss (1993), “Indicators of the Scientific Base of European Patents”, Report to the European Commission, EUR 15 330 EN, Brussels/Luxemburg Senker, J. (1993), “The Contribution of Tacit Knowledge to Innovation”, AI & Soc. (1993), No. 7, pp. 208-224 Senker, J. (1995), “Tacit Knowledge and Models of Innovation”, Industrial and Corporate Change (1995), Vol. 4, No. 2, pp. 425-447 Senker, J. et al. (1999), “Changing Structure, Organisation and Nature of European PSR Systems, European Comparison of Public Research Systems”, EU (TSER) report Silverberg, G. and L. Soete (eds.), (1994), “The economic growth and technical change: technologies, nations, agents”, Elgar Aldershot Smith, L.C. (1981), "Citation Analysis", In: Potter, W.G. (ed.), “Bibliometrics”, Library Trends, 30 (1), Summer Sneath, P.H.A. and R.R. Sokal (1973), “Numerical taxonomy: the principles and practice of numerical classification”, Freeman San Fransisco (Calif.) Soete, L.G. and S.M.E. Wyatt (1983), “The Use of Foreign Patenting as an Internationally Comparable Science and Technology Output Indicator”, Scientometrics, 5 (1), pp. 31-54 Steinmueller, W.E. (1994), “Basic Research and Industrial Innovation”, Chapter 5 in Dodgson, M. and Rothwell R. (eds.), The Handbook of Industrial Innovation, Edward Elgar, Aldershot, pp. 54-66 Steinmueller, W.E. (2000), “Economics of Science”, In: Pavitt, K., E. Steinmueller and J. Calvert & B. Martin (2000), “International Encyclopedia of the Social and Behavioural Sciences: 3 papers", Paper no. 44 of SPRU Electronic Working Papers Series, Science Policy Research Unit, University of Sussex, UK Stoneman, P. (1987), “The economic analysis of technology policy”, Clarendon Oxford, Oxford Tidd, J. and Brocklehurst, M. (1999), “Routes to Technological Learning and Development: An Assesment of Malaysia’s Innovation Policy and Performance”, Technological Forecasting and Social Change, 62, pp. 239-257 128 Tijssen, R.J.W. (1992), "Cartography of Science: Scientometric Mapping with Multidimensional Scaling Methods", Doctoraatsdissertatie Rijksuniversiteit Leiden, DSWO Press, 307 pp. Tijssen, R.J.W. and R.K. Buter (1998), “Het belang van Nederlands wetenschappelijk onderzoek voor technologische innovaties: kwantitatieve analyse van octrooien”, Center for Science and Technology Studies (CWTS), Leiden, The Netherlands Tijssen, R.J.W., and E. Van Wijk (1999), “In search of the European Paradox: an international comparison of Europe's scientific performance and knowledge flows in information and communication technologies research”, Research Policy, 28 (5), pp. 519-543 Tijssen, R.J.W., R.K. Buter and Th.N. Van Leeuwen (2000), “Technological relevance of science: validation and analysis of citation linkages between patents and research papers”, Scientometrics, 47, pp. 389-412 Toynbee, A. J. (1963), “Introduction: The Genesis of Civilisations, A Study of History”, 12 vols, 1, 3, New York Trajtenberg, M. (1987), “Patents, Citations and Innovations: Tracing the Links”, Working Paper No. 31 – 87, Foerder Institute for Economic Research, Tel-Aviv University, Ramat Aviv, Israel Trajtenberg, M. (1990), “A penny for your Quotes. Patent Citations and The Value of Innovations”, Rand Journal Economics, 21 (1), pp. 172 – 187 Van Amelsvoort, P. (1992), “Het vergroten van de bestuurbaarheid van produktieorganisaties”, Proefschirft ter verkijging van de graad van doctor aan de Technische Universiteit Eindhoven, ST-groep, Oss, The Netherlands Van Den Berghe, H., Houben, J.A., De Bruin, R.E., Moed H.F., Kint, A., Luwel, M. and Spruyt, E.H.J. (1998), "Bibliometric Indicators of University Research Performance in Flanders", Journal of the American Society for Information Science (JASIS), Vol.49 (1), pp. 59-67 Van Raan, A.F.J. (1993), “Advanced bibliometric methods to assess research performance and scientific development. Basis principles and recent practical applications”, Report CWTS-93-05, Center for Science and Technology Studies (CWTS), Leiden University, The Netherlands Van Raan, A.F.J. (1997), “Scientometrics: State of the Art”, Scientometrics, 38 (1), pp. 205218 129 Van Raan, A.F.J. and H.P.F. Peters (1989), “Dynamics of a field analysed by co-subfield structures”, Scientometrics, 15, pp. 607 – 620 Van Vianen, B.G., H.F. Moed and A.F.J. van Raan (1990), “An Exploration of the Science Base of Recent Technology”, Research Policy, 19, pp. 61-81 Verspagen, B. (1994), “Technology and Growth; the Complex Dynamics of Convergence and Divergence”, In: Silverberg & Soete (1994) Verspagen, B., T. van Moergestel and M. Slabbers (1994), “MERIT concordance table: IPC – ISIC (rev. 2)”, MERIT Research Memorandum 94-004, University of Limburg Maastricht, The Netherlands Ziman, J. (1994), “Prometheus Bound. Science in a Dynamic Steady State”, Cambridge University Press, Cambridge 130
© Copyright 2026 Paperzz