1 A review of the co-citation based Science Foresight process Dr David J Holland Smith Alison Mathew Dr Nick M Kemp Dr J Sylvan Katz, Katz Competitive Intelligence Ltd Dstl/CR10342 1.0 23 March, 2004 © Crown copyright 2004 Dstl © Crown copyright Defence Science and Technology Laboratory UK Executive summary Overview This report was commissioned by the UK MOD’s Research Acquisition Organisation as part of a wider investigation aimed at informing the selection of an appropriate process for ‘horizon scanning’. Though the stakeholders in the output of such scanning extend throughout MOD and into its wider networks within government and commercial organisations, the process must primarily serve Output 5 of MOD’s research programme. In this context, horizon scanning can be seen as the means for identifying significant developments in civil and foreign defence science and technology that may not otherwise be noted through MOD’s own technical programmes. Through the ‘Science Foresight’ Project, which reported in 2001, MOD/Dstl, aided by SPRU, designed and piloted a process that addresses at least some of the aspects required for horizon scanning as defined above. Specifically, this process provides a means for ‘targeting’ influential scientific authors and collecting their views as to the future of their domain. Aim. This report aims to provide an objective assessment of the Science Foresight process, as it might be cast within MOD’s new Output-based framework for research delivery. The Science Foresight process The Science Foresight process is designed to provide a systematic and efficient means for collecting a body of scientifically informed opinion regarding the future of a field or domain of interest, drawn directly from those who are most influencing its development. • ‘Influential’ authors are determined through a statistical analysis of the patterns of cocitation within the domain (The publisher, Thomson-ISI, offers the bulk of this analysis as a commercial service). This first identifies the key groups that are pioneering particular advances, and then selects one or a small number of key authors from each group. • Key authors are then invited to express their views as to the likely future for their areas of work. A systematic on-line questionnaire technique was pioneered for this aspect of the process. • The opinions thus gathered are analysed, summarised and reported for consideration within MOD. Findings Through horizon scanning, RAO particularly wishes to identify relevant developments in domains not addressed by MOD’s current research programme. Identifying such areas is non-trivial, and may in some cases fall outside the coverage of the ISI database currently used for co-citation analysis. Dstl/CR10342 1.0 Page 3 of 30 Co-citation analysis is one member of a class of bibliometric techniques that form an established and internationally respected means for gaining insight into the structure of research and its key contributors. Its use within the Science Foresight process appears to be appropriate. For practical reasons, however, this form of analysis is currently limited to the ISI database whose contributing authors will provide a mainly science-centric perspective. Furthermore, the current process cannot draw on material that is not indexed by ISI. This includes much of the World Wide Web, and other technical literature such as books and reports. The process of advising on the content of MOD’s research portfolio may begin with the collection of academic and other scientific opinion, but can only be completed once matched with knowledge of potential military applications. Whether explicitly or otherwise at this stage, key items on the relevant technology roadmap need to be understood. The availability of military impact assessment would considerably increase the potential readership within MOD and enable the participation of a much wider group. Even with impact assessment, however, a paper (or electronic) report is still likely to be difficult to navigate and to extract the information that is likely to be most useful. Conclusions and recommendations Taken overall, Science Foresight provides a practical and cost-effective means for sampling opinion from a wide range of leading scientific authors. If the process is to be incorporated into MOD’s future horizon-scanning activity, additional analyses will be required. In particular: • MOD will need to devise a means for identifying fields of interest within which the process of identifying leading scientific authors can be focused; • safeguards should be incorporated where possible to alert the readership to views which are judged to be extremely atypical or potentially erroneous; • a means will need to be identified for ‘mapping’ scientific opinion into an assessment of potential military impact; • if RAO wishes its horizon-scanning activity to include inputs from the technology community in addition to those from the authors of science papers, as obtained by the current process, an appropriate method will need to be developed1, and • a revised means for disseminating the results of the analysis should be introduced. With these additional measures, the process could contribute significantly towards satisfying RAO’s requirement for horizon scanning from open sources. Appendix A provides further insight into the original Science Foresight Project and describes possible enhancements towards the realisation of ‘dynamic foresight’. 1 The implementation of co-citation analysis on databases other than the ISI Research Fronts database may be problematic. Page 4 of 30 Dstl/CR10342 1.0 Table of contents 1 Introduction 7 1.1 Background 7 1.2 Aim of study 8 2 The Science Foresight process 9 2.1 The Science Foresight concept 9 2.2 The 2001 Science Foresight process 9 3 Assessment of the Science Foresight process 11 3.1 Domain selection 11 3.2 Identification of authors 11 3.3 Likely strengths of the author-selection method 12 3.4 Possible weaknesses of the author-selection method 13 3.5 Questionnaire considerations 14 3.6 Analysis of returns 15 4 Conclusions 17 4.1 General findings 17 4.2 Strengths of the Science Foresight method 17 4.3 Weaknesses of the Science Foresight method 17 5 Recommendations 19 5.2 General issues 19 5.3 Domain identification 19 5.4 Author identification 19 5.5 Information gathering 19 5.6 Reporting 20 6 Acknowledgements 21 Dstl/CR10342 1.0 Page 5 of 30 7 List of references 22 8 List of abbreviations 24 APPENDIX A From Foresight to Dynamic Foresight 25 A-1 Introduction 25 A-2 Expert Selection 25 A-3 Validation 26 A-4 Dissemination and Visibility 26 A-5 Dynamic Foresight 28 A-6 Appendix summary 29 Page 6 of 30 Dstl/CR10342 1.0 1 Introduction 1.1 Background 1.1.1 The UK MOD has long sustained a deep interest in relevant scientific and technical developments within the civil sector. Knowledge of current and likely future advances is used to underpin and complement MOD’s own research programme. Access to civil technologies enhances the overall effectiveness with which MOD sustains the means for defence. ‘The first step in acquiring access is understanding what exists, requiring, in turn, that MOD maintains awareness of the developments in the civil sector world-wide’ (DG(R&T), 1999). 1.1.2 In practice, the evolution of technology has long been recognised as a complex sociotechnical phenomenon that is very difficult to monitor, to track and to forecast. Not only is the range of information sources vast2, the estimation of relevance, possibly projecting many years into the future, often requires deep human expertise and insight. 1.1.3 Since the Second World War, a number of authors have attempted to provide a theoretical foundation for analyses of technology evolution. In parallel, a number of programmes have implemented pragmatic solutions. Figure 1-1 charts a selection of some of the more significant contributions and events. Theoretical development Information economy Skunk works (LockheedMartin) Knowledge economy Product life-cycle Technology trajectories Technology forecast surveys (Japan) Value innovation Virtual organisations UK Government Foresight programme EU Framework Programme ’50 ? Disruptive technologies Paradigm shift ’40 Much to be done on the underlying theoretical drivers ’60 ’70 ’80 ’90 ’00 Punctuated equilibria, S-curves ’10 ’20 ’30 year NB. curve is illustrative only Figure 1-1: A timeline of relevant developments in technology evolution (Holland Smith & Kemp, 2003) 2 Broadly, every minute sees the publication of forty new pages of scholarly text (http://www.sims.berkeley.edu/research/projects/how-much-info-2003), at least one new patent application (http://swpat.ffii.org/news/04/oecd0130/index.en.html), and a new Web site (http://wcp.oclc.org/stats/size.html). Dstl/CR10342 1.0 Page 7 of 30 1.1.4 Considering MOD’s particular requirement for horizon-scanning in support of Output 5, the definition of technology watch, as accepted by the Science and Technology (S&T) Board, has strong relevance (especially if extended to science and technology watch3). “monitoring developments in civil and foreign defence technology to identify potentially disruptive technologies4 and inform balance of investment in our own research programme” (see Henderson & Edwards, 2003) 1.1.5 Notably, this describes a process that works in addition to the literature search and analysis that is conducted as part of the normal research process. Here, emphasis is placed on assessing the current portfolio of research and supporting the management of its content through time. 1.1.6 In 2001, MOD/Dstl, with the aid of SPRU5, designed and piloted a process that might be adopted within Output 5 to provide a horizon-scanning service. This offers a ‘…simple, objective and cost effective technique to gather information about emerging short and long-term research developments…’ (Katz et al., 2001). Known as ‘Science Foresight’, this process provides a means for ‘targeting’ influential authors and collecting their views as to the future of their domain. 1.2 Aim of study 1.2.1 This report aims to provide an objective assessment of Science Foresight, as it might be cast within MOD’s new ‘Output’-based framework for research delivery. 3 The process may also need to be aware of relevant market and economic forces, availability of funding, relevant legislation and so on (Holland Smith et al., 2001). 4 Holland Smith & Kemp (2003) suggest that ‘disruptive’ here refers to ‘… technologies which simultaneously change the way in which a capability is realised whilst displacing current solutions’. For the military, disruption will typically require changes to force structure; but the phenomenon is equally at play in a commercial context where market leaders can be overturned and supply chains radically re-shaped. 5 SPRU’s original title was the Science Policy Research Unit and hence the acronym. Its current title is Science and Technology Policy Research. Page 8 of 30 Dstl/CR10342 1.0 2 The Science Foresight process 2.1 The Science Foresight concept 2.1.1 Science Foresight, described in detail in Katz et al. (2001), essentially comprises four sequential steps as shown in Figure 2-1. Choose technical domain(s) for study Identify key authors using co-citation method Survey selected authors’ views by questionnaire Collate, edit and report views within MOD Figure 2-1: The components of Science Foresight 2.1.2 The approach, once the domain(s) for investigation are known, is to automatically select key academic authorities based on their published output, and then to collect their views and forecasts via questionnaire. Through this means, the technique is relatively objective and auditable, yet it has the potential to deliver significant insights at relatively low cost. 2.2 The 2001 Science Foresight process 2.2.1 Domain selection. The 2001 Foresight project focused on the physical and engineering sciences. This choice was based on a study of the MOD Technology Strategy, the Strategy for Applied Research 1999-2019, and the MOD Technology Taxonomy Issue 5 (1999) (Katz et al, 2001). 2.2.2 Identification of authors6. This was carried out using a form of bibliometric analysis based on citation and co-citation statistics. ‘Co-citation’ occurs where two documents or authors appear within a single reference list (i.e. are ‘co-cited’) in a subsequent paper7. Documents that are co-cited are assumed to be closely related in terms of topic; the more frequent the co-citation, the stronger the inferred relationship. In co-citation analysis, statistics are compiled based on counts of the number of times that two works are cocited in later publications. Using these statistics for documents within a particular field, it is possible to produce a representation or ‘map’ describing the field’s intellectual ‘structure’. Specifically, the ‘groups’ or ‘clusters’ of papers identified in this way generally correspond to subspecialty areas or ‘research fronts’8, and may point to sociotechnical networks engaged in the process of generating important new knowledge. 2.2.3 Thomson ISI, which offers a commercial co-citation service known as the Research Fronts database, was used as the source of published literature. 6 7 8 A more detailed account appears in (Katz et al, 2001) The analysis is typically associated with formal technical documents. A research front is a collection of between two and fifty co-cited papers that are clustered together. Dstl/CR10342 1.0 Page 9 of 30 2.2.4 Science and engineering fields of potential interest were identified and approximately 500 of the most representative clusters were determined using a well-defined set of rules (Katz et al., 2001). The most highly cited paper from the pair of most highly co-cited papers published in the 1990s was then chosen from each cluster. The authors of these papers were used to prepare the list of experts. In total, 483 highly co-cited papers were selected, involving 2120 authors and representing research activity in 66 physical science and engineering fields. 2.2.5 Information gathering. Authors, judged by the co-citation method as being leading authorities within a particular research front, were contacted by letter and invited to provide views as to the future of their specialisms. Access was provided to a secure online questionnaire that covered background information, emerging developments in science, factors affecting developments, and the driving forces in science. 2.2.6 Analysis of returns. Each author was asked to describe up to two short-term predictions (covering the next five years) and one longer-term prediction (for the period of five to ten years from now). Up to 300 words of free text was available for each prediction. 2.2.7 Analysis of these returns was limited to extracting one or two key sentences, or, if none existed, providing a short summary. In all cases, care was taken to use the original authors’ words and to remain faithful to their text. ‘No attempt was made to judge, criticise or interpret a predicted emerging development. This task is left strictly to the reader.’ (Katz et al., 2001). Page 10 of 30 Dstl/CR10342 1.0 3 Assessment of the Science Foresight process 3.1 Domain selection 3.1.1 When used in conjunction with the Thomson ISI database, the technique is able to cover a broad swath of science and engineering at relatively low cost9. The starting assumption is that one or a small number of individuals can represent the scientific views of the potentially large communities in which they work. To some extent, this reduces the need to pre-specify the areas of interest10. However, there remains a trade-off between depth of analysis and breadth of coverage, and the method may be best suited to the identification of radical innovations as opposed to the much greater number of smaller, incremental ones. 3.2 Identification of authors 3.2.1 A number of different strategies might be employed for author selection. One of the simplest, given access to existing domain expertise, is the simple nomination or conomination11 of knowledgeable ‘colleagues’ or scientific peers. Such methods, based on human judgement, are open to a range of biases and will not be equally effective in all cases. One way to avoid such bias is to work with the full text of documents by studying the words and their co-occurrence patterns. Callon and his colleagues’ co-word analysis (Callon et al, 1991) and Kostoff’s database tomography (Kostoff, 2001) are examples of such techniques: the former has been used to identify socio-technical networks, whilst the latter can guide the production of domain taxonomies. Either can be used to draw attention to a ‘balanced’ sample of a domain’s authors, but again, each requires human input and can be quite labour-intensive. 3.2.2 Bibliographic databases store pre-extracted data about collections of published literature and are able to underpin a range of automated ‘bibliometric’ analyses. Naturally, these analyses can only be applied to data furnished by the database provider. 3.2.3 Publication counts simply indicate which authors are most prolific, not necessarily most informative or influential. Citation counts say more about the influence of scientific authors, but offer little information regarding the structure of a scientific community or scientific advances. How can we know, for example, when two prolific and highly cited authors are writing about significantly different advances? Science Foresight uses cocitation to identify and analyse research fronts and the authors working within them12. The authors of later (citing) papers effectively contribute the measure, and its objectivity reflects the fact that some degree of consensus has been established across many independent authors. 9 The use of other data sources may significantly increase the costs of the process. In fact, co-citation links have been shown to cross established disciplinary boundaries (Small, 1999), and the analysis may lead to new and interesting developments outside the prescribed area of interest. 11 Co-nomination is an iterative process. It starts with a list of recommended experts and converges to a list of experts who have multiple nominations by different experts (Katz et al., 2001). 12 Citation analysis then targets authors within research fronts. 10 Dstl/CR10342 1.0 Page 11 of 30 3.2.4 Ideally, we would wish to find authors who have just published papers that will later prove to be influential. However, for co-citation to operate, not only must the original papers have been published, but also a significant number of later papers that cite them. Since a conventional publication cycle typically takes between 1 and 3 years13, the authors identified through co-citation may no longer be ‘at the front’ of their research area. In practice, a mixture of bibliometric and other techniques will undoubtedly deliver the best results. 3.2.5 Before leaving the time-lag issue, it is worth noting that an alternative approach, known as bibliographic coupling, simply assesses two document’s similarity in terms of the overlap between their reference lists (see Morris et al, 2003 for a recent implementation). This metric does not rely on later authors and has been shown to be capable of identifying how a particular field has advanced and its structure has evolved over recent years. Whilst bibliographic coupling is capable of generating groupings that may be used alongside or in place of ISI’s Research Fronts, the lack of a commercial supplier for this form of analysis may render it prohibitively expensive to implement on a large scale. 3.2.6 The co-citation analysis tended to identify senior scientists of long standing in their fields, who were actively publishing in prestigious international journals14. To complement this group, there may be value in targeting the authors of influential ‘review’ articles, which explicitly set out to draw conclusions from the work of a number of researchers. 3.2.7 The ultimate assessment of the author selection technique used in Science Foresight is whether the predictions gathered prove to be accurate. To a limited extent, it would be useful to compare the predictions made using this technique with others for the same field, but arrived at using different methods. The only true arbiter, however, is time. 3.2.8 In the absence of a practical means for validating the predictions, the following bulleted list summarises a range of the more general strengths and weaknesses of co-citation based techniques. 3.3 Likely strengths of the author-selection method • The metric is readily available through the ISI Research Fronts Database. Whilst it is possible to analyse other sources using information extraction techniques, these can be expensive to establish. Moreover, since automated processes are likely to perform less robustly than a human analyst when working with written text, additional errors may be introduced. • The assessment of science and technology outputs by citation counts is both quantitative and relatively objective (see 3.2.3). • Co-citation is relatively straightforward, relies on few assumptions, and can be quite cost-effective. 13 The consequence for this time-lag of the rapidly growing trend towards e-publication needs to be investigated. 14 Since Thomson ISI takes relatively few journals in non-English, generally, these authors (or their coauthors) will have some familiarity with the language. Page 12 of 30 Dstl/CR10342 1.0 3.4 • ‘Since only highly cited papers are used as the source for co-citation statistics, the analysis is relatively robust to biases otherwise caused by excessive self-citation’ (Katz et al, 2001). • A single, large database, such as the ISI Research Fronts, does capture the bulk of formally published ‘mainstream’ scientific literature. Studies indicate broadly that, 20% of scientific journals publish 80% of all articles (see for example, Geisler, 2000). Possible weaknesses of the author-selection method • The time lag between the inception of a new line of research and the formation of a co-citation cluster will usually be measured in years. For incremental developments15, the average age of cited papers is five years although this may be considerably less in some fast moving areas. There is therefore an issue over how far back in time the citations should reach. (Geisler, 2000). • Typically, there may be an over-representation of theoretical as opposed to experimental papers, due to the former’s relative ease of production (King, 1987). Citation patterns may also vary between these two. • The ISI database covers some engineering and technology, but is strongly focused towards science. The ‘science-push’ view (Irvine & Martin, 1984), which will typically emerge from its scientific authors, may not, on its own, satisfy MOD’s needs for both science and technology perspectives16. • The Science Foresight process ultimately requires the identification of individual authors. The papers that are highlighted through co-citation, however, may be written by more than one author. The statistics produced do not inform the choice of which of several joint authors to approach17. • Citations may reflect either agreement or disagreement with the cited papers. While the general pattern is overwhelmingly towards positive citation, notable exceptions do exist. • The ‘obliteration phenomenon’ (King, 1987) refers to a known tendency for breakthrough advances to be cited less frequently over time. This effect is domain dependent but usually applies to papers that are decades old. The effect on co-citation analysis is, therefore, small at this time although if the current trends toward increasing volume and speed of publication continue, the phenomenon of obliteration may eventually need to be taken into consideration. 15 As opposed to those which have more radical effect and whose time-scales may be measured in decades. 16 US patents are required to cite the prior art in their subject, and the practice is not uncommon elsewhere. Patent citation analysis may offer a quite different view of technical advances, here emphasising technology over science. 17 There may be merit in considering author co-citation which accounts both for individual seminal papers, and authors whose importance is due to a body of work (White & Griffith, 1981; McCain, 1990). Dstl/CR10342 1.0 Page 13 of 30 • The output of scientific endeavour, through scholarly journals, is typically focused through small, self-selected groups of scholars who form an elite or ‘invisible college’ within a given discipline (e.g. Geisler, 2000). Important developments occurring outside these invisible colleges may not be cited as frequently as those within. To the authors’ knowledge, the impact of this phenomenon on co-citation analysis has not been systematically studied. 3.5 Questionnaire considerations 3.5.1 The Science Foresight Project specifically set out to identify an effective but low-cost means for gathering the views of an international collection of leading specialists. Since the specialists were not directly funded for their participation, the challenge was to reach a geographically diverse group of senior people while providing them with the flexibility to engage on their own terms. 3.5.2 Given the travel implications, formal interviews could only have been undertaken by telephone or with some other on-line mediation. To judge from the current growth of tele-marketing, this approach may warrant consideration, although the cost of analysts who are able to ‘interview’ world-class scientists and engineers is likely to be high. In the 2001 exercise, a self-administered questionnaire method was chosen such that individuals could provide an input at a time of their own choosing. 3.5.3 When one uses self-administered questionnaires, it is imperative to minimise the scope for uncertainty or ambiguity in the interpretation of questions. This issue is particularly acute when dealing with an international audience. In this case most respondents may be assumed to at least be familiar with the language (see Paragraph 3.2.6). However, unlike with researcher-administered questionnaires, additional indicators such as the respondent’s apparent confidence and the time taken to complete each question cannot be gathered18 (Peterson, 2000). 3.5.4 In the Science Foresight Project, open-ended questions were used to capture respondents’ views of the future in their domain. Katz et al. (2001) note the variability of responses and that some predictions were much more difficult to summarise for reporting than others. Whether this variation might be reduced by refinement of the questionnaire is unclear. 3.5.5 To illustrate the complexities, consider the following question taken from the Science Foresight Questionnaire: Over the next five years do you expect the availability of funding in your research area will? Increase Decrease Stay the same Cannot Predict Table 3-1: Example question (Katz et al., 2001) 18 The time taken to complete an unsupervised online questionnaire cannot be accurately gauged, since it is not possible to determine the cause of any interruptions (e.g. thinking time or tea time). Page 14 of 30 Dstl/CR10342 1.0 3.5.6 This question might reasonably be answered from either a ‘local’ (i.e. personal, organisational or national) or a ‘global’ perspective; the response given could therefore differ according to the interpretation adopted. Furthermore, the ‘availability’ of funding might be taken to refer to the total anticipated investment in a domain, or simply to the number of independent awards likely to be available. 3.5.7 The Science Foresight Project broke new ground in the use of ‘secure’ questionnaires which were hosted at SPRU’s web-site and completed on-line. Doubtless this technique offers greater scope than traditional paper forms for ensuring the accuracy of gathered information. For a future evolution of the Foresight process, MOD’s previous investment in human factors research, and especially in interface design, might be reviewed to shed light on such issues as: • the balance between open and closed questions19; • the ordering of questions; • respondent motivation and providing appropriate incentives to respond; • assessing the robustness of responses, and • matching the questionnaire with the subsequent analysis. 3.5.8 Respondents’ motivation to invest time and effort in co-operating with the survey needs to be considered. In general, for non-funded returns, a good starting point for such considerations is that respondents are seeking to influence the conclusions from the exercise (Peterson, 2000). In the case of Science Foresight it was noted that respondents may be gratified to have been ‘singled out’ by an objective process as being the individuals whose opinion is worth eliciting. 3.5.9 In the 2001 Science Foresight, respondents were predominantly upbeat when predicting that funding and collaboration would increase in the future (Katz et al., 2001). In reality, this is unlikely to be the case for all areas, and may suggest a bias towards optimism20. Furthermore, there was less optimism regarding the number of graduate students who would be available to provide the increased research effort. 3.5.10 Drawing these observations together, it is seen that the robustness of the process is likely to benefit from formal review or debate by peers. This, however, brings significant implications for the cost of the exercise. 3.6 Analysis of returns 3.6.1 The identification of domain experts based on co-citation analysis assumes a correlation between technical expertise and influence, on the one hand, and the ability to predict 19 In closed questions the respondent selects an answer from a fixed set of options, whereas in open questions the respondent’s answer is not limited in this way and is usually discursive in nature. 20 Leading scientists will typically have devoted their working lives to the pursuit of research in their chosen area. In general, they will regard this as the most important research area and tend to believe that others will eventually come to see that too, and hence that this will therefore eventually be reflected in more funding. Dstl/CR10342 1.0 Page 15 of 30 future developments, on the other. There are no known reasons to doubt this assumption, although strictly speaking, there is little evidence to support it either. For MOD’s purposes, ideal respondents would also have a clear view as to the potential military relevance of their science. 3.6.2 Katz et al. (2001) acknowledge that analysis and presentation should be undertaken differently in future iterations of their process. There are at least two levels at which this thinking needs to be applied. • Summarisation. In the 2001 Science Foresight report, ‘…no attempt was made to create a general overview of the emerging developments…’ (Katz et al., 2001). For presentational purposes, respondents’ narratives were summarised into (typically) two sentences, taking care to maintain the meaning of their words. • Abstraction. The assessment of military relevance and impact was not attempted in the 2001 Science Foresight study. Given summarised information regarding the future of science in particular domains, it was left to the report’s readers to determine its military implications. Thus the original format for the report will achieve value only within a subset of its potential readership. No attempt was made to collect readers’ own analyses of impact and to facilitate their dissemination. 3.6.3 Whether explicit or otherwise at this stage, the analysis must develop an understanding of the key items on the relevant technology roadmap that describes the passage(s) of the science or technology through to its ultimate use(s). 3.6.4 In summary, Katz et al. (2001) describe Science Foresight as ‘…the act of looking forward in an effort to anticipate the nature and significance of scientific developments…’. For the RAO, in support of Output 5, this implies not only the need to identify important science, but also to assess its particular relevance to the MOD. The latter implies the need for a further stage in the analysis that might be termed ‘impact assessment’. Page 16 of 30 Dstl/CR10342 1.0 4 Conclusions 4.1 General findings 4.1.1 The 2001 Science Foresight study represents an interesting and welcome development in MOD’s approach to maintaining an understanding of world-wide science and technology. The approach is innovative with genuine novelty, especially in the use of online questionnaire completion. As designed, the process is squarely aimed at gathering opinion across a broad and influential multi-national group, at low cost. Inevitably, some compromises are required to meet these competing aims and the process may need to be extended if it is to be adopted within Output 5. Most notably, these include impact assessment and improved reporting. 4.1.2 Accepting that it is too soon to judge the accuracy of the Science Foresight output, it is nevertheless possible to summarise some of the salient strengths and weaknesses of the method. 4.2 Strengths of the Science Foresight method 4.3 • Relative to other methods, the technique provides broad coverage at low cost, especially in cases where one is seeking to identify innovations that are more radical in their potential impact. • The approach generates a number of possible futures rather than consensus on just one outcome. • The process is consistent with the MOD’s technology awareness ‘level’ of Technology Watch21 (Henderson & Edwards, 2003). • The online questionnaire approach facilitates the engagement of senior and busy scientists. • Information collected on limiting factors and driving forces can be used to support and qualify forecasts. Weaknesses of the Science Foresight method • Domain selection. The technique is subject to a trade-off between depth of analysis and breadth of coverage and may be poorly suited to the identification of incremental innovations. This will be the case, even in a narrowly specified domain, where papers reporting ‘lesser’ innovations may fall beneath the thresholds required for inclusion in the Research Fronts database. 21 This level identifies potentially interesting new technologies but does not involve a formal assessment of MOD significance. Dstl/CR10342 1.0 Page 17 of 30 • Identification of authors − Uses published findings that are, typically, several years old22. − Is limited to broadly scientific texts. Respondents hence may know little about translating their science into technology and systems. • Information gathering − The questionnaire method may be susceptible to subjectivity and to respondent bias23 and lacks some of the quality assurance provided by peer review. − The approach offers little incentive for respondents to share new and particularly innovative concepts. − Information is returned as a set of scientific opinions, in narrative form, from which it may be difficult for a non-domain specialist to abstract common themes and principles. • Reporting − The lack of military ‘impact assessment’ reduces the scope of the readership within MOD and hence the size of the potential audience that will be interested. − The gathering of a rich set of information from all respondents in a single technical report, even in summarised form, was acknowledged to result in poor ‘navigability’. 22 If this is a significant issue in practice, it will apply more to the identification of incremental than radical advances. 23 i.e. those who respond may have systematically different views from those who do not respond. Page 18 of 30 Dstl/CR10342 1.0 5 Recommendations 5.1.1 A series of recommendations follow from the foregoing analysis and conclusions. Where possible, these are grouped under the headings of the conceptual process. In support of this, Appendix A provides further insight into the original Science Foresight Project and describes possible enhancements that would move towards the realisation of ‘dynamic foresight’. 5.2 General issues 5.2.1 There is an issue over ‘ownership’ of the study output. MOD staff who have the power to act on the results, should be engaged as appropriate in the process design and in the analysis. 5.3 Domain identification 5.3.1 The Science Foresight process as described does not include a specific means for identifying new and potentially ‘interesting’ domains. MOD does, however, need to continually re-assess its current understanding in this respect. This process should, therefore, either be considered as one of a range of techniques for ‘horizon scanning’, or a separate process for domain identification should be designed and implemented. 5.4 Author identification 5.4.1 The following points should be addressed when re-designing a future Science Foresight process. • The possible virtues of bibliometric coupling should be considered, along with issues for its operationalisation if appropriate. • The respective merits of citation databases, the World Wide Web and e-publications as sources of author identification should be re-assessed at intervals. • It may be possible to incorporate the opinions of ‘review’ paper authors to complement those of the leading specialists captured by the present method. • The importance to Science Foresight of incorporating a technology-centric view should be considered in detail. If required, a means should be established for collecting this input alongside the current science-centric views. 5.5 Information gathering 5.5.1 The questionnaire should be appraised for the possibility of capturing domain-specific jargon used to describe key developments. This will enable the retrieval of further supporting material and may be used to draw attention to extreme or potentially Dstl/CR10342 1.0 Page 19 of 30 erroneous views. Alternatively, it may be possible to achieve this aim through textual analysis (see 5.5.3. below). 5.5.2 The timing of the questionnaire survey should be matched to the availability of the scientific target audience. Mid-way through an academic term is probably the optimum for academic scientists. 5.5.3 Textual (or other) analyses (see, for example, Kostoff, 2000) should be considered for the papers within each cluster. Such analyses could generate hypotheses upon which the selected experts could be asked to comment. 5.6 Reporting 5.6.1 If the Science Foresight process is to complement MOD’s Technology Watch programme at levels two and three, then a process for analysing questionnaire responses in order to assess the military impact of scientific advances should be designed and costed. 5.6.2 The format of the final report should be reviewed. It may be preferable to replace this entirely with a well-indexed, subject-searchable database (possibly using MOD definitions/taxonomies). Appendix A describes an alternative, and potentially powerful approach, which treats the process as dynamic. Page 20 of 30 Dstl/CR10342 1.0 6 Acknowledgements 6.1 The authors acknowledge the kind assistance of Sally Stewart, formerly of Dstl, who was instrumental in pioneering the Science Foresight process within MOD. Our thanks are also due to Professor Ben Martin of the University of Sussex for his detailed technical review and kind suggestions for improvement of the text. Dstl/CR10342 1.0 Page 21 of 30 7 List of references 7.1. Callon, M., Courtial, J-P. & Laville, F. (1991) Co-word analysis as a tool for describing the network of interactions between basic and technological research: The case of polymer chemistry. Scientometrics, 22, 1, pp.155-205. 7.2. DG(R&T) (1999) MOD strategy for keeping track of world-wide civil sector R&T. D/DG(R&T)/9/22, June 1999. 7.3. Geisler, E. (2000) The Metrics of Science and Technology, Quorum Books, Westport, C.T. 7.4. Henderson, J. I. & Edwards, B. (2003) Improving technology watch in the Corporate Research Programme (CRP). Distillation, Issue 04. 7.5. Holland Smith, D. J. & Kemp, N. M. (2003) Text Analysis for S&T Intelligence, Innovation and Creativity: Final Customer Report. QinetiQ Customer Report. QINETIQ/FST/CR031323/1.0. 7.6. Holland Smith, D. J., McOwat, D. M. & Marks, L. M. (2001). Development of a costeffective science and technology intelligence process. DERA Technical Report. DERA/TD/TR000004/1.0, March. 7.7. Irvine, J., & Martin, B.R. (1984). Foresight in science: Picking the winners. London: Frances. 7.8. Katz, J. S., Stewart, S., Gow, T.A.K. & Martin, B. (2001) Science Foresight Project: Final Report. Volume 1. Dstl Technical Report. DSTL/TR01697. 7.9. King, J. (1987) A review of bibliometric and other science indicators and their role in research evaluation. Journal of Information Science, 13, 5, pp.261-76. 7.10. Kostoff, R. N. K. (2000) Fullerene data mining using bibliometrics and database tomography. Journal of Chemical Information and Computer Science, 49, 8, pp.674-85. 7.11. Kostoff, R. N. K. (2001) The role of open source intelligence in global technology watch. Military Information Technology. 7.12. McCain, K.W. (1990). Mapping authors in intellectual space: A technical overview. Journal of the American Society for Information Science, 41, pp.433-443. 7.13. Morris S.A., Yen G., Wu Z. & Asnake, B. (2003) Time line visualization of research fronts. Journal of the American Society for Information Science and Technology. 54, 5, pp.413-422. 7.14. Peterson, R.A. (2000). Constructing Effective Questionnaires. Sage Publications Inc., Thousand Oaks, California, USA. Page 22 of 30 Dstl/CR10342 1.0 7.15. Small, H. (1999) Passage through science: crossing disciplinary boundaries. Library Trends, 48, 1, pp.72-108. 7.16. White, H.D. & Griffith, B. (1981). Author co-citation: A literature measure of intellectual structure. Journal of the American Society for Information Science, 32, pp.163-71. Dstl/CR10342 1.0 Page 23 of 30 8 List of abbreviations IRAP Industrial Research Assistance Programme NRC National Research Council [of Canada] ISI Thomson ISI publisher and database provider S&T Science and Technology SCI Science Citation Index SPRU (formerly) Science Policy Research Unit [University of Sussex] (now) Science and Technology Policy Research Page 24 of 30 Dstl/CR10342 1.0 APPENDIX A From Foresight to Dynamic Foresight This appendix has been contributed by Dr J. Sylvan Katz of Katz Competitive Intelligence Ltd, Saskatoon, Canada and part time Senior Fellow at SPRU, University of Sussex. Dr Katz was a manager and principal author of the Science Foresight study undertaken in collaboration with MOD and Dstl during 2000/1. A-1 Introduction A-1.1.1 The Science Foresight Project was apparently the first international on-line foresight activity. The aim of the project was to design and assess a simple, objective and costeffective technique to gather information about emerging short and long-term research developments, primarily in the physical and engineering sciences. During the course of the project, and through subsequent involvement in two focused horizon-scanning activities, new ideas have emerged that could improve the original Science Foresight process. This appendix discusses possible ways: • to improve expert selection; • to validate predictions of emerging developments; • to increase the visibility and dissemination of foresight information, and • to add more value to expert predictions by moving from conventional foresight to dynamic foresight. A-2 Expert Selection A-2.1.1 Experts were selected from the ISI Research Fronts database by starting with a list of Science Citation Index (SCI) subject categories24 from which highly co-cited papers were to be selected. Currently, the approximately 8,500 journals covered within the Science, Social Science, and Arts and Humanities Citation Indexes, are classified into 170 subject categories. Of these, 66 science and engineering categories were selected for the original study, based on information derived from the MOD Technology Strategy, the Strategy for Applied Research 1999-2019, and the MOD Technology Taxonomy Issue 5 (1999) (Katz et al., 2001). About 500 highly co-cited papers were chosen from the Research Fronts database25 using queries (in the SQL language) and custom scripts (written in Perl). Some manual manipulation of the data was required when, for example, Access tables were transformed into formats suitable for processing by the Perl programs and vice versa. 24 Examples of subject categories are acoustics, energy and fuels, applied mathematics and telecommunications. 25 Supplied by ISI as a Microsoft Access database. Dstl/CR10342 1.0 Page 25 of 30 A-2.1.2 If the Science Foresight approach is to be operationalised for ongoing use, then further thought should be given to ways of improving expert selection. In particular, a user-friendly interface could be developed that allows non-programmers to select highly co-cited papers based on a set of well-defined selection parameters. A-2.1.3 Perhaps some consideration could be given to examining the way highly co-cited papers are selected according to the subject categories of interest that are being scanned. For example, customised subject categories could be developed that are more closely aligned to the fields that the MOD/RAO is interested in scanning. However, it must be pointed out that this could be costly and time-consuming since it requires the participation of people knowledgeable about the contents of the journals and the fields of interest. An alternative approach to selecting research fronts of interest might be to run a keyword search over the descriptive labels given to each cluster in the research front database. The difficulty with this approach is that the keywords can be quite specific and it could be difficult or time consuming to develop an effective keyword search strategy. A-3 Validation A-3.1.1 The problem of validating expert predictions was one of the major issues encountered in the original project. In the original project, experts were asked to describe two short-term and one long-term emerging development in their research fields. Unfortunately, the only way to confirm the validity of their predictions would be to wait for time to pass and then to examine the literature for supporting information. This would be a costly and time-consuming process that might not be worth pursuing. A-3.1.2 There is a novel approach that could be used to gather predictive information that can be at least partly confirmed in the present. Instead of asking experts to predict emerging developments, they could be asked to describe important research problems that are currently being investigated in their disciplines and specialities. Furthermore, they could be asked to provide one or two references to peer-reviewed papers that discuss the problems and their significance. In addition, the experts might be asked to hypothesise about the nature of potential radical and incremental developments that might emerge once those research problems had been solved. A-3.1.3 The implication of using this approach is that the validity of the research problems can be established immediately by examining the reference publications and crossreferencing responses from other experts in the same disciplines and specialities. While the potential technical developments cannot be confirmed until later, they can be examined by knowledgeable people with interests in emerging technologies relevant to the military. A-4 Dissemination and Visibility A-4.1.1 The findings from the original Science Foresight Project were disseminated through a printed report and an interactive web site. The printed report was valuable for documenting the objectives, methodology and findings; however, it was not Page 26 of 30 Dstl/CR10342 1.0 particularly satisfactory for presenting the richness and diversity of the predicted emerging developments. In the second phase of the project an interactive web site was created to overcome this limitation. The web site contained: • a downloadable PDF version of the printed report; • HTML versions of the abstract, executive summary and the observations and recommendation; and • searchable databases of the highly co-cited papers from which the experts were chosen and of the predicted short- and long-term emerging developments. A-4.1.2 Generally speaking, the dissemination process did not draw as much attention from the wider foresight community as might have been hoped. This was probably due to the lack of time and funds needed to present the project findings at international conferences and to write peer-reviewed publications. A-4.1.3 The interactive web site was located within the SPRU web site at the University of Sussex. Unfortunately, the university does not have a method for easily tracking the amount of traffic that the web site generated or the types of queries that were being made against the databases. However, a Google search using the phrase “Science Foresight Project” indicates that the web site has a reasonably good Google page rank and about 30 web sites have links to the project web site. A-4.1.4 There are a number of ways to improve the dissemination of the findings of future foresight projects. One of the significant ways that the visibility of a future foresight project might be increased is to involve members of the international foresight community. However, this introduces questions like “what role, if any, should these organisations play in shaping future projects?” On the other hand, there is little doubt that these organisations could provide valuable inputs into the interpretative and dissemination stages. By way of example, the unit for Science and Technology Foresight in the European Commission’s Research DG for the knowledge-based society and economy could be approached. This unit is responsible for monitoring S&T Foresight activity around the world and it is already familiar with the Science Foresight Project. In addition, the leader of the Office of Technology Foresight at the National Research Council of Canada has expressed an interest in future foresight projects. This group receives foresight input from Defence R&D Canada, who have a ‘disruptive technologies’ working group and a ‘technology watch’ function, and the Natural Sciences and Engineering Research Council of Canada whose interests lie mainly in the realm of emergent and convergent shifts in disciplines and domains. A-4.1.5 The web site for the original project should be considered as a prototype that can be improved for future foresight activities. Consider, for example, the possibilities that would be available if the information solicited from the expert community was adapted to facilitate the new approach outlined in section A-3. In this case, not only would experts provide brief descriptions of the important research problems that are being investigated but they would also provide references to publications supporting their claims. A bibliographic database of these references, perhaps including abstracts, could be included with the other data about the highly cited papers and the important research problems identified experts. Furthermore, hyperlinks could be Dstl/CR10342 1.0 Page 27 of 30 provided to the full text for those documents that can be found on the Internet. These reference documents would not only provide support information but they could also be used as the starting point for other types of bibliographic and bibliometric investigations. A-4.1.6 It might be worth reconsidering the location of the web site. Perhaps greater Internet visibility and ‘product branding’ can be achieved by locating it on its own server with its own domain name. The affiliation with SPRU, Dstl and the MoD can easily be maintained through careful design of the graphical presentation on the web pages. This approach would have a number of advantages. For example, the server can be configured to track the amount of traffic and general location of visitors to the web site. It can monitor the types of searches that are made of the databases and the number of times reference documents are accessed. In addition, a discussion forum could be included on the server to gather comments and suggestions from the broader international community about the value of the project so they could make contributions to the information that has been gathered. A-5 Dynamic Foresight A-5.1.1 A foresight activity requires a significant investment of time, money and effort to gather and organise expert opinions so that they can be explored and digested relatively easily. Furthermore, other than through the preparation of formal reports and briefs, relatively few simple techniques are available for users to record and exchange views dynamically as they explore a foresight database. If such techniques were available, then it would be possible to move from the current rather static foresight activities to a more dynamic foresight process. A-5.1.2 In the autumn of 2001, the Industrial Research Assistance Program of the National Research Council of Canada (NRC-IRAP) asked Dr Katz to assist in a short-term, narrowly focused foresight activity. This centred on the potential uses of prairie crops for enhancing health. Using experience gained from the Science Foresight Project, a dynamic foresight system was prototyped, built on a web-based knowledge system. This project was successful and a second project was undertaken to prototype a dynamic foresight system with expanded capabilities that could be used in strategic planning foresight activity for the NRC’s Plant Biotechnology Institute. The second prototype was constructed in collaboration with QCC Communications Inc, a publicly traded Canadian company that specialises in e-business and network applications development. This project was completed successfully and it laid the groundwork for the development of a fully functional foresight system. A-5.1.3 Generally speaking, dynamic foresight is a technique that overcomes some of the shortfalls of conventional static foresight activities. It uses Internet-based methods to disseminate and capture value-added information in a cost-effective and continuous manner. It allows users to view and maintain a foresight knowledge base with a browser through a secure public, private or semi-private portal. Most importantly, the accuracy and integrity of the information in the knowledge base is maintained through collaborative and peer-reviewed processes. Page 28 of 30 Dstl/CR10342 1.0 A-5.1.4 Dynamic foresight has three main ingredients: a focus, a moderator and an interest group. The focus is any well-defined area that is being monitored. The moderator is an individual or group of individuals responsible for ensuring that the information in the knowledge base is current and accurately reflects the emerging developments in a focus area. An interest group is composed of the users of foresight information. The users are the human knowledge base and reviewers of the content of the foresight knowledge base. A-5.1.5 A dynamic foresight system contains the tools that allow web-naive individuals to create and manage information in a knowledge base. Initially, the knowledge base is constructed by experts with knowledge about a focus area. A knowledge base can contain information for many different but perhaps related focus areas. Users of the foresight information are asked to advise the moderator and other users about other emerging developments and innovations that they find as they explore the knowledge base. A-5.1.6 Conceptually, dynamic foresight uses a table of contents to organise information. A table of contents has a hierarchical structure and it contains hyperlinks to information located on the Internet. The information could be located anywhere including on secure servers requiring special authorisation. The source of information could be anything that can be viewed with a browser such as (but not limited to) Microsoft Word documents, searchable databases and media files. As users explore the knowledge base and other resources, they may locate information that is relevant to the focus area. Alternatively, it may be that the organisation of the table of contents would more accurately reflect the emerging activities in a focus area if the structure or entries in the table of contents were moved, removed or modified. Users provide such feedback to the moderator and other users by email or through an on-line discussion system. The moderator maintains the integrity of the knowledge base by looking for user consensus before authorising a modification to the table of contents. Over time the table of contents and hence the knowledge base will change to reflect the emerging developments in the focus area. The dynamic foresight system is designed to allow certain groups of users to view only selected portions of the table of contents and hence limit their access to portions of the knowledge base. This feature facilitates gathering feedback from the broader foresight community as well as from restricted and confidential groups of users. A-5.1.7 A dynamic foresight system may not only extend the useful lifetime of a foresight activity but it may also be used for other purposes. For example, it can be employed as the starting point for future foresight activities. Experts could be asked to reshape and update a knowledge base. Furthermore, indicators of emerging developments in a focus area can be constructed by tracking the changes in the structure of the table of contents over time. A-6 Appendix summary A-6.1.1 This appendix outlines a variety of ways that the original format of the Science Foresight Project could be modified to enhance the process of selecting experts, to validate information about research from which radical or incremental technologies could emerge, to increase the visibility and dissemination of the information that is Dstl/CR10342 1.0 Page 29 of 30 collected and to extend the useful life-time of a foresight activity through the addition of value-added information. The implementation of these ideas will depend on such things as the possibility of integrating foresight activities into broader horizon-scanning activities as well as the funds and resources available for future foresight activities. Page 30 of 30 Dstl/CR10342 1.0
© Copyright 2026 Paperzz