Topic:'Can'cooperation'among'competing'firms'existing'in'industry'led'open' research'networks?'and'if'so'when? Instructor:*Robin*Gustafsson The*Finnish*SHOK*(Strategic*Centres*for*Science,*Technology*and*Innovation)* organisations*are*built*on*the*idea*that*firms*operating*in*the*same*industry*–* even*competing*firms*–*can*benefit*from*shared*research*and*development* programs.*However,*firms*that*compete*in*the*same*markets*or*product* segments*can*find*if*unattractive*to*cooperate*with*their*competitors.*In*this* special*study,*the*students*will*analyze*a*rich*set*of*data*from*the*CLEEN*and* FIMECC*to*explore*which*factors*increase*the*likelihood*of*competing*firms* joining*together*in*an*open*research*program.*The*data*consists*of*program* enrolment*information*between*2009R2013.*Additionally*the*students*will* gather*data*from*company*annual*reports*for*identifying*inter*firm*rivalry.*The* primary*empirical*challenge*in*the*study*will*be*to*identify*which*firms*are* competitors*with*on*another*and*to*explore*using*empirical*data*the*conditions* for*competitors*collaborating.*The*methods*for*this*study*will*be*either* qualitative*or*a*mixture*of*qualitative*and*quantitative*(i.e*mixed*methods.).** Cassier,*M.,*&*Foray,*D.*2002.*Public*Knowledge,*Private*Property*and*the* Economics*of*HighRTech*Consortia.*Economics*of*Innovation*and*New* Technology,*11(2):*123R132. Chen,*M.*1996.*Competitor*Analysis*and*Interfirm*Rivalry:*Toward*A*Theoretical* Integration,*Academy*ofMAnagement*Review,*21(1):*100R134*Dahlander,*L.,*&* Gann,*D.*M.*2010.*How*Open*Is*Innovation?*Research*Policy,*39(6):*699R709. Ritala,*P.,*&*HurmelinnaRLaukkanen,*P.*2009.*What's*in*It*for*Me?*Creating*and* Appropriating*Value*in*InnovationRRelated*Coopetition.*Technovation,*29(12):* 819R828.* Feedback(on(the(special(study(“EFFECT%OF%MARKET%COMMONALITY%AND% RESOURCE%SIMILARITY%ON%PARTICIPATING%IN%OPEN%INNOVATION”( ( We(normally(provide(about(one(page(of(feedback(for(the(final(version,(but( because(we(use(this(study(as(an(example(in(the(class,(you(will(get(a(bit(more( feedback.( ( This(is(a(very(well@written(special(study.(We(particularly(appreciate(that(you( make(a(clear(claim(and(provide(an(empirical(test(of(the(said(claim(without(too( much(clutter(in(a(concise(form.((I.e.(we(like(that(the(report(is(short.)( ( The(paper(starts(with(an(abstract(that(describes(the(study(well,(but(could(still(be( improved.(The(main(purpose(of(abstract(is(to(give(a(busy(reader(a(quick( overview(of(what(you(did(and(what(the(results(is(so(that(the(reader(can(decide( whether(or(not(to(read(the(study.(You(open(with(a(direct(quotation(of(a( definition,(which(is(perhaps(not(the(best(idea.(I(would(rather(rely(on(the(reader( having(a(rough(understanding(on(what(open(innovation(is(and(then(saving(the( definition(to(the(paper(body.(If(you(present(a(quote,(you(would(need(to(include( the(source((citation(with(a(page(number)(here(as(well.(In(the(third(paragraph(you( present(two(questions,(which(can(be(a(bit(confusing(as(a(study(normally(has(just( one(research(question.(Moreover,(in(the(abstract(it(is(not(necessary(to(explain( what(particular(statistical(model(was(used.(You(could(be(a(bit(more(descriptive( about(the(theory,(though.(Now(the(interaction(effect(that(you(study(is(mentioned( only(in(the(final(paragraph.( ( (This(is(a(very(minor(comment,(but(please(refrain(adjusting(the(margins(from(the( Word(defaults.(When(printed,(the(paper(is(now(difficult(to(keep(in(hand(without( covering(part(of(the(text.)( ( The(intro(is(well@structured,(and(describes(clearly(what(the(study(is(about(and( ties(it(to(some(of(the(relevant(literature.(One(page(is(the(correct(length(for(this( part,(and(we(like(how(you(made(use(of(that:(you(first(describe(why(open( innovation(is(interesting(and(important,(follow(with(a(clear(research(question,( and(the(describe(what(you(did.(( ( The(theory(and(hypotheses(section(is(well(structured(as(well.(We(like(that(you(do( not(spend(too(much(space(on(explaining(what(the(literature(says(in(general,(but( focus(on(your(theory(and(the(hypotheses(instead.(Nevertheless,(we(find(this( section(of(the(paper(to(be(the(weakest(part(of(the(study.(You(could(have( explained(and(presented(stronger(arguments(for(the(why@question.(However,( considering(that(you(were(a(one(person(group(and(the(study(has(other(merits,(we( allow(for(some(weaknesses.((In(other(words,(if(this(study(was(done(in(a(larger( group,(we(would(have(suggested(this(to(be(fixed(and(if(not,(considered(reducing( the(grade.)( ( You(start(by(explaining(why(the(companies(that(serve(common(markets((i.e.( potential(competitors)(might(want(to(participate(in(open(innovation.(You(explain( “There&are&multiple&reasons&why&companies&with&common&markets&might&want&to& participate&in&open&innovation.&For&example,&they&might&collaborate&within&certain& segment&of&overall&market&while&at&the&same&time&competing&in&other&segments,& such&as&Toyota&collaborating&with&PSA&Peugeot&Citroën&in&small&car&segment&in& Europe&(PSA&Peugeot&Citroën,&2012)”.(We(like(that(you(address(the(why@question( explicitly,(but(the(second(sentence(does(not(really(provide(an(answer.(The(press( release(that(you(cite(provides(a(clear(rationale(for(why(the(two(companies(want( to(cooperate(in(the(Light(Commercial(Vehicle(segment,(and(you(could(have( explained(that(a(bit(more.(The(second(explanation(that(you(provide(is(that(for(the( reasons(that(you(explain,(“companies(are(less(likely(to(initiate(an(attack”.( However,(initiating(an(attack((a(competitive(move)(is(not(the(same(as( participating(in(open(innovation;(these(are(distinct(concepts.(Same(goes(for(the( third(explanation;(imitation(between(competitors(is(not(the(same(as( participating(in(open(innovation.(In(sum,(while(we(like(that(you(attempt(to( provide(multiple(explanations(for(why(competitors(might(participate(in(open( innovation.(However,(each(of(the(three(explanations(that(you(provide(could(have( been(developed(much(further.( ( The(same(general(comment(applies(to(Hypothesis(2(and(Hypothesis(3(as(well;(the( warrants(for(the(hypotheses(should(be(expanded(and(refined.( ( The(methods(are(well(described(and(appropriately(chosen.(You(start(by( explaining(that(you(chose(the(CLEEN(SHOK(because(the(participating(companies( are(heterogeneous,(but(do(not(explain(why(heterogeneity(is(a(good(thing.(I(would( prefer(a(structure(where(you(first(explain(your(sample,(then(what(you(measured( and(how,(and(only(after(this(explain(where(the(data(comes(from.(If(you(consider( your(data(as(an(Excel(sheet,(this(would(mean(that(you(first(explain(what(rows( you(have,(then(what(columns(you(have,(and(only(after(this(how(you(got(the(data( into(the(cells.(You(now(state(that(“Industry&descriptions&and&main&line&of&business& were&obtained&from&company&data&search&engines&created&by&Kauppalehti,&Fonecta,& and&from&Joint&business&information&system&of&the&Finnish&Patent&and&Registration& Office&and&the&Tax&Administration”&before(explaining(why(you(needed(this(data(in( the(first(place((this(is(explained(on(the(following(two(pages.)(( ( The(subsection(that(describes(the(sample(is(missing(the(information(of(how( many(companies(participated(in(the(CLEEN(SHOK(and(how(many(programs(there( were(in(the(SHOKs.(You(do(explain(this(later(in(section(3.7,(but(I(would(have( expected(to(have(it(here.(Further,(it(is(not(clear(at(his(point(that(you(build(all( possible(pairs(of(companies(participating(in(the(SHOWK(regardless(of(whether( they(actually(participated(in(the(same(programs.(Without(this(information(your( explanation(of(what(the(dependent(variable(is(would(be(difficult(to(understand.( ( Your(description(of(the(variable(“Line(of(business(description”%is(exceptionally( good:(it(provides(sufficient(information((for(a(reader(with(the(necessary( background(knowledge)(of(a(complex(variable(without(being(too(long.(You(also( address(validity(and(correctly(point(out(that(the(variable(has(some(weaknesses.( ( The(part(about(independent(variables(could(have(been(made(more(convincing(by( providing(evidence((i.e(citations)(that(these(measures(or(similar(measures(have( been(used(in(the(past(and(have(been(found(useful.((I(do(not(know(if(this(is(the( case,(but(I(would(be(surprised(if(not(at(least(some(of(your(measures(would(have( been(used(by(someone(else(as(well.)( ( One(thing(that(you(could(have(still(explained(in(the(independent(variables( section(that(you(used(multiple(measures(for(market(commonality(and(resource( similarity.(Also,(you(missed(the(opportunity(of(using(means(of(the(different( measures(as(more(robust(composite(measures.((Similarly(to(using(multiple(items( in(a(survey(to(measure(one(concept(and(then(aggregating(these(as(mean.)(( ( I(like(that(you(present(control(variables(in(a(separate(section.(This(section(starts( with(the(statement(“All&control&variables&for&the&firmOdyad&are&mean&values&of&the& variables&to&control&for&their&absolute&value&in&order&to&find&relative&differences& between&the&companies.&“&I(do(not(understand(what(you(mean(by(this;(why(would( mean(control(for(relative(differences?((And(also(relative(to(what?)(It(seems(that( you(have(chosen(the(control(variables(well,(but(the(choices(are(only(partially( justified:(A(control(variable(should(be(expected(to(correlate(with(both(the( dependent(and(the(independent(variables,(but(you(explain(only(why(you(expect( the(controls(to(be(correlated(with(the(dependent(variable(and(only(for(some(of( the(variables.(The(possible(effects(or(age(and(size(are(not(explained.( ( I(like(how(you(describe(data(preparation.(However,(all(of(this(is(really(about(how( you(chose(the(sample(and(how(you(constructed(the(measures.(Another(option( would(have(been(to(present(this(information(in(parts(in(the(previous(sections( where(you(described(the(sample(and(measures.((In(my(opinion,(this(would(have( been(a(better(choice(and(this(is(also(the(current(convention(in(high@quality( research(journals.)(Moreover,(there(is(some(repetition(here.(In(sum,(this(section( could(have(been(omitted(altogether.( ( The(section(about(analysis(method(could(have(been(omitted.(Logistic(regression( is(a(fairly(standard(tool(as(is(reporting(odds(ratios,(so(you(could(have(just(stated( “I(used(Logistic(regression(to(analyze(the(data”(in(the(beginning(of(the(results( section(and(be(done(with(it.(It(is(not(clear(what(you(mean(by(“Analyses&were&made& separately&for&both&relative&values&and&mean&values&to&determine&whether&there& was&difference&in&relative&or&absolute&values&of&variables,&e.g.&is&it&the&relative&size& that&makes&companies&prefer&coOoperating&or&do&larger&firms,&for&example,&tend&to& coOoperate&with&each&other.&After&this,&odds&ratios&were&generated&from&each&of&the& models.”&You(are(using(both(types(of(variables(in(the(same(models,(which(is(just( fine.( ( The(first(half(of(the(section(“3.7(Overview(of(the(data”(really(just(describes(what( the(sample(is(and(would(be(better(positioned(in(the(section(that(discusses( sample.(The(second(part(could(then(be(merged(under(the(main(section(results.( (This(is(the(structure(that(is(commonly(used(in(published(papers.)( ( The(tables(are(formatted(well(and(it(makes(a(lot(of(sense(to(include(two( correlation(tables,(one(for(the(company(level(data(and(one(for(the(dyadic(data.( You(could(have(simplified(this(a(bit(by(just(limiting(your(analyses(to(complete( cases(so(that(you(would(not(have(to(present(different(observation(numbers((N)( for(different(variables.(This(is(the(first(time(that(the(variable(R&D(per(revenue(is( introduced.(I(would(have(expected(this(to(be(discussed(in(the(variables(section.( However,(considering(that(you(do(not(use(the(variable(in(the(regressions,(it( would(be(more(simple(to(just(not(say(anything(about(it(at(all.( ( The(results(section(is(well@structured.(You(present(the(main(results(in(one(table( and(then(explain(which(hypotheses(are(supported(and(which(are(not(and(then( leave(the(discussion(of(these(results(to(later(sections(as(you(should.(The( regression(tables(is(exceptionally(good(and(provides(all(necessary(technical( information(as(well(as(the(purpose(of(different(models(and(variables.(One(thing( that(you(could(have(added(is(description(of(the(effect(sizes,(i.e.(how(strong(the( effect(of(the(significant(variables(are.(E.g.(“Compared(to(companies(that(do(not( operate(in(the(same(industry,(companies(that(operate(in(the(same(industry(are( roughly(two(and(a(half(times(more(likely(to(cooperate(in(open(innovation( projects.”( ( The(conclusions(section(well@written(as(well.(You(start(by(summarizing(the( results(and(then(move(on(to(weaknesses,(which(you(correctly(state(relate(mostly( to(how(the(variables(were(defined(and(are(therefore(limitations(of(the(data.(In( terms(of(structure,(it(you(start(by(explaining(some(of(the(weaknesses(in(the( variables(after(which(you(continue(with(a(separate(section(about(weaknesses.( You(also(proposes(some(ways(that(these(limitations(could(be(addressed,(which(is( a(merit.(I(also(like(the(fact(that(you(assess(not(only(the(weaknesses(of(the( empirics,(but(also(weaknesses(in(the(theory(by(introducing(the(role(of(that(the( type(of(research(program(can(play.( ( You(explain(that(it(is(possible(that(relationships(are(non@linear.(This(is(actually( something(that(can(be(tested(by(“specification(tests”(that(can(be(conducted(after( model(estimation.( ( EFFECT OF MARKET COMMONALITY AND RESOURCE SIMILARITY ON PARTICIPATING IN OPEN INNOVATION Research report TU-91.2037 Special study in Strategic management i Abstract Author: Title: Effect of market commonality and resource similarity on participating in open innovation Date: Open innovation “is the use of purposive inflows and outflows of knowledge to accelerate internal innovation, and expand the markets for external use of innovation, respectively.” As opposed of more traditional closed innovation where new products and services are initiated within the firm and then selected whether they end up in the markets, open innovation paradigm states that companies should seek ideas external sources, i.e. outside the boundaries of the company, for insourcing ideas, and also to provide output channels for valuable ideas that originate within the company not supported by current business model through licensing, spin-offs, etc. In order to expand their research and development (R&D) activities and to capture their potential, companies have become more interested in participating in R&D alliances. Despite its great potential, open innovation has some disadvantages. In order to realize the potential in R&D alliances, employees of the companies have to spend significant amount of time with each other and this intensive co-operation possesses risk of increased competition in the future. Especially for companies that are in coopetitive relationship, that is to say that companies compete and cooperate at the same time, leakage of valuable knowledge can cause real problems. Do competing companies find it unattractive to co-operate in open innovation programs? Concepts market commonality and resource similarity are used to establish competitive situation. Thus, the research question can be specified into: do companies that have similar resources and/or common markets find it unattractive to co-operate in open innovation programs? In this study, empirical context is Finnish energy and environment open innovation platform called CLEEN that focuses in cleantech research. Unit of analysis is a firm-dyad, participation data was obtained from CLEEN and data considering market commonality and resource similarity from secondary sources. Logistic regression analysis was used to analyse participation of firm-dyads in same research program using market commonality and resource similarity as explanatory factors. As a result, there is indication that, at least in cleantech collaboration in public-private cluster, market commonality and resource similarity have positive effect on participation in open innovation. Interaction between these factors regarding participation was not observed. However, these results can be only thought of as indicative and in the future these results should be reinforced by conducting a study based on primary data obtained directly from the companies. Keywords: Collaboration, co-operation, open innovation, market commonality, resource similarity ii iii Contents Abstract .................................................................................................................................................... i! 1.! Introduction ...................................................................................................................................... 1! 2.! Theory and hypotheses ................................................................................................................... 2! 3.! Methods ........................................................................................................................................... 3! 3.1.! Data collection .......................................................................................................................... 3! 3.2.! Dependent variables ................................................................................................................ 4! 3.3.! Independent variables .............................................................................................................. 4! 3.3.1! Market commonality .......................................................................................................... 4! 3.3.2! Resource similarity ............................................................................................................ 5! 3.4.! Control variables ...................................................................................................................... 5! 3.5.! Data preparation ....................................................................................................................... 6! 3.6.! Analysis method ....................................................................................................................... 7! 3.7.! Overview of the data ................................................................................................................ 7! 4.! Results ............................................................................................................................................. 9! 4.1.! Logistic regression models ....................................................................................................... 9! 5.! Discussion ..................................................................................................................................... 10! 5.1.! Limitations and further research ............................................................................................. 11! 6.! Conclusion ..................................................................................................................................... 13! References ........................................................................................................................................... 14! Appendix 1. Histograms of variables .................................................................................................... 16! 1 1. Introduction Open innovation, as defined by Chesbrough (2006, p. 1), “is the use of purposive inflows and outflows of knowledge to accelerate internal innovation, and expand the markets for external use of innovation, respectively.” As opposed of more traditional closed innovation where new products and services are initiated within the firm and then selected whether they end up in the markets, open innovation paradigm states that companies should seek ideas external sources, i.e. outside the boundaries of the company, for insourcing ideas, and also to provide output channels for valuable ideas that originate within the company not supported by current business model through licensing, spin-offs, etc (Chesbrough et al., 2006). Summing up previous research, Faems et al (2010, p. 3) wrote that in order to expand their research and development (R&D) activities and to capture their potential, companies have become more interested in participating in R&D alliances, namely “formal agreements between otherwise independent companies“. Despite its great potential, open innovation has some disadvantages. In order to realize the potential in R&D alliances, employees of the companies have to spend significant amount of time with each other to be able to transfer meaningful codified and tacit knowledge, and this intensive co-operation possesses risk of increased competition in the future, resulted by unintended transfer of knowledge (Faems et al., 2010). Especially for companies that are in coopetitive relationship, that is to say that companies compete and cooperate at the same time (Bengtsson & Kock, 2000), this leakage of valuable knowledge can cause real problems. This paper asks the question: do competing companies find it unattractive to co-operate in open innovation programs? Concepts market commonality and resource similarity from framework presented by Chen (1996) are used to establish competitive situation. Thus, the research question can be specified into: do companies that have similar resources and/or common markets find it unattractive to co-operate in open innovation programs? Empirical context for the study is Finnish energy and environment open innovation platform called CLEEN that focuses in cleantech research. In short, CLEEN is one of six Strategic Centres for Science, Technology and Innovation (SHOKs) that are aimed to be “a unique cooperation platform for innovative companies and spearheading research” in Finland. These public-private partnerships are limited to selected few fields that have significance to society as well as businesses. (CLEEN, 2014) In SHOKs, there are various research programs that companies are able to participate in. For example CLEEN has seven different research programs focusing different topics, with content such as “Carbon Capture and Storage Program (CCSP)” (CLEEN, 2014). 2 2. Theory and hypotheses In order to study whether competition affects the companies’ willingness to participate in open innovation programs, competition between companies has to be established and thereafter it can be evaluated whether it affects participation in open innovation programs. In this research competitive situation between a firm-dyad, i.e. between a pair of firms, is analysed with framework proposed by Chen (1996). The framework consists of two dimensions between a firm dyad: market commonality and resource similarity. Market commonality is the main component used in competitor analysis and it is thought of as driver of rivalry (Chen, 1996). In his article, Chen (1996, p. 106) defined market commonality as “degree of presence that a competitor manifests in the markets it overlaps with the focal firm”. There are multiple reasons why companies with common markets might want to participate in open innovation. For example, they might collaborate within certain segment of overall market while at the same time competing in other segments, such as Toyota collaborating with PSA Peugeot Citroën in small car segment in Europe (PSA Peugeot Citroën, 2012). As an another example, Chen (1996) argued that in order to avoid retaliation, if market commonality is high, companies are less likely to initiate an attack, but will respond to an attack initiated by a competitor to defend for potential threat. It is also known that companies have tendency to imitate companies they compete with (Lieberman & Asaba, 2006). Thus, when company A participates in open innovation platform, competing company B would respond, in defending its position or just to imitate competition, by also joining to the same program to avoid getting behind. Hypothesis 1: The greater the market commonality is between the firm-dyad, the more willing they are to co-operate in open innovation Chen (1996, p. 107) defined resource similarity as “the extent to which a given competitor possesses strategic endowments comparable […] to those of focal firm”. In their research for management strategies of R&D alliances, Faems et al (2010) summed up risks that participating in R&D alliance poses for the company: as knowledge between companies become more similar, company risks more intensive competition in the future. This means that rational companies should participate in programs they are able to utilize the results of the research program when the time comes and not be defeated by competition. To clarify the point further, there has to be a way of utilizing the research results in a way that company can use the results to its own advantage. In this case, new information is created by collaborative effort, R&D capabilities of individual companies are especially important to effectively utilize the research results. This is because having 3 internal R&D is thought to increase absorptive capacity, i.e. being able to understand new external ideas and to co-develop ideas further (Dahlander & Gann, 2010). An example of this might be a large company understanding the value of cutting edge research done by the smaller company and by collaboration the smaller company also benefits by getting access to other resources of a larger company such as large supply chain network, therefore crating a possible win-win situation. Thus, having similar R&D resources is necessary to be able to compete for external ideas. Hypothesis 2: The greater the resource similarity between the firm-dyad, the more willing they are to co-operate in open innovation If by collaboration companies increase the risk of competition in the future because of unwanted knowledge transfer, companies might be hesitant to collaborate with companies that are also participating in similar markets with them. Accordingly, when companies share similar markets and resources, the risk of unwanted knowledge transfer could be too high if the other company is a direct competitor, making the option of co-operation in open innovation unattractive. Hypothesis 3: The greater the overlap between market commonality and resource similarity between the firm-dyad, the less willing they are to co-operate in open innovation 3. Methods 3.1. Data collection Out of possible SHOKs, CLEEN was chosen as source of data because the companies participating in that cluster are quite heterogeneous group, as opposed to, for example FIMECC, where most of the companies are quite similar. Data for this study were collected from various secondary sources. Information about participation of various companies to SHOK programs and companies’ shareholder stakes were obtained from CLEEN. Industry descriptions and main line of business were obtained from company data search engines created by Kauppalehti, Fonecta, and from Joint business information system of the Finnish Patent and Registration Office and the Tax Administration (Fonecta, 2014; Kauppalehti, 2014; PRH, 2014). Most of the financial data were collected from Bureau van Dijk’s Orbis database, supplemented by Fonecta’s database when available (Bureau van Dijk, 2014). Before defining variables, it must be understood that because unit of analysis is a firm-dyad, i.e. a firm pair, all values of the variables are in relation to the other firm. For example, regarding the dependent variable, that is introduced in next, as company either participates in the same research program or doesn’t, values for variable that is formed for the firm-dyad are whether the firm-dyad does participate 4 in the same research program, thus is a pair, or don’t participate in the same research program, thus is not a pair. 3.2. Dependent variables Participation in open innovation within the same research program can be seen from data obtained from CLEEN. CLEEN has various research programs that companies can participate in, and there is data in which program the company has participated in. Thus, for the firm-dyad, the dependent variable is a binary variable and has two values: 1 for “is a pair” or 0 for “is not a pair”. 3.3. Independent variables All continuous independent variables consist of variables that contain values of relative difference between the firm-dyad to be able to determine if the two companies are similar or different from each other. This means, for example, that for continuous variable revenue per employee, nominator is the firm from the dyad with larger value and denominator the firm with smaller value, thus relative difference between the two firms for variable revenue per employee is obtained. Also, logarithmic transformation was applied to base variables revenue, number of employees, age, total fixed assets, and intangible fixed assets to better describe the nature of the variables, i.e. the proportional increase of matters more than absolute increase. For example 10 000€ more revenue is more significant when current revenue is 50 000€ than one million euros. From these transformed variables, proportional variables such as revenue per employee were calculated. 3.3.1 Market commonality Main line of business is classified by Finnish Tax Administration and it is based on Standard industrial classification created by Finnish Statistics (Finnish Patent and Registration Office, 2014). Only one main line of business classification is given to one firm based on how most of the value added is created by the company (Tilastokeskus, 2008). The classification has five levels of granularity, first describing the broad category, such as “Professional, scientific and technical activities”, and the last more specific category, such as “Research and experimental development on biotechnology”. In this study, two-digit industry code was used. As the maker of the statistics is concerned about classifying the company most accurate way possible, overlap of main line of business classification is most likely the best readily available secondary source indicator for market commonality between the firm-dyad. The variable has two values: the classification is the same for firm-dyad or it is not the same. Line of business description is a description about what company does and is provided to Finnish Patent and Registration Office by the company and therefore it should reflect in which markets 5 company operates. From those descriptions keywords are mined using text mining software to obtain similarities between the descriptions of the firm-dyad, where more similar descriptions would indicate that companies have common markets (Meyer, Hornik, & Feinerer, 2008). Text mining was performed using tm-package in R (Meyer et al., 2008). First, whitespace was removed, content was transformed into lower case, punctuation was removed, Finnish stop words were removed, and the words were stemmed. Second, words that didn’t describe the line of business were removed, such as word “company”. Third, synonymous words were processed so that only one word describing one thing remained. Finally, using the remaining words, the Euclidian distances were calculated and merged to the firm-dyad dataset. Smaller the value is for Euclidian distance between the companies, the more similar company description is and thus should indicate larger market commonality between the firm-dyad and vice versa. Validity of the calculated distances for main line of business description were qualitatively evaluated by randomly selecting 24 firm-dyads by name and then evaluating on scale from 1 to 3 whether they are competitors, might be competitors, or are not competitors, respectively. From this evaluation correlation of 0.31 was obtained between the subjective evaluation and calculated Euclidian distance. It is to say that there is some correlation, but the indicator could be made better by fine-tuning text mining further. 3.3.2 Resource similarity Non-physical assets, such as patents and brands, are called intangible assets. These are assets that are generated within firm and have significant impact on companies’ market value in today’s economy (Austin, 2007), thus should give some indication about R&D capabilities of the company. Ratio of intangible fixed assets to total fixed assets is used to describe how dependent company is of its intangible assets. If this ratio is high, company is more likely to produce more complex products and to have more R&D capabilities, thus would gain more utility from participating in open innovation. For the firm-dyad the overlap of these ratios between firm-dyad should indicate similar resources. Revenue per employee is obtained by dividing logarithmic transformation of company’s revenue by the logarithmic transformation of number of people it employs. The higher the ratio, the more profitable the company is and it might indicate that company also spends more money on R&D spending. Based on this logic, the more the ratios overlap between the firm-dyad, the more similar their resources are. 3.4. Control variables All control variables for the firm-dyad are mean values of the variables to control for their absolute value in order to find relative differences between the companies. 6 Logarithmic transformations of firm size (number of employees) and age are used as controlling variables to rule out effects of company maturity. Number of employees are used as indicator of size because it should resemble value added quite well and coordination costs of the company are dependent on the number of employees rather than, say, revenue (Kumar, Rajan, & Zingales, 1999). Also, some of the companies are shareholders of CLEEN that is the company organizing the open innovation projects. Owning shares of the open innovation platform might indicate that company feels more positively about open innovation, which as result might affect willingness to participate in research programs. Thus, whether company is a shareholder of CLEEN is used as control variable that had three values for the firm-dyad: “both”, “other” and “neither”, to indicate if both firms are shareholder, just the other one, or neither, respectively. Participation frequency to the research programs was calculated, indicating in how many programs company participated in. This is important because as company participates in more and more programs, the more companies it collaborates with. Finally, as opposed to relative values calculated of variables intangible fixed assets to total fixed assets and revenue per employee as independent variables, mean values of those same variables between the firm-dyad were used as controls. 3.5. Data preparation To prepare the data for analysis, data handling was performed in six steps. First, institutions such as universities were filtered from the initial data to gain dataset that contained only companies. Second, only company line of business descriptions in Finnish were used, thus two descriptions written in Swedish had to be discarded. Third, companies that participated in a certain research program were identified and labelled. Fourth, all possible firm-dyad combinations were made and this firm dyad data was merged with the data containing variables such as size, age, and revenue. Fifth, to prepare the data for analysis, all of the variables for the firm dyad were treated so that the values were in relation to the other company of the dyad. Continuous variables such as age were treated so that age of the older company was divided by age of the younger company, thus the relative age difference was obtained. In addition, mean values of the variables of the firm dyad were calculated. Categorical variables were evaluated on the basis if the values were the same or not, or for example, in the case of whether company is a shareholder, the labelling was “both”, “one”, and “neither”. Obtaining the data of the line of business description was a bit trickier. For this variable, Euclidian distances of the main line of business descriptions of the firm-dyad were calculated according to previous explanation. Finally, missing data was replaced with mean values of each variable. 7 3.6. Analysis method Since the dataset contained categorical variables as well as continuous variables data was analysed using logistic regression. After preparing the data, there were both relative difference of the variable between the firm dyad and mean values of the firm dyad for continuous variables. For example, the dataset contained both relative difference of revenue and the mean value of revenue between the firm dyad. Analyses were made separately for both relative values and mean values to determine whether there was difference in relative or absolute values of variables, e.g. is it the relative size that makes companies prefer co-operating or do larger firms, for example, tend to co-operate with each other. After this, odds ratios were generated from each of the models. 3.7. Overview of the data Data consisted of cross-sectional data of companies and institutions that participated in different research programs in SHOK co-operation platform CLEEN that has been set up for cleantech research. All in all there were seven research programs that had the participation data from 130 different institutions and companies. After removing all of the institutions and bankrupt companies, 102 different companies remained. Table 1 summarizes participation in different research programs before and after processing of data. One company can participate in more than one research program which explains the difference between total participants in table 1 and overall number of companies. Table 1. Participation data to different research programs Program name All participants Participants after processing % of companies SGEM 28 19 60 % FCEP 17 8 68 % MMEA 49 33 60 % CCSP 27 15 64 % EFEU 17 12 59 % BIOENERGIA 37 26 59 % REMA 20 19 51 % Total: 195 132 60 % The companies participating in the research cluster cannot be considered as a random sample of Finnish companies. According to Statistics Finland most of the companies in Finland are SMEs (Statistics Finland, 2014), but in CLEEN research cluster, large companies represent disproportionally large portion of overall companies. This is quite understandable, because research conducted in 8 CLEEN has long-term focus, which might make it impossible for smaller firms to participate actively because of smaller amount of resources. Means, standard deviations and correlations of the non-dyadic transformed variables between continuous variables are presented in table 2. It can also be seen from the table 2 that the data for continuous variables was quite complete, except for amount of R&D budget to revenue, which was discarded from analysis due to missing data. For categorical variables data was fully complete except for variable main line of business description where two descriptions were in Swedish and had to be discarded. Table 2. Means, standard deviations, and correlations N Mean s.d. 1. 2. 3. 1. Revenue per employee 97 16528 26696 2. R&D budget to revenue 12 0.02 0.03 -0.12 3. Intangible fixed assets to total fixed assets 87 0.88 0.10 0.11 0.54 4. Number of employees 99 34790 19391 -0.75*** -0.21 0.05 5. Age 102 26330 0.91 -0.34*** 0.10 6. Participation frequency 102 46023 0.64 -0.25* -0.09 0.01 4. 5. -0.06 0.46*** 0.44*** 0.30** * p < 0.05, ** p <0.01, *** p < 0.001 In tables 3 means, standard deviations and correlations of transformed dyadic data is shown, which is the data that was used in the logistic regression. Distribution of variables is shown in the appendix 1. Table 3. Means, standard deviations, and correlations between variables of firm dyads N 1. Number of employees 5148 (mean) 2. Age (mean) 5151 3. Participation frequency 5151 (mean) 4. Revenue per employee 5044 (mean) 5. Intangible fixed assets to 4437 total fixed assets (mean) 6. Euclidean distance 5151 7. Revenue per employee 4656 (difference) 8. Intangible fixed assets to total fixed assets 4350 (difference) * p < 0.05, ** p <0.01, *** p < 0.001 Mean s.d. 1. 2. 3. 4. 5. 6. 4.95 1.82 2.72 0.64 0.46*** 1.26 0.45 0.44*** 0.30*** 4.45 1.98 -0.73*** -0.33*** -0.24*** 0.88 0.08 0.05** -0.05*** 0.01 0.08*** 6.55 1.69 -0.04** -0.03* -0.05*** 0.06*** -0.07*** 1.77 0.98 -0.26*** -0.15*** -0.05** 0.69*** 0.13*** 0.03* 0.99 0.42 0.09*** 0.03* 0.09*** -0.09*** -0.51*** -0.04** 7. -0.04* 9 4. Results 4.1. Logistic regression models 102 different companies can be paired in 5151 different ways. Six different logistic regression models were calculated from the data, after which odds ratios were calculated from the models. Table 4 presents six models where model 1 contains only the control variables. Model 2 includes both control variables and independent variables, and models 3 to 6 present effect of the different moderator variables. Table 4. Odds ratios of logistic regression models and their significance levels Variable Control variables (Intercept) Number of employees (mean) Age (mean) Shareholder - Neither Shareholder - One Participation frequency (mean) Revenue per employee (mean) Intangible fixed assets to total fixed assets (mean) Control ____________Control & Independent variables___________ Model 1 Model 2 Model 3 Model 4 Model 5 Model 6 0.01*** 0.89** 1.20** 0.94 0.78 4.18*** 1.04 0.03** 1.00 1.14 0.79 0.68* 4.17*** 1.20** 0.03** 0.99 1.15 0.78 0.67* 4.15*** 1.19** 0.03** 1.00 1.15 0.80 0.68* 4.17*** 1.20** 0.03** 1.00 1.15 0.80 0.68* 4.16*** 1.20** 0.01** 1.00 1.15 0.78 0.67* 4.15*** 1.20** 4.60** 1.86 1.92 1.88 1.82 1.90 2.56*** 4.75*** 0.76*** 2.56*** 2.56*** 1.01 1.01 1.01 0.97 1.24 0.83* 0.84* 0.83* 0.73* 0.83* 0.61 0.62 0.55 0.61 2.05 Market commonality Main line of business classification - Same Euclidean distance Resource similarity Revenue per Employee (difference) Intangible fixed assets to total fixed assets (difference) Moderator variables Business classification * Revenue per employee (difference) Business classification * Intangible fixed assets to total fixed assets (difference) Euclidean distance * Revenue per employee (difference) Euclidean distance * Intangible fixed assets to total fixed assets (difference) Number of observations 2 Pseudo-R (Nagelkerke) Deviance 0.67 2.87 1.02 0.83 4342 3486 3486 3486 3486 3486 0.101 0.125 0.126 0.128 0.125 0.125 4575.76 3597.54 3595.19 3596.55 3597.04 3596.72 * p < 0.05, ** p <0.01, *** p < 0.001 Hypothesis 1 stated that the greater the market commonality is between the firm-dyad, the more willing they are to co-operate in open innovation. As it can be seen from table 4, variable main line of 10 business classification, which represented market commonality, has odds ratio greater than one and is highly significant (p < 0.001) in models 2, 3, 4, 5, and 6. Thus, hypothesis 1 was supported. Hypotheses 2 stated that willingness of companies to co-operate would increase when resource similarity increases. According to table 2, odds ratios for difference in revenue per employee is below one and significant (p < 0.05) in models 2, 3, 4, 5, and 6, meaning that as difference between resources gets larger, companies are less willing to co-operate. Thus, hypothesis 2b gets weak support from the data. Although not directly comparable because of different measurement scales, it seems that according to the larger odds ratios, market commonality seems to play larger role than resource similarity. Hypothesis 3 predicted interaction between resource similarity and market commonality. It stated that the greater the overlap between market commonality and resource similarity between the firm-dyad, the less willing they are to co-operate in open innovation. Hypothesis 3 doesn’t get any support from the data since all of the odds ratios for moderator variables are nonsignificant. 5. Discussion By collaborating in R&D alliances, companies can supplement their own research efforts. SHOKs are designed to provide public-private collaboration platform for research that has longer-term goals than companies usually tend to have. Participation in open innovation is not risk-free, since usually intensive collaboration is required, therefore at the same time companies bear the risk of exposing their critical skills and experience to their competitors. In this study, collaboration of firm-dyads were studied based on secondary data and program participation data. According to the results of this study, the co-operation of firm-dyads can be explained with market commonality and resource similarity. It seems that main line of business classification made by Statistics Finland is quite good predictor and can be thought as describing well the industry where the company operates. However, the descriptions are quite broad, especially in this study where the second level categories were used. For example the description of category 71000, where largest portion of companies in this study were classified (appendix 1), is “Architectural and engineering activities; technical testing and analysis”. Therefore it is quite obvious that under that label there are a lot of different kinds of companies that probably are not direct competitors, but rather companies that have complementary skills, and thus provide fertile grounds for collaboration. Consequently, it is not entirely clear whether this actually describes market commonality or rather resource similarity, but because of the limitation of this study mainly to secondary data, it was probably the best pick to describe market commonality. On the other hand, the main line of business description was not very 11 good indicator. This is probably because the descriptions are made by companies themselves and there is a bias in trying to fit in everything that company might do business in, whereas the classifications are given by statisticians who are concerned with creating representative statistics. Resource similarity was predicted by two variables: revenue per number of employees and by ratio of intangible fixed assets to total fixed assets. Of the two variables, revenue per number of employees is better indicator, giving some indication that resource structure of the company matters. One explanation could be that high revenue per employee indicates that company is more profitable and thus has more resources to participate in R&D projects than less profitable companies. Concerning the hypotheses, hypotheses 1 and 2 assumed linear relationship, which might not be necessarily true as it might bet that it might be that relationship is non-linear. For example, having totally different resources, or in the other end, identical resources might make collaboration unattractive, but having somewhat similar resources could turn the situation more attractive. However, given the scope of this study these relationships were not investigated. 5.1. Limitations and further research This study is limited in various ways: the data is gathered from secondary sources, variables capture the companies in very rough cut manner, and so forth. There are at least four things to test the theory more accurately. First, unfortunately not enough data could be obtained for R&D budget compared to revenue that might have been more accurate measure of company’s current R&D capabilities rather than assets that have been accumulated over the years. This probably isn’t that big of a problem since age was controlled for. Also, comparing the companies might not be the best way to evaluate the situation, since even if resources of the companies might differ, they still might input same amount of resources into the open research programme. Second, it would be important to tap into how companies themselves perceive the situation, i.e. do they feel they are in competition with the companies participating in the open innovation and if so, with whom. Dataset used in this study wasn’t fine-grained enough to really know about the competitive situation between single firm-dyad. Third, based on the dataset used in this study we cannot identify all potential participants that might have been able to participate but decided not to. We only know the companies that participated and which programs they cooperated in. Thus, to gain more proper understanding, the potential participants that didn’t participate would need to be identified, and the reasons behind not participating 12 would need to be investigated. For example, currently there is huge bias toward large companies, so why didn’t larger number of small companies participate? Fourth, market commonality could be perhaps taken better into account if some kind of scale is devised to take also one-digit classification into account. For example, code 25000 and code 24000 are now classified as equally different at codes 25000 and 89000. It could be taken into account that the first two codes have more common markets than the latter two codes. There are also factors that might cause the need to revise the theory. The content of the research programs that companies are participating in might be very critical, which this study doesn’t account at all. Content of the research program might be potentially very important factor when companies decide whether to participate in the program or not. It doesn’t seem too unrealistic to think that program that is directly in the core of the company’s business raises more concern about potential competition than a program that is outside company’s main focus. For example, CLEEN program Future Combustion Engine Power Plant surely is more important than Measurement, Monitoring and Environmental Efficiency Assessment - another CLEEN program - for a company that manufactures combustion engines. Assessing the content of the program might also make it possible to classify programs in terms of motivation behind participating in the programme. For example, one kind of program might be interesting for the company as a means to alleviate carbon credit expenses, and some other program might be able to provide solution to lower risk in certain market segment of the company. Also, it might be worthwhile taking into account other kind of relationship between the companies than just competition. For example, it might be that steel manufacturer has collaborated with a software company previously that also participates in the research program. Finally, CLEEN and other SHOKs are public-private collaboration platforms and as such one cannot generalize the results to only collaboration between companies. Even though institutions have been filtered out from the analysed data, it might be that two companies that otherwise wouldn’t want to work together are in same research program only because they want to have access to the resources of the institutional participant. Also, the programs are not completely funded by private sector which might also make it easier for companies to participate, thus generalizability of the results decreases. Based on the results it seems that firm-dyad level analysis provides good grounds for further research. But in order to address the limitations mentioned above, researchers would probably have to ask companies directly more specific questions rather than to rely on secondary data. Also, different ways of measuring market commonality and resource similarity should be devised. For example, as a 13 measure of market commonality, news published about the company could be mined from newspapers and see which companies are mentioned alongside that company. 6. Conclusion There is some evidence that in public-private cleantech research, similar resources and common markets have a positive impact on participating in collaborative research between firm-dyads. Because these results are based mostly on secondary data, they should be taken only an aid to designing more accurate study. In the future, researchers should obtain primary data from the companies themselves to get more accurate knowledge about how company employees actually perceive the competitive situation and the motivations behind participating in collaborative research programs. 14 References Austin, L. (2007). Accounting for intangible assets. University of Auckland Business Review, 9(1), 63– 72. Bengtsson, M., & Kock, S. (2000). ” Coopetition” in business Networks—to cooperate and compete simultaneously. Industrial Marketing Management, 29(5), 411–426. Bureau van Dijk. (2014). Orbis | Detailed global private company information | BvD - Bureau van Dijk. Retrieved June 28, 2014, from http://www.bvdinfo.com/en-gb/products/company- information/international/orbis Chen, M.-J. (1996). Competitor analysis and interfirm rivalry: Toward a theoretical integration. Academy of Management Review, 21(1), 100–134. Chesbrough, H., Vanhaverbeke, W., & West, J. (2006). Open innovation: Researching a new paradigm. Grat Britain, King’s lynn, Norfolk: Oxford university press. CLEEN. (2014). CLEEN - Cluster for Energy and Environment. Retrieved May 3, 2014, from http://www.cleen.fi/en/ Dahlander, L., & Gann, D. M. (2010). How open is innovation? Research Policy, 39(6), 699–709. Faems, D., Janssens, M., & Van Looy, B. (2010). Managing the Co-operation–Competition Dilemma in R&D Alliances: A Multiple Case Study in the Advanced Materials Industry. Creativity & Innovation Management, 19(1), 3–22. doi:10.1111/j.1467-8691.2010.00546.x Finnish Patent and Registration Office. (2014). PRH - Toimiala. Retrieved May 3, 2014, from http://www.prh.fi/fi/kaupparekisteri/useinkysytyt/toimiala.html Fonecta. (2014). Yrityshaku - Kattavat B2B yritystiedot - Fonecta Finder. Retrieved June 28, 2014, from http://www.finder.fi/ Kauppalehti. (2014). Yrityshaku | Kauppalehti.fi. http://www.kauppalehti.fi/5/i/yritykset/yrityshaku/ Retrieved June 28, 2014, from 15 Kumar, K. B., Rajan, R. G., & Zingales, L. (1999). What determines firm size?. National bureau of economic research. Retrieved from http://www.nber.org/papers/w7208 Lieberman, M. B., & Asaba, S. (2006). Why Do Firms Imitate Each Other? Academy of Management Review, 31(2), 366–385. doi:10.5465/AMR.2006.20208686 Meyer, D., Hornik, K., & Feinerer, I. (2008). Text mining infrastructure in R. Journal of Statistical Software, 25(5), 1–54. PRH. (2014). YTJ - Yritys- ja yhteisötietojärjestelmä. Retrieved June 28, 2014, from http://www.ytj.fi/yrityshaku.aspx PSA Peugeot Citroën. (2012, July 23). PSA Peugeot Citroën and Toyota announce a new cooperation on light commercial vehicles in Europe. Retrieved August 28, 2014, from http://www.psapeugeot-citroen.com/en/media/press-releases/psa-peugeot-citroen-and-toyota-announce-newcooperation-light-commercial-vehicles-europe Statistics Finland. (2014). Suomen virallinen tilasto (SVT): Yritysrekisterin vuositilasto. Retrieved July 6, 2014, from http://www.stat.fi/til/syr/meta.html Tilastokeskus. (2008). Toimialaluokitus TOL 2008. Helsinki: Tilastokeskus. 16 Appendix 1. Histograms of variables Figure 1. Continuous variables 17 Figure 2. Categorical variables
© Copyright 2026 Paperzz