CLARIN Value Proposition

D5.4 CLARIN Value Proposition
Document information
Title: CLARIN Value Proposition
ID: CLARINPLUS-D5.4 (CE-2016-0847)
Authors: Bente Maegaard, Dieter van Uytvanck, Steven Krauwer
Responsible WP leader: Martin Wynne
Contractual Delivery Date: 2016-08-31
Actual Delivery Date:
Distribution: Public
Document status in workplan: Deliverable
Project information
Project name: CLARIN-PLUS
Project number: 676529
Call: H2020-INFRADEV-1-2015-1
Duration: 2015-09-01 to 2017-08-31
Website: www.clarin.eu
Contact address: [email protected]
CLARIN-PLUS
D5.4 CLARIN Value Proposition
Table of contents
1 Executive Summary
2 Introduction
3 Value proposition and stakeholders
4 Detailed value proposition at different levels
4.1 For researchers
4.1.1 Discovering and depositing resources
4.1.2 Advanced tools and computing facilities
4.1.3 Federated Login: easier access to more resources
4.1.4 Access to expertise for researchers
4.1.5 Online tutorials etc.
4.1.6 Workshops and seminars, mobility grants
4.1.7 Support for data management plans
4.2 For CLARIN institutions and institutes
4.2.1 Guaranteed access to CLARIN central services
4.2.2 Participation in H2020 projects
4.2.3 Access to cloud services
4.2.4 Access to expertise for CLARIN institutions and institutes
4.3 For CLARIN ERIC Members (countries, ministries)
5 Values for Europe
6 Concluding remarks
References
Appendix: CLARIN ERIC, Common Language Resources and Technology Infrastructure, cf. ESFRI 2016
1 Executive Summary
The value proposition will serve to show the main stakeholders (i.e. the funders, the
users and the providers) what the benefits are of joining CLARIN ERIC for a country and
for its research community.
In this document we first give a short formulation of the CLARIN value proposition,
followed by different elaborations and examples, taking into account the different
perspectives of different categories of stakeholders.
We will present the value proposition and its elaborations to interested new
stakeholders. The value proposition texts will be integrated in dissemination materials
such as the website, folders, poster, etc., with the goal of attracting users, funders, new
consortia, centres and member countries. Furthermore, a clear understanding of the
value proposition will help define CLARIN's priorities for investments in
sustainability and new developments in the coming years.
2 Introduction
The purpose of this document is to describe the elements that constitute the CLARIN
Value Proposition, taking into account that different stakeholders may assign different
values to different elements of what we offer.
In Section 3 we will first present our Mission Statement, describing our ultimate
objectives, our Vision Statement, describing the state we want to achieve as opposed to
the present state, and the resulting Value Proposition that explains the benefits of
joining CLARIN for our stakeholders.
In the same section we will also characterize the various groups of stakeholders.
As different stakeholders may have different interests and priorities, we will present
different elaborations of our overall Value Proposition in Section 4, for a number of
typical groups of stakeholders. In Section 5 we turn to the contributions at the European
level. It should be noted that some groups of stakeholders may have overlapping
interests, which may result in some repetition across sections.
In the appendix we provide a one-page description with many facts that can be
useful in communication contexts.
This document will serve as a basis for documents (reports, brochures, posters,
websites, etc.) that will be used to attract new members, users and other contributors,
and it should also serve CLARIN itself as an instrument to define priorities and
initiate new developments based on the needs and expectations of the various
stakeholders.
3 Value proposition and stakeholders
Value proposition: CLARIN makes digital language resources available to scholars,
researchers, students and citizen-scientists from all disciplines, especially in the
humanities and social sciences, through single sign-on access. CLARIN offers long-term
solutions and technology services for deploying, connecting, analyzing and sustaining
digital language data and tools. CLARIN supports scholars who want to engage in
cutting-edge data-driven research, contributing to a truly multilingual European Research Area.
Mission: Create and maintain an infrastructure to support the sharing, use and
sustainability of language data and tools for research in the humanities and social
sciences.
Vision: All digital language resources and tools from all over Europe and beyond are
accessible through a single sign-on online environment for the support of researchers in
the humanities and social sciences.
Three comments:
Disciplines: Even if the origin of CLARIN is in the humanities and the social sciences, it
should be noted that the offerings of CLARIN can have relevance for researchers from
many other disciplines where language plays a role, not necessarily as an object of study,
but also as a carrier of information or an instrument for communication.
Open Science: More generally, CLARIN does not see itself as a stand-alone facility, but
rather as a player in making the vision that underlies the emerging European Open
Science Policies a reality, interconnecting researchers across national and discipline
borders by offering seamless access to data and services.
Stakeholders: The construction and operation of an infrastructure involves many
different stakeholders, each with their own interests and expectations, ranging from
individual researchers, research institutions, data archives and infrastructure service
providers to funding bodies and governments, to mention just a few.
4 Detailed value proposition at different levels
4.1 For researchers
4.1.1 Discovering and depositing resources
Researchers can search for language resources via metadata, in the CLARIN
“catalogue”, the Virtual Language Observatory (VLO), or in the data itself (Content
Search). The VLO contains references to more than 800,000 resources, the majority of
which are hosted at CLARIN centres, but the VLO also contains references to other
relevant resource collections.
Metadata Search: www.clarin.eu/vlo
Content Search: www.clarin.eu/contentsearch
As CLARIN ERIC now has many different countries as members, the VLO covers many
languages, both national and regional languages, as well as languages studied in those
countries.
The advantage of the VLO is faster identification of relevant resources, allowing
researchers to re-use resources that already exist, rather than having to produce their
own. Additionally, the VLO allows users who create or collect their own data sets to
make them more visible to others through publication of the metadata in the VLO.
Without the VLO, researchers would have to use other catalogues, or general-purpose
search engines, often with less precise results. The VLO has a high number of query
facets which can be used to guide the search and make the results more precise.
Long-term preservation: One of the fundamental services of the CLARIN infrastructure is
making sure that language resources can be archived and made available to the
community in a safe and sustainable manner. To help researchers store their
resources (e.g. corpora, lexica, audio and video recordings, annotations, grammars, etc.)
in a sustainable way, at least one CLARIN data centre in each country offers a
depositing service. These centres are willing to store the resources in their repository
and assist with the technical and organisational details. This has a wide range of
advantages:
• Long-term archiving: In every CLARIN member country there are one or more
  CLARIN data centres that have committed themselves to offering a storage
  guarantee for a longer period of time.
• The resources can be cited easily and reliably as they will get a Persistent
  Identifier.
• All resources and their metadata will be equally accessible and searchable
  throughout the CLARIN infrastructure, irrespective of their physical location.
• All resources and their metadata can be combined, analysed and enriched with
  various linguistic tools (e.g. automated part-of-speech tagging, information
  extraction, phonetic alignment or audio/video analysis), irrespective of the
  physical location of data and tools.
Researchers as providers: Researchers are not just consumers of data and tools, but also
providers, in that they are encouraged to share their data and tools with others, if
necessary in a protected way, so that others can build further on their results. This is
supported by the availability of repositories (see above) and by facilitating data citation
and licensing (see below).
Data citation: It is a major task to produce a corpus or a data collection. More and more,
the scientific world recognises the value of such contributions, and mechanisms have
been developed for data citation that encourage creators of corpora or other data
collections to publish their data. CLARIN is a well-suited platform for this type of publication
and subsequent citation. This is because CLARIN offers a good search tool (the VLO) for
better publicity and better sharing, and because resources are referred to by Persistent
Identifiers instead of notoriously unstable URLs.
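As an illustration of citing by Persistent Identifier rather than by URL, the following Python sketch turns a handle into a resolvable link and appends it to a citation. It is not part of any CLARIN service: the function names and the citation layout are assumptions made for this example, and the handle used as sample input is the one under which the CLARIN ERIC Statutes are published (see Section 4.3).

```python
from urllib.parse import quote

# Public Handle resolver, in the form used elsewhere in this document.
HANDLE_RESOLVER = "http://hdl.handle.net/"


def handle_to_url(pid: str) -> str:
    """Turn a handle such as 'hdl:1839/00-DOCS.CLARIN.EU-75' into a resolvable URL."""
    pid = pid.strip()
    # Accept the common 'hdl:' prefix as well as a bare 'prefix/suffix' handle.
    if pid.lower().startswith("hdl:"):
        pid = pid[4:]
    prefix, sep, suffix = pid.partition("/")
    if not sep or not prefix or not suffix:
        raise ValueError(f"not a handle of the form prefix/suffix: {pid!r}")
    # Percent-encode characters that are unsafe in a URL path, keeping '/'.
    return HANDLE_RESOLVER + quote(f"{prefix}/{suffix}", safe="/")


def cite(creator: str, year: int, title: str, pid: str) -> str:
    """Format a simple data citation (hypothetical layout) ending in the PID link."""
    return f"{creator} ({year}). {title}. {handle_to_url(pid)}"


if __name__ == "__main__":
    print(cite("A. Author", 2016, "Sample corpus", "hdl:1839/00-DOCS.CLARIN.EU-75"))
```

Because the citation carries the handle rather than the address of a repository page, it can stay valid when the hosting centre reorganises its website, as long as the handle record is kept up to date.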
Licensing: CLARIN centres make data available through licensing and clear conditions
for use. This involves CLARIN centres making deals with rights owners, signing
Deposition License Agreements which include End User License Agreements,
categorizing licenses in clearly marked license categories, and writing Terms of Service.
Guidance is offered to creators of data who want to select the most appropriate licensing
conditions when publishing their data.
4.1.2 Advanced tools and computing facilities
CLARIN offers state-of-the-art tools and on-line services for many languages. These
help researchers annotate, analyse and publish their language data. Automatic
analyses such as annotation can be performed faster and more easily on large
amounts of data. Examples of the functionality offered:
(1) Advanced analysis and visualizations for large data sets that may help to gain
    deeper insights, e.g.:
    a. DiaCollo: collocation analysis in diachronic perspective
    b. Stylo: state-of-the-art tool for stylometric analysis
(2) Faster automated analysis, leading to more time for the actual research, e.g.:
    a. WebMAUS: Automatic Segmentation and Labelling of Speech Signals
       over the Web
    b. AVAtech: audio and video recognizers
(3) Reproducible scientific analysis flows, leading to more data sharing and better
    replicability of research results, e.g.:
    a. Mind Repository: a platform for researchers to share their papers
       together with the data they have used and the scripts to analyse them
    b. Web service orchestration engines
(4) Access to first-class corpora through specialized query interfaces, e.g.:
    a. The Corpus of Contemporary Dutch
    b. The ICAME corpora
For large and computation-intensive tasks, CLARIN can connect scientists to highly
ranked High Performance Computing (HPC) centres. Depending on the amount of
computing resources needed, the researcher might need to enrol in a competitive call. In
any case there are no costs for the use of the HPC facilities.
4.1.3 Federated Login: easier access to more resources
CLARIN has established a Service Provider Federation, i.e. a trusted network of identity
providers which offer “single sign-on”. This means that researchers can log in with their
institutional credentials to get access to password-protected language resources and
applications in other countries.
One advantage for researchers is that they save time, since they need only one set of
login credentials. Another advantage is that they also get access to protected
resources in other countries. Without this single sign-on, researchers would either have
no access to otherwise valuable resources, or they would have to apply for a separate
account at each repository.
Statistics show that about 60 unique visitors per day use the federated login to access
CLARIN resources [1].
You can find some of the online resources to which CLARIN gives easy access here:
https://www.clarin.eu/content/easy-access-protected-resources
4.1.4 Access to expertise for researchers
Complementary to the access to data and tools, CLARIN offers researchers access to
expertise through its Knowledge Sharing Infrastructure. First of all, there are helpdesks
and knowledge centres.
All CLARIN centres offering access to data and tools operate a helpdesk (in English)
where (potential) users can get information about the data and services offered, get help
in using the services, and report problems.
Certified CLARIN knowledge centres (https://www.clarin.eu/content/knowledgecentres)
offer access to expertise in specific areas, such as treebanking, speech analysis,
audio-visual fieldwork, language learning, the Danish language, the languages of Spain,
and the languages of Sweden.
Specialized committees bring together expert knowledge about various topics, such as
IPR and licences (CLIC, the CLARIN Legal Issues Committee) and standards (STAC, the
Standards Committee).
The CLARIN ERIC office operates a central helpdesk that will reply directly to requests
for help or information, or channel the requests to the best experts.
4.1.5 Online tutorials etc.
The Knowledge Sharing Infrastructure also offers online tutorials and explanations of
use cases in order to facilitate the uptake of new techniques by researchers. This
material is provided by experts from national CLARIN consortia, and wherever possible,
offered at least in English or (in the case of movies) with English subtitles.
The use cases serve to demonstrate the successful application of digital methods to
specific research questions, and to inspire researchers with similar or related questions.
4.1.6 Workshops and seminars, mobility grants
The final part of the knowledge sharing infrastructure consists of direct and physical
meetings of various types.
Through an open call for workshop proposals, members of the CLARIN community are
invited to submit proposals for workshops on strategic priorities for CLARIN, or for the
preparation of small development projects.
[1] Measured using Piwik during the first half of 2016 at the CLARIN discovery service. As not all
From time to time CLARIN ERIC organises workshops focused on specific topics where
members of the CLARIN community get together and work on topics of common
interest.
At the Annual CLARIN Conference, infrastructure providers and users from all CLARIN
countries exchange expertise and ideas.
A cross-border mobility scheme for short stays at CLARIN centres helps the exchange of
knowledge and expertise between staff of different centres, or between staff from
centres and users.
4.1.7 Support for data management plans
Since CLARIN’s main purpose is to facilitate the preservation and exploitation of
resources, CLARIN is in a good position to give professional support for writing a Data
Management Plan (DMP), e.g. for research funding applications, thereby improving the
quality of the application, and at the same time saving time and effort.
As an example, the German group CLARIN-D gave a presentation at the 2015 CLARIN
conference by Trippel and Zinn: DMPTY - A Wizard For Generating Data Management
Plans. The wizard is closely tied to the German repositories, but may be adapted to serve in
other countries.
4.2 For CLARIN institutions and institutes
4.2.1 Guaranteed access to CLARIN central services
CLARIN offers a wide range of central services from which all users and all CLARIN
centres in CLARIN ERIC member countries benefit. These services are developed and
offered in close collaboration between the national teams and CLARIN ERIC.
Here we describe very briefly some of those services which are offered to institutions.
NB: Not only access, but also influence on the development of a wide range of tailored
infrastructural services is offered.
• The unique CLARIN Service Provider Federation, which makes “single sign-on”
  possible across countries and institutions.
• The CLARIN Identity Provider, catering for academics without an institutional
  Identity Provider.
• The central Discovery Service [2] with high, i.e. 24/7, availability for the users. This
  service provides a user-friendly uniform interface to select a home organization
  during the federated login process.
• Robust Persistent Identifier services, including mirroring of the handle servers,
  via EPIC, the European Persistent Identifier Consortium [3].
• As an official EUDAT community, access to advanced EUDAT services: e.g. B2DROP
  (workspaces) and B2SAFE (safe replication).
• Services for creating, sharing and re-using rich metadata descriptions:
  Component Registry (creation and editing of metadata schemas), Concept
  Registry (registration and re-use of semantic definitions), OAI harvester
  (automated distribution of metadata files).
• The CLARIN centre registry, a central dashboard to register services and to
  connect them to automated checking and publication processes (monitoring,
  harvesting, content search, etc.).
• Professional monitoring services (Icinga), with high availability.
• Advanced distributed user statistics, using Piwik.
• Access to the common development environment (SVN code repository and Trac
  wiki).
[2] See https://www.clarin.eu/content/clarin-discovery-service
[3] See http://www.pidconsortium.eu
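To give centres an idea of what the OAI harvester mentioned in the list above asks of a repository, the following Python sketch builds the kind of request that a harvester speaking the OAI-PMH protocol sends. It is an illustration only, not the code of the CLARIN harvester: the function name and the endpoint address repository.example.org are hypothetical, and oai_dc is used as the metadata format because the protocol requires every repository to support it, whereas the format a given centre actually exposes is something we leave open here.

```python
from typing import Optional
from urllib.parse import urlencode


def list_records_url(base_url: str,
                     metadata_prefix: str = "oai_dc",
                     set_spec: Optional[str] = None,
                     resumption_token: Optional[str] = None) -> str:
    """Build an OAI-PMH ListRecords request URL for a repository endpoint."""
    if resumption_token is not None:
        # In OAI-PMH, resumptionToken is an exclusive argument: a follow-up
        # request for the next batch carries only the verb and the token.
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
        if set_spec is not None:
            # Optional: restrict harvesting to one set of records.
            params["set"] = set_spec
    return f"{base_url}?{urlencode(params)}"


if __name__ == "__main__":
    endpoint = "https://repository.example.org/oai"  # hypothetical endpoint
    print(list_records_url(endpoint))
    print(list_records_url(endpoint, resumption_token="abc123"))
```

A harvester would fetch the first URL, read the records from the XML response, and keep requesting with the returned resumption token until the repository stops supplying one.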
Institutions and institutes are not only the homes of users of data and services, but they
are also providers. Research institutes can share the results and spin-offs of their
research through CLARIN and make them more visible, and institutions with an
archiving function can use CLARIN to make their holdings, often created with public
funding, more widely visible and usable.
4.2.2 Participation in H2020 projects
CLARIN ERIC actively applies for participation in European projects when such
participation supports CLARIN’s general strategy and/or development. This means that
joining CLARIN ERIC opens enhanced opportunities for participation in European
funded projects, as staff of national CLARIN consortia may work on such projects on
behalf of CLARIN ERIC.
Currently (August 2016) CLARIN ERIC is coordinating the H2020 project CLARIN-PLUS,
which aims at further consolidation and expansion of the CLARIN infrastructure, both
technically and organisationally.
Additionally, CLARIN ERIC is participating in four other European projects:
PARTHENOS, EUDAT2020, LT-Observatory, and EUROPEANA-DSI. At this moment staff
from more than a dozen CLARIN ERIC Member or Observer countries are participating in
EU projects on behalf of CLARIN ERIC, for a broad variety of tasks in terms of topic and
size. More information about ongoing EU projects with CLARIN ERIC involvement can be
found on our website at https://www.clarin.eu/content/clarin-eu-projects.
4.2.3 Access to cloud services
For members, CLARIN ERIC provides free access to various commercially available cloud
services:
• Communication tools: Adobe Connect for video conferences, Slack for instant
  messaging.
• GitHub for code sharing, including private repositories.
• Basecamp for project management.
4.2.4 Access to expertise for CLARIN institutions and institutes
To institutions planning to establish themselves as CLARIN technical centres or aiming
to keep themselves up-to-date with recent developments, we offer various types of
support, including funding for travel costs involved.
• Best practice and other documentation on the creation and maintenance of
  infrastructure services is available, and will be continuously updated.
• Thematic workshops or tutorial sessions on specific infrastructure topics are
  organized to bring together centre staff from new and established centres.
• A mobility scheme for short (typically up to 1 week) exchanges between new and
  established centres.
4.3 For CLARIN ERIC Members (countries, ministries)
In addition to the above-mentioned benefits for the country’s researchers and
institutions, the following national strategic advantages can be identified:
CLARIN membership may contribute to a better position of the national language(s) in
the European and international context by making data more visible and more widely
accessible.
Visibility of cultural content increases: As much of a country’s cultural content is of a
linguistic nature or described by means of language, disclosing it through CLARIN will
make it more visible, both for the research community and for the public at large.
Conversely, membership of CLARIN ERIC will give researchers in the country full access
to the cultural heritage in other CLARIN countries.
As generic infrastructure services can be used across borders, CLARIN members can
benefit from the fact that the costs of construction and operation of such services can be
shared between members. Software of a generic nature can be shared between
members, and in many cases the development cost of more specific software can be
reduced by porting it between languages rather than developing it from scratch.
CLARIN members have direct influence on the decision-making about all aspects of the
infrastructure, ranging from construction and operation to longer-term evolution and
strategic priorities, as voting members in the General Assembly, the National
Coordinators’ Forum, etc., as well as through participation in the various committees.
More details about the governance of CLARIN ERIC can be found in the Statutes, see
http://hdl.handle.net/1839/00-DOCS.CLARIN.EU-75.
Access to expertise for new CLARIN countries:
For emerging national consortia in countries that have just joined or are preparing
themselves for joining CLARIN, CLARIN offers workshops on how to set up the CLARIN
infrastructure at the national level. Four workshops for new and future members are
being held in 2015-2017, and the information materials produced will be available on the
CLARIN website.
Additionally, a series of best practice papers is being prepared on the following themes:
• Building a national consortium
• Living up to the criteria for CLARIN centres
• Building the Knowledge Sharing Infrastructure at the national level
• Cost estimates for the construction and operation
• Professional profiles for typical CLARIN infrastructure functions
• IPR and ethical issues
• Interoperability and standards
5 Values for Europe
Excellence: CLARIN’s role is not just to offer infrastructure services per se, but also to
contribute to the further development of the European Research Area. In this context
CLARIN has a number of things to offer to support cutting-edge research and excellence
in European research.
Cross-border, cross-language research: CLARIN offers the same access to data and
services to all users. This is a necessity for joint research and for the replication of
research results, which has become a very important factor in many disciplines. At the
same time (and maybe even more importantly) CLARIN is also offering access to similar
data collections [4] and services to the European humanities and social sciences research
community at large. This creates an excellent starting point for joint cross-border and
cross-language research, and exchange of expertise. This will stimulate collaboration
among communities with a research agenda that requires the exploration of language data
from a comparative perspective. As language is a data type that captures linguistic,
cultural and social phenomena, the potential for multidisciplinary research will also
increase.
Open Science: CLARIN is not a stand-alone facility, but is well embedded in the
European research infrastructure landscape at large, and as such fully committed to the
European Open Science Policies, including re-use of data. As mentioned in 4.1.7, a focus
on data management is inherent in CLARIN’s data preservation activities. Through its
focus on language it offers opportunities for cross-discipline collaboration between
areas where language plays a role, both within the humanities and social sciences and in
other disciplines.
Various instruments will be used to support this, including:
• Workshops on typical cross-border, cross-language or cross-discipline topics in
  order to ensure a continuous flow of knowledge and expertise between different
  communities.
• Initiatives for or participation in EU projects with a strong language dimension
  that bring together researchers from national CLARIN consortia, thereby providing
  excellent input and expertise for these projects.
[4] Newspaper collections, collections of parliamentary debates, etc.
6 Concluding remarks
This value proposition will serve to show the main stakeholders (i.e. the funders, the
users and the providers) what the benefits of joining CLARIN ERIC are for a country and
for its research community.
We present the value proposition and its elaborations to interested new stakeholders
in the form of, e.g., information packages or presentations at events where relevant
potential stakeholders are present. It is also used to formulate texts for dissemination
material such as the website, folders, poster, etc., with the goal of attracting users,
funders, new consortia, centres and member countries.
Furthermore, it will also be used as a reference point for CLARIN itself, as a clear
understanding of the value proposition will help define CLARIN's priorities for
investments in sustainability and new developments in the coming years.
A final important comment: the value proposition should not be seen as set in stone.
The whole research landscape is changing fast in many respects, and CLARIN will adapt
itself to new developments if this is necessary to achieve its goals as reflected by its
mission and vision statements.
References
CLARIN ERIC Statutes
ESFRI 2016 Strategy Report on Research Infrastructures. (The appendix below is
page 82 of this report, part of the Landmark section. Note, however, that the list of
countries has been updated with the two countries that joined in the meantime: Latvia
and Hungary.)
Apart from this, throughout the document many links are made to the CLARIN ERIC
website, www.clarin.eu, and other websites where documentation is found. These links
are not repeated here.
Appendix
CLARIN ERIC, Common Language Resources and Technology Infrastructure, cf. ESFRI 2016
Single sign-on access to language data repositories and tools to explore, exploit,
analyse, enrich or combine them
TYPE: distributed, virtual
COORDINATING COUNTRY: NL
MEMBER COUNTRIES: AT, BG, CZ, DE, DK, EE, EL, FI, HU, IT, LT, LV, NL, NO, PL, PT, SE, SI
PARTICIPATING COUNTRIES: UK, ES, USA
OTHER MEMBERS: Dutch Language Union
TIMELINE
• ESFRI Roadmap entry: 2006
• Preparation phase: 2008-2011
• Construction phase: 2011-2015
• Operation phase: 2012
• Legal entity establishment: ERIC, 2012
ESTIMATED COSTS
• Capital value: not applicable
• Operation: 12 M€/year
HEADQUARTERS: CLARIN ERIC, Utrecht University, The Netherlands
WEBSITE: http://www.clarin.eu
DESCRIPTION
The Common Language Resources and Technology Infrastructure (CLARIN)
provides easy and sustainable access for scholars in the humanities and social
sciences to digital language data (in written, spoken or multimodal form)
and advanced tools to discover, explore, exploit, annotate, analyse or combine
them, independent of their location. To this end CLARIN is building a
networked federation of language data repositories, service centres and
centres of expertise, with single sign-on access for all members of the
academic community in all participating countries. Tools and data from
different centres are interoperable, so that data collections can be combined
and tools from different sources can be chained to perform complex
operations to support researchers in their work. CLARIN integrates existing
data and service centres, without major capital investments.
Currently the CLARIN infrastructure is under construction, but a growing
number of participating centres is already offering services to data, tools and
expertise. Since 2012, CLARIN is a European Research Infrastructure
Consortium (CLARIN ERIC).
ACTIVITY
The operation, maintenance and continuous expansion of the infrastructure is
carried out in the participating countries at the individual CLARIN centres
(currently over 30), which have to meet clearly defined technical and
organizational criteria, ensuring the coherence of the whole infrastructure.
CLARIN also works closely together with research communities in creating
and expanding a knowledge infrastructure that can support developers of
language resources and tools, as well as the end-users of the available data
and services. In total over 100 institutes across Europe are involved. CLARIN
has been offering operational access and services since 2012, and the offer
will grow as centres expand and new countries join. To stimulate the uptake
and increase the insight in the usability of the services, CLARIN participates in
the development of courseware and organises workshops and data camps
focussing on the processing and analysis of specific data types, such as
parliamentary data, social media data, and historical texts. The collaboration
with other RIs that support scholarly communities in Social Sciences and
Humanities (SSH) takes place via the collaboration in H2020 consortia.
CLARIN promotes open access to research data within the present legislation
and ethical standards and will develop outreach activities to stimulate the
reuse and repurposing of the data resources integrated.
IMPACT
CLARIN stimulates the reuse and repurposing of available research data,
thereby enabling scholars in SSH (including digital humanities) to increase
their productivity and, more importantly, open new research avenues in and
across disciplines that address one or more of the multiple societal roles of
language: as a carrier of cultural content and information, both synchronically
and diachronically, as a reflection of scientific and societal knowledge, as an
instrument for human communication, as one of the central components of
the identity of individuals, groups, cultures or nations, as an instrument for
human expression, or as an object of study or preservation. Through the
access and discovery services, CLARIN increases the potential impact of data
and tools produced with publicly funded projects. Working with CLARIN data
and tools will increase the skill levels for data analysis tasks among the new
generations of SSH students, which is likely to be welcomed by the data
science sector.