CIS Sandra Bsiri Michaela Geierhos Christoph Ringlstetter CENTRE FOR INFORMATION AND LANGUAGE PROCESSING Structuring Job Search via Local Grammars Job Search Problem Job Announcements are Structured Documents Real World Web Documents are Still Semantically Flat BAG-OF-WORDS Indexing Leads to NOISE Local Grammars Generate Structured Forms Forms Provide Efficient Information Access ORGANIZATION NAME AND URL Description of Organizational Contexts with Local Grammars Window of Predefined Size for Syntactical or Lexical Features Simple Keyword Search vs. Local Grammar Approach: Fewer Ambiguities Locating Online Job Offers FOCUSED CRAWLER Typical Terminology of Job Offer Nominal Phrases semantically structuring job descriptions Frozen Expressions and SubLanguage Vocabulary Job offer – yes or no? Domain Name Segmentation PHRASES AND TERMINOLOGY Check Up: Recognized Organization Name in URL? ORGANIZATION NAME AND URL Search on Annotated Job Offers ORGANIZATION NAME AND URL FOCUSED CRAWLER Identifying Organization Home Pages PHRASES AND TERMINOLOGY FOCUSED CRAWLER PHRASES AND TERMINOLOGY IDENTIFICATION Local Grammar that models the context of the word “leader”. Pipeline of Classification and Transformation CLASSIFICATION AND TRANSFORMATION
© Copyright 2026 Paperzz