Web Data Extraction and Crawling Techniques leaf node


URI

https://openalex.org/T12016

Label

Web Data Extraction and Crawling Techniques

Description

This cluster of papers focuses on techniques and technologies for extracting structured data from web pages, including web crawling, automatic wrapper generation, page segmentation, and mining data records. It also covers topics related to the hidden web, information retrieval, and content adaptation for different devices.

Implementation

@prefix oasubfields: <https://openalex.org/subfields/> .
@prefix openalex: <https://lambdamusic.github.io/openalex-hacks/ontology/> .
@prefix owl: <http://www.w3.org/2002/07/owl#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix skos: <http://www.w3.org/2004/02/skos/core#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .

<https://openalex.org/T12016> a skos:Concept ;
    rdfs:label "Web Data Extraction and Crawling Techniques"@en ;
    rdfs:isDefinedBy openalex: ;
    owl:sameAs <https://en.wikipedia.org/wiki/Web_scraping>,
        <https://openalex.org/T12016> ;
    skos:broader oasubfields:1710 ;
    skos:definition "This cluster of papers focuses on techniques and technologies for extracting structured data from web pages, including web crawling, automatic wrapper generation, page segmentation, and mining data records. It also covers topics related to the hidden web, information retrieval, and content adaptation for different devices."@en ;
    skos:inScheme openalex: ;
    skos:prefLabel "Web Data Extraction and Crawling Techniques"@en ;
    openalex:cited_by_count 420298 ;
    openalex:works_count 46647 .