You can download this code by clicking the button below.
This code is now available for download.
This function uses Scrapy's Selector and LinkExtractor to extract links from HTML content.
Technology Stack : Scrapy, Selector, LinkExtractor
Code Type : The type of code
Code Difficulty : Intermediate
def extract_links_from_html(html_content):
from scrapy import Selector
from scrapy.spiders import CrawlSpider, Rule
from scrapy.linkextractors import LinkExtractor
selector = Selector(text=html_content)
link_extractor = LinkExtractor()
links = link_extractor.extract_links(selector)
return links