scraping

Here are 3,265 public repositories matching this topic...

scrapy / scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

python crawler framework scraping crawling web-scraping hacktoberfest web-scraping-python

Updated Apr 9, 2025
Python

feder-cr / Jobs_Applier_AI_Agent_AIHawk

AIHawk aims to ease the job hunt by automating the job application process. Using artificial intelligence, it enables users to apply for multiple jobs in a tailored way.

Updated Mar 14, 2025
Python

ScrapeGraphAI / Scrapegraph-ai

Python scraper based on AI

machine-learning ai scraping webscraping sc automated-scraper scraping-python gpt-3 gpt-4 llm scrapingweb llama3

Updated Apr 14, 2025
Python

soxoj / maigret

🕵️‍♂️ Collect a dossier on a person by username from thousands of sites

Updated Apr 11, 2025
Python

psf / requests-html

Pythonic HTML Parsing for Humans™

python html http scraping requests kennethreitz beautifulsoup lxml css-selectors pyquery

Updated Apr 16, 2024
Python

ultrafunkamsterdam / undetected-chromedriver

Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva / DataDome / Cloudflare IUAM)

testing chrome automation webdriver browser captcha scraping selenium navigator python3 cloudflare chromedriver anti-bot bot-detection cloudflare-bypass distil anti-detection

Updated Jun 25, 2024
Python

alirezamika / autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

python crawler machine-learning scraper automation ai scraping artificial-intelligence web-scraping scrape webscraping webautomation

Updated Oct 12, 2024
Python

Crawlee: a web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Supports both headful and headless modes, with proxy rotation.

python crawler scraper automation web-crawler headless scraping crawling pip web-scraping beautifulsoup web-crawling hacktoberfest headless-chrome apify playwright

Updated Apr 14, 2025
Python

adbar / trafilatura

Python and command-line tool to gather text and metadata on the Web: crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Updated Mar 17, 2025
Python

fake-useragent / fake-useragent

Up-to-date, simple user-agent faker with a real-world database

python agent user-agent scraping fake faker python3 user useragent user-agent-spoofer useragent-scraper

Updated Apr 14, 2025
Python

snooppr / snoop

Snoop: an intelligence-gathering tool based on open data (OSINT world)

Updated Apr 12, 2025
Python

aapatre / Automatic-Udemy-Course-Enroller-GET-PAID-UDEMY-COURSES-for-FREE

Do you want to LEARN NEW STUFF for FREE? Don't worry, with the power of web-scraping and automation, this script will find the necessary Udemy coupons & enroll you for PAID UDEMY COURSES, ABSOLUTELY FREE!

python scraper scraping selenium python3