A simple python web scraper.

Last update: May 06, 2022

Overview

Dissec

A simple python web scraper.

It gets a website and its contents and parses them with the help of bs4.

Installation

To install the requirements, run the following commands;

To change directories: cd Dissec

To make the files executable: chmod +x *

To install the requirements: python requirements.py

Usage

Run the script using: python dissec.py

It'll prompt you, asking for a website that you want to scrape. Enter any website that you want to scrape. The website can be with or without https, ex:

www.mega.nz or https://www.mega.nz, in this, both of them will work the same.

Owner

Hmmmmm.. *nothing here*

GitHub Repository

Demonstration on how to use async python to control multiple playwright browsers for web-scraping

Playwright Browser Pool This example illustrates how it's possible to use a pool of browsers to retrieve page urls in a single asynchronous process. i

8 Oct 27, 2022

A crawler of doubamovie

豆瓣电影 A crawler of doubamovie 一个小小的入门级scrapy框架的应用，选取豆瓣电影对排行榜前1000的电影数据进行爬取。 spider.py start_requests方法为scrapy的方法，我们对它进行重写。 def start_requests(self):

1 Oct 05, 2021

Twitter Scraper

Twitter's API is annoying to work with, and has lots of limitations — luckily their frontend (JavaScript) has it's own API, which I reverse–engineered. No API rate limits. No restrictions. Extremely

45 Dec 30, 2022

High available distributed ip proxy pool, powerd by Scrapy and Redis

高可用IP代理池 README　｜　中文文档本项目所采集的IP资源都来自互联网，愿景是为大型爬虫项目提供一个高可用低延迟的高匿IP代理池。项目亮点代理来源丰富代理抓取提取精准代理校验严格合理监控完备，鲁棒性强架构灵活，便于扩展各个组件分布式部署快速开始注意，代码请在release

5.2k Jan 03, 2023

Scrapping the data from each page of biocides listed on the BAUA website into a csv file

1 Nov 30, 2021

A multithreaded tool for searching and downloading images from popular search engines. It is straightforward to set up and run!

🕳️ CygnusX1 Code by Trong-Dat Ngo. Overviews 🕳️ CygnusX1 is a multithreaded tool 🛠️ , used to search and download images from popular search engine

32 Dec 31, 2022

A python script to extract answers to any question on Quora (Quora+ included)

quora-plus-bypass A python script to extract answers to any question on Quora (Quora+ included) Requirements Python 3.x

10 Aug 18, 2022

Discord webhook spammer with proxy support and proxy scraper

3 Feb 27, 2022

crypto currency scraping

SCRYPTO What ? Crypto currencies scraping (At the moment, only bitcoin and ethereum crypto currencies are supported) How ? A python script is running

15 Sep 01, 2022

mlscraper: Scrape data from HTML pages automatically with Machine Learning

🤖 Scrape data from HTML websites automatically with Machine Learning

798 Dec 29, 2022

Examine.com supplement research scraper!

ExamineScraper Examine.com supplement research scraper! Why I want to be able to search pages for a specific term. For example, I want to be able to s

15 Dec 06, 2022

This program scrapes information and images for movies and TV shows.

Media-WebScraper This program scrapes information and images for movies and TV shows. Summary For more information on the program, read the WebScrape_

1 Dec 05, 2021

Kusonime scraper using python3

Features Scrap from url Scrap from recommendation Search by query Todo [+] Search by genre Example # Get download url from kusonime import Scrap

2 Jan 28, 2022

Automated Linkedin bot that will improve your visibility and increase your network.

LinkedinSpider LinkedinSpider is a small project using browser automating to increase your visibility and network of connections on Linkedin. DISCLAIM

2 Nov 26, 2021

A Pixiv web crawler module

Pixiv-spider A Pixiv spider module WARNING It's an unfinished work, browsing the code carefully before using it. Features 0004 - Readme.md updated, co

1 Nov 14, 2021

Script used to download data for stocks.

This script is useful for downloading stock market data for a wide range of companies specified by their respective tickers. The script reads in the d

71 Oct 04, 2022

Pro Football Reference Game Data Webscraper

Pro Football Reference Game Data Webscraper Code Copyright Yeetzsche This is a simple Pro Football Reference Webscraper that can either collect all ga

6 Dec 21, 2022

Web Scraping images using Selenium and Python

Web Scraping images using Selenium and Python A propos de ce document This is a markdown document about Web scraping images and videos using Selenium

3 Jul 01, 2022

Find thumbnails and original images from URL or HTML file.

Haul Find thumbnails and original images from URL or HTML file. Demo Hauler on Heroku Installation on Ubuntu $ sudo apt-get install build-essential py

150 Oct 15, 2022

一个m3u8视频流下载脚本

一个Python的m3u8流视频下载脚本介绍 m3u8流视频日益常见，目前好用的下载器也有很多，我把之前自己写的一个小脚本分享出来，供广大网友使用。写此程序的目的在于给视频下载爱好者提供一个下载样例，可直接调用，勿再重复造轮子。使用方法在python中直接运行程序或进行外部调用 import

0 Oct 10, 2021

A simple python web scraper.

Related tags

Overview

Dissec

Installation

Usage

Owner

Demonstration on how to use async python to control multiple playwright browsers for web-scraping

A crawler of doubamovie

Twitter Scraper

High available distributed ip proxy pool, powerd by Scrapy and Redis

Scrapping the data from each page of biocides listed on the BAUA website into a csv file

A multithreaded tool for searching and downloading images from popular search engines. It is straightforward to set up and run!

A python script to extract answers to any question on Quora (Quora+ included)

Discord webhook spammer with proxy support and proxy scraper

crypto currency scraping

mlscraper: Scrape data from HTML pages automatically with Machine Learning

Examine.com supplement research scraper!

This program scrapes information and images for movies and TV shows.

Kusonime scraper using python3

Automated Linkedin bot that will improve your visibility and increase your network.

A Pixiv web crawler module

Script used to download data for stocks.

Pro Football Reference Game Data Webscraper

Web Scraping images using Selenium and Python

Find thumbnails and original images from URL or HTML file.

一个m3u8视频流下载脚本