Scrapping the data from each page of biocides listed on the BAUA website into a csv file

Last update: Nov 30, 2021

Related tags

Overview

Baua Biocides Scraper

Scrapping the data from each page of biocides listed on the BAUA website (https://www.baua.de/DE/Biozid-Meldeverordnung/Offen/offen.html) into a csv file.
A windows standalone client is avalaible in the dist folder

About the project

What's the problem?

Baua website contains many usefull data for biocides domain, but the website only allows you to search product by product and it is not easy to find and get some informations with over 80,000 products listed

The idea

Facilitate the data manipulation with providing a csv file with all data scraped from Baua website.

How does it work ?

The user start the program.
The program extract data from Baua website.
A csv file containing data are created.

Roadmap

This project was created after a request and is not intended to evolve. Nevertheless you can fork the project to improve it by yourself and propose them via the project pull requests. or make a suggestion via the project issues.

Build with

Programming language : Python 3.10.0
Scraping Framework : Scrapy 2.5.1
HTTP library : Requests 2.26.0
Standalone Builder : PyInstaller 4.7

Demo

You can use the windows standalone client in the dist folder

Version management

We use a semantic version management, that is a version number MAJOR.MINOR.CORRECTIVE :

the MAJOR version number when there are non backward compatible changes,
the MINOR version number when there are backward compatible feature additions,
the FIX version number when there are backwards compatible bug fixes.

See SignMail tags For more info: semver.org

Authors

Eric De Maria - Numio - Initial work

License

This project is licensed under the GNU GPL 3 license - See the LICENSE file for more details.

Instagram_scrapper - This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or excel file easily.

Instagram_scrapper This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or exce

5 Oct 17, 2022

Scrapes mcc-mnc.com and outputs 3 files with the data (JSON, CSV & XLSX)

mcc-mnc.com-webscraper Scrapes mcc-mnc.com and outputs 3 files with the data (JSON, CSV & XLSX) A Python script for web scraping mcc-mnc.com Link: mcc

1 Nov 7, 2021

AssistScraper - program for /r/nba to use to find list of all players a player assisted and how many assists each player recieved

5 Nov 25, 2021

EBay-email-tracker - Scapes an entire search page of a particular item on eBay and sends regular updates to an email address

Introduction This is a project I built with the sole intent to learn more about

1 Jan 14, 2022

An application that on a given url, crowls a web page and gets all words, sorts and counts them.

Web-Scrapping-1 An application that on a given url, crowls a web page and gets all words, sorts and counts them. Installation Using the package manage

1 Jan 16, 2022

Releases(v0.1.0)

v0.1.0(Nov 30, 2021)

The windows standalone client for the first public version of Baua Biocides Scraper
Source code(tar.gz)
Source code(zip)
Baua_Biocides_Scraper_Windows.zip(16.02 MB)

Scrapping the data from each page of biocides listed on the BAUA website into a csv file

Related tags

Overview

Baua Biocides Scraper

About the project

What's the problem?

The idea

How does it work ?

Roadmap

Build with

Demo

Version management

Authors

License

You might also like...

Instagram_scrapper - This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or excel file easily.

Scrapes mcc-mnc.com and outputs 3 files with the data (JSON, CSV & XLSX)

AssistScraper - program for /r/nba to use to find list of all players a player assisted and how many assists each player recieved

A Python module to bypass Cloudflare's anti-bot page.

Screenhook is a script that captures an image of a web page and send it to a discord webhook.

A Python module to bypass Cloudflare's anti-bot page.

Python script who crawl first shodan page and check DBLTEK vulnerability

EBay-email-tracker - Scapes an entire search page of a particular item on eBay and sends regular updates to an email address

An application that on a given url, crowls a web page and gets all words, sorts and counts them.

Releases(v0.1.0)

v0.1.0(Nov 30, 2021)

Owner

Eric DE MARIA

SearchifyX, predecessor to Searchify, is a fast Quizlet, Quizizz, and Brainly webscraper with various stealth features.

TarkovScrappy - A nifty little bot that lets you know if a queried item might be required for a quest at some point in the land of Tarkov!

CRI Scrape is a tool for get general info about Italian Red Cross in GAIA Platform

Crawler do site Fundamentus.com com o uso do framework scrapy, tanto da aba detalhada como a de resumo.

Pseudo API for Google Trends

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

Scrapping Connections' info on Linkedin

Simple tool to scrape and download cross country ski timings and results from live.skidor.com

A list of Python Bots used to extract data from several websites

Dictionary - Application focused on word search through web scraping

A scalable frontier for web crawlers

Script used to download data for stocks.

Newsscraper - A simple Python 3 module to get crypto or news articles and their content from various RSS feeds.

Complete pipeline for crawling online newspaper article.

12306抢票脚本

Python web scrapper

A Very simple free proxy list scraper.

Binance Smart Chain Contract Scraper + Contract Evaluator

Open Crawl Vietnamese Text

Simply scrape / download all the media from an fansly account.