Hey I'm Pierlugi and I write for The Web Scraping Club.
I'm the co-founder of Databoutique.com and want to share with you my 10+ years of experience with web scraping.

Featured Posts:

From 0 to 2 Billion Prices scraped per months

In this post of The Web Scraping Club blog, I’ll write about what we did at Databoutique.com to scale from 0 to 2 Billion prices per month scraped, bootstrapped, and with a minimal team of developers.

Read more...

A brief August wrap up of the latest news on web scraping

A brief august wrap up of the latest news about web scraping from all around the world.

Read more...

The starter toolkit for a python web scraping developer (2022)

Web scraping, as we all know, it's a discipline that evolves over time, with more complex anti-bot countermeasures and new tools to use.Let's find together what tools can't be missed for a python web scraper developer.

Read more...

Is web scraping becoming harder?

Do you have the feeling that web scraping is becoming more difficult and expensive? I do, especially in the last 12 months, I've noticed an increasing number of websites using advanced anti-bot solutions

Read more...

THE LAB #1: Scraping data from an app

I usually write in this newsletter about how to extract data from websites but what if our target is an app with no web interface?

Read more...

The costs of web scraping

There's no doubt in stating that cloud computing enabled a wide range of new opportunities in the tech space, and this is true also for web scraping.

Read more...

THE LAB #2: scraping data from a website with Datadome and xsrf tokens

A real world use case of a simple scraper that does not get blocked by Datadome

Read more...

Interview #1: Neha Setia - Zyte

Welcome to the first of our interviews, we'll break the ice with Neha Setia (@nehasetianagpal), developer advocate at Zyte, where she conducts workshops and enablement sessions for system integrators and clients at events.

Read more...

What's a proxy server?

Straight from Wikipedia, "In computer networking, a proxy server is a server application that acts as an intermediary between a client requesting a resource and the server providing that resource".

Read more...

THE LAB #3: Scraping Cloudflare protected websites

Cloudflare is an American company, based in San Francisco, offering several services like DDoS mitigation services, Distributed DNS, Content Distribution Networks, and also anti-bot protection for websites.

Read more...