A library created to scrape Google Images.
If you are looking for other image scrapers, JJLimmm has created image scrapers for Gettyimages, Shutterstock, and Bing.
Visit their repo here: https://github.com/JJLimmm/Website-Image-Scraper
- Google Chrome
- Python3 packages (Pillow, Selenium, Requests)
- Windows (other operating systems are untested)
- Open command prompt
- Clone this repository (or download)
git clone https://github.com/ohyicong/Google-Image-Scraper
- Install Dependencies
pip install -r requirements.txt
- Edit your desired parameters in main.py
  - search_keys = Strings that will be searched for
  - number_of_images = Desired number of images
  - headless = Chrome GUI behaviour. If True, there will be no GUI
  - min_resolution = Minimum desired image resolution
  - max_resolution = Maximum desired image resolution
  - max_missed = Maximum number of failed image grabs before the program terminates. Increase this number to ensure large queries do not exit early.
  - number_of_workers = Number of sectioned jobs created. Restricted to one worker per search term and thread.
- Run the program
python main.py
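To make the parameters above more concrete, here is a minimal sketch of how they might interact. It is not the project's actual code. The helper names (`resolution_ok`, `collect`, `run_jobs`) are hypothetical, the values are illustrative, and several behaviours are assumptions: resolutions are treated as `(width, height)` tuples, a failed grab is modelled as `None`, the run stops once the number of misses exceeds `max_missed`, and duplicate search terms are collapsed so that each term gets one job.

```python
from concurrent.futures import ThreadPoolExecutor

# Illustrative values only; the names mirror the parameters in main.py.
search_keys = ["cat", "t-shirt", "cat"]
number_of_images = 5
headless = True
min_resolution = (0, 0)        # assumed to be (width, height)
max_resolution = (1920, 1080)  # assumed to be (width, height)
max_missed = 10
number_of_workers = 2


def resolution_ok(size, min_res, max_res):
    """Return True if (width, height) lies within both bounds, inclusive."""
    width, height = size
    return (min_res[0] <= width <= max_res[0]
            and min_res[1] <= height <= max_res[1])


def collect(candidates, wanted, min_res, max_res, missed_limit):
    """Keep in-range sizes until `wanted` are found or too many grabs fail.

    `candidates` yields (width, height) tuples, or None for a failed grab
    (an assumption made for this sketch). Out-of-range images are skipped
    without counting as misses.
    """
    kept, missed = [], 0
    for size in candidates:
        if len(kept) >= wanted or missed > missed_limit:
            break
        if size is None:
            missed += 1
        elif resolution_ok(size, min_res, max_res):
            kept.append(size)
    return kept


def run_jobs(keys, workers, job):
    """Run `job` once per unique search term on a thread pool."""
    unique_keys = list(dict.fromkeys(keys))  # drop duplicates, keep order
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(zip(unique_keys, pool.map(job, unique_keys)))
```

In this sketch, raising `max_missed` lets a long query tolerate more failed grabs before stopping, and setting `number_of_workers` above the number of unique search terms brings no benefit because each term is handled by a single job.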
This project was created to bypass Google Chrome's new restrictions on web scraping from Google Images. To use it, define your desired parameters in main.py and run it from the command line:
python main.py
Despite what the video says, this program will not run from within VS Code. It must be run from the command line.
This program will install an updated webdriver automatically. There is no need to install your own.