Functionality: scraping multiple Amazon web pages for your item, setting a price range for your item, CSV/JSON output, and more!
- Python (download it here: https://www.python.org/downloads/)
- A Python package manager (e.g., pip) (installation guide: https://pip.pypa.io/en/stable/installation/)
- Git (installation guide: https://git-scm.com/book/en/v2/Getting-Started-Installing-Git)
- bs4 (BeautifulSoup4)
- requests
- lxml
- rich
mkdir "Amazon Scraper"
cd "Amazon Scraper"
git clone https://github.com/Moffi-bit/Amazon-Scraper.git
py -m pip install bs4 requests lxml rich
If you encounter issues trying to run commands using "py", you may have to use "python" or "python3" instead. Your system's PATH environment variable may also be the issue.
cd Amazon-Scraper
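After the install step, you can confirm the third-party dependencies are importable before running the scraper. This is a small sketch using only the standard library; it is not part of the repo itself.

```python
import importlib

# Attempt to import each third-party dependency the scraper needs.
# A "missing" entry means the pip install step above has not been run yet.
status = {}
for name in ("bs4", "requests", "lxml"):
    try:
        importlib.import_module(name)
        status[name] = "OK"
    except ImportError:
        status[name] = "missing -- rerun the pip install step"

for name, state in status.items():
    print(f"{name}: {state}")
```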
usage: scrape.py [-h] [-i ITEM [ITEM ...]] [-l LOWER] [-u UPPER] [-n NUM] [-o OUT] [-c]
Note: Adding -c to the arguments will cause the program to print the cheapest item at the end of scraping.
-i or --item:
py scrape.py -i xbox
OR
py scrape.py --item xbox
Tells the program that the item you're looking for is an Xbox.
-l or --lower:
py scrape.py -l 50
OR
py scrape.py --lower 50
Tells the program that the price minimum (lower bound) is 50.
-u or --upper:
py scrape.py -u 500
OR
py scrape.py --upper 500
Tells the program that the price maximum (upper bound) is 500.
-n or --num:
py scrape.py -n 100
OR
py scrape.py --num 100
Tells the program that the number of item links you want to pull data from is 100.
-c:
py scrape.py -c
Tells the program that you want it to output the cheapest item after it has scraped all links.
-o or --out:
py scrape.py -o test
OR
py scrape.py --out test
Tells the program that you want the product information to be written to a CSV/JSON file named test. If this argument is not provided, the output defaults to "out.csv"/"out.json".
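For reference, the usage line above maps naturally onto Python's argparse. The sketch below is an assumption about how such a parser could be declared, based only on the documented flags; the actual scrape.py may define them differently.

```python
import argparse

# Parser sketch matching the documented usage line; defaults and types
# here are guesses for illustration, not the project's actual code.
parser = argparse.ArgumentParser(prog="scrape.py")
parser.add_argument("-i", "--item", nargs="+", help="search terms for the item")
parser.add_argument("-l", "--lower", type=float, help="price minimum (lower bound)")
parser.add_argument("-u", "--upper", type=float, help="price maximum (upper bound)")
parser.add_argument("-n", "--num", type=int, help="number of item links to scrape")
parser.add_argument("-o", "--out", default="out", help="output file name")
parser.add_argument("-c", action="store_true", help="print the cheapest item at the end")

# Equivalent of: py scrape.py -i xbox -l 200 -u 400 -n 100
args = parser.parse_args(["-i", "xbox", "-l", "200", "-u", "400", "-n", "100"])
print(args.item, args.lower, args.upper, args.num, args.out, args.c)
```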
Get all items within a price range (USD):
py scrape.py -i xbox s -l 200 -u 400 -n 100
Get all items above a price (USD):
py scrape.py -i yoga mats -l 10 -n 150
Get all items below a price (USD):
py scrape.py -i playstation -u 400 -n 100
Get all items no matter the price:
py scrape.py -i car tires -n 100
Get the cheapest item of the items scraped and write the information to a csv/json named "gfxcards":
py scrape.py -i rtx 3090 -n 50 -c -o gfxcards
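The bound behavior in the examples above (only -l keeps items above a price, only -u keeps items below, both give a range) can be sketched as a small filter. This is a hypothetical helper for illustration, not code from this repo.

```python
# Hypothetical price filter mirroring how -l/--lower and -u/--upper
# behave in the examples above: an omitted bound is simply not applied.
def in_range(price, lower=None, upper=None):
    if lower is not None and price < lower:
        return False
    if upper is not None and price > upper:
        return False
    return True

prices = [9.99, 49.99, 250.00, 399.99, 450.00]

# The -l 200 -u 400 case keeps only prices inside the range.
selected = [p for p in prices if in_range(p, lower=200, upper=400)]
print(selected)  # -> [250.0, 399.99]
```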
Format:
title,price,rating,reviews,availability,url
The CSV contains ALL of the relevant items scraped
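With that column layout, the output file can be processed with the standard csv module. The sketch below uses a made-up sample row, assuming only the header fields listed above.

```python
import csv
import io

# Sample data using the documented column layout; the row itself is
# invented for illustration.
sample = (
    "title,price,rating,reviews,availability,url\n"
    "Example Item,249.99,4.5,1234,In Stock,https://example.com/item\n"
)

# In practice you would open("out.csv") instead of a StringIO.
with io.StringIO(sample) as f:
    rows = list(csv.DictReader(f))

# Prices are stored as text, so convert before comparing.
cheapest = min(rows, key=lambda r: float(r["price"]))
print(cheapest["title"], cheapest["price"])
```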
- Pulling product information
- CSV Output and functionality for choosing which CSV the data goes to
- Multiple page scraping
- Return the cheapest item
- Dynamic headers (Special thanks to @mumanye for adding this functionality)
- Using proxies?
- JSON Output and functionality for choosing which JSON the data goes to
- Improve the consistency of finding product information
- Adding class functionality so you do not have to use args (e.g., see demo.py for a code example)
- Functionality so the user can choose to scrape an additional n links (update: this is now done through the CLI)
- Individual folders for all of these CSV and JSON files (to keep the directory clean)
- Processing product information using multiple threads to increase speed
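For the multithreading item on the list, one common approach is the standard library's ThreadPoolExecutor. This is a generic sketch of the technique, not this project's implementation; parse_product here is a stand-in for whatever per-link processing the scraper does.

```python
from concurrent.futures import ThreadPoolExecutor

# Stand-in for the real per-link work (fetching and parsing a product
# page); here it just echoes the URL back.
def parse_product(url):
    return {"url": url, "ok": True}

urls = [f"https://example.com/item/{i}" for i in range(10)]

# A thread pool processes several links concurrently; map() preserves
# the input order of the URLs in its results.
with ThreadPoolExecutor(max_workers=4) as pool:
    results = list(pool.map(parse_product, urls))

print(len(results), "links processed")
```

Threads suit this workload because fetching pages is I/O-bound, so the GIL is not a bottleneck.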