Python is going to bite the flipkart soon... :)
execute the crawler only in this way:
python main.py --search=django book
This is a simple python based project which is used to scrape the flipkart search results page. Also it does the scrolling to navigate into next page to crawl the data.This project has been implemented with mighty technologies like mongodb, redis, python, flask. It will be helpful to collect some historical data from flipkart.If you keep this crawler running in your server, this automatically scrapes the data for the user's keyword from flipkart website. Once you started gathering the data, you could do run many analysis like price range variation, offers, review analysis etc..
in screenshots
directory we have uploaded our first version screenshots. you can take a look and give us the feedback.
- We love python(2.7).
- phantomjs is phenomenon
- xpath is amazing.
- redis always rocks
- mongodb is sweet, if you are apetite
- flask simply flashes as usual
- selenium is damn serious
Upon all the above, Ubuntu is our living earth.
:)
lxml, mongodb, redis are the required system packages to be installed to execute it, after just run
pip install -r requirements.txt
.
$ ./start.sh
in your shell will do the job.It runs the redis_worker.py
and flask
app.
you can even customize for your case.
- Simple interface
- Re-Crawl option available to the user
- Case-insensitive search
- easy expandable
we have been working for further releases for the extensive use of customers.
###Demo:
###Re-Crawl:
Thanks for your time.