Skip to content

ryscet/allegro_project

Repository files navigation

Allegro project

This project scrapes allegro archive and analyzes trends in sales history.

Scrapy is used for web scraping.

Scrapy saves the data to sqlite database (AllegroSales.db), flask reads it from there.

Flask is used to display the results in a web-app.

Project is running in docker.

Instructions for docker build

$ docker pull ryscet/allegroproject_web

$ docker run -p 5000:5000 ryscet/allegroproject_web

For linux go to localhost:5000 in the browser

For mac and windows check the ip of the virtual machine:

$ docker-machine ip and enter it instead of local host.

Scrapy instructions

The sqlite db is already in the project. However, it can be updated. Run scrapy from the uppermost project directory (allegro_project/)

$ scrapy crawl crawl_allegro

Sales Heatmap

This heatmap is produced from the data scraped from allegro, and is visible in the flask app under '/plots' url. alt text

  • The color represents the number of sold items.
  • x-axis respresents the history, binned into months.
  • y-axis respresents price range bins in pln.

About

This project scrapes sales history from allegro archive and analyzes data trends in time.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published