Skip to content

surenjanath/PriceTracker-Pricesmart

Repository files navigation

PriceSmart Products Web Scraper

This is a Python web scraper designed to fetch product data from PriceSmart's API at https://www.pricesmart.com and store them in a SQLite database. It utilizes asyncio and aiohttp for asynchronous API calls and SQLAlchemy for database operations.

About PriceSmart

PriceSmart is a membership-based warehouse club operator in the Caribbean and Central America. This scraper focuses on extracting product information from their Trinidad and Tobago (TT) store, including groceries, electronics, household items, and more.

Features

  • API-Based Scraping: Uses PriceSmart's internal API for reliable data extraction
  • Asynchronous Processing: Efficient fetching using asyncio and aiohttp
  • Rate Limiting: Implements proper rate limiting (1 request per 2 seconds) to be respectful
  • JSON Response Parsing: Robust JSON parsing for structured data extraction
  • SQLite Database Storage: Structured storage of product data using SQLAlchemy ORM
  • Error Handling: Comprehensive error handling with retry logic and connection recovery
  • JSON Response Saving: Saves raw JSON responses for debugging and verification
  • Data Analysis: Generates comprehensive analysis reports including brand analysis and pricing statistics
  • Multi-Category Support: Can scrape multiple product categories
  • Pagination Handling: Automatically handles pagination to get all available products

Requirements

  • Python 3.7 or higher
  • aiohttp
  • pandas
  • SQLAlchemy
  • beautifulsoup4
  • lxml
  • html5lib

Installation

  1. Clone the repository or download the files
  2. Install the required dependencies:
    pip install -r pricesmart_requirements.txt

Usage

  1. The scraper is pre-configured to scrape PriceSmart products from the Trinidad and Tobago store
  2. Run the scraper:
    python pricesmart_scraper.py
  3. The scraper will:
    • Fetch product data from PriceSmart's API with proper rate limiting
    • Parse the JSON responses and extract product information
    • Store results in the SQLite database
    • Generate analysis reports
    • Save JSON responses for debugging

Configuration

Categories

You can modify the categories list in the main section to scrape different product categories:

categories = ['G10D03', 'G10D04', 'G10D05']  # Add more category codes

Rate Limiting

The scraper implements a 2-second delay between requests to be respectful to the API:

  • Built-in rate limiting ensures 1 request per 2 seconds
  • Automatic delays between category processing

Database Settings

  • Database Name: PriceSmart_Products_Database.db
  • Location: Database/ folder
  • Customize by modifying Database_Name and Location variables

Data Structure

The scraper extracts the following data for each product:

  • pid: Product ID
  • title: Product title/name
  • price: Base price
  • thumb_image: Product thumbnail image URL
  • brand: Product brand
  • slug: URL slug
  • skuid: SKU ID
  • currency: Currency code (TTD)
  • fractionDigits: Price decimal places
  • master_sku: Master SKU reference
  • sold_by_weight_TT: Whether sold by weight
  • weight_TT: Weight value
  • weight_uom_description_TT: Weight unit of measure
  • sign_price_TT: Sign price
  • price_per_uom_TT: Price per unit of measure
  • uom_description_TT: Unit of measure description
  • availability_TT: Product availability
  • price_TT: Price in Trinidad and Tobago dollars
  • inventory_TT: Inventory count
  • promoid_TT: Promotion ID
  • category: Product category

Output Files

  • Database: Database/PriceSmart_Products_Database.db - SQLite database with all scraped product data
  • HTML Reports: pricesmart_analysis_report.html - Generated analysis report
  • Debug Files: response_{category}_{start}_{rows}.json - Raw JSON responses for debugging

Analysis Features

The generated HTML report includes:

  • Basic Statistics: Total products, total value, average price
  • Brand Analysis: Top 5 brands by product count
  • Recent Products: Sample of scraped products with details
  • Pricing Analysis: Price distribution and trends

Technical Details

API Endpoint

  • URL: https://www.pricesmart.com/api/br_discovery/getProductsByKeyword
  • Method: POST
  • Content-Type: application/json

Request Structure

The scraper sends properly formatted JSON payloads including:

  • Category codes
  • Pagination parameters
  • Authentication keys
  • Request metadata

Error Handling

  • Retry logic for connection failures (3 attempts)
  • Graceful handling of JSON parsing errors
  • Comprehensive logging for debugging
  • Rate limiting to prevent API overload

Database Schema

  • Uses SQLAlchemy ORM for data management
  • Auto-incrementing primary key
  • Automatic timestamp tracking for data freshness
  • Unique ID generation for each record
  • Duplicate detection and update logic

Ethical Scraping

This scraper is designed to be respectful to PriceSmart's servers:

  • Rate Limiting: 2-second delays between requests
  • Proper Headers: Uses appropriate User-Agent and headers
  • Session Management: Maintains proper session cookies
  • Error Recovery: Graceful handling of temporary failures
  • Data Usage: Intended for research and analysis purposes only

Contributing

Contributions are welcome! If you find any bugs or have suggestions for improvement, please open an issue or submit a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgements

  • This project was inspired by the need to analyze PriceSmart's product offerings and pricing
  • Special thanks to the developers of aiohttp, pandas, SQLAlchemy, and other libraries
  • Respectful API usage following best practices for data collection

Disclaimer

This scraper is for educational and research purposes only. Please respect PriceSmart's terms of service and use the data responsibly. The scraper is designed to be respectful to their servers and should not be used for commercial purposes without proper authorization.

This project has recently gained unexpected attention. It was created for personal, educational purposes ONLY.

  • DO NOT ABUSE THIS SCRIPT: Do not run it excessively or use it for commercial purposes.
  • RESPECT THE WEBSITE: Scraping places a load on a website's servers. This script includes a 10-second delay between requests to be respectful. Please do not remove it.
  • USE AT YOUR OWN RISK: The user is solely responsible for their use of this script. I (the author) am not responsible for any misuse, server overloads, IP bans, or any legal action that may result from its use. This project is provided as-is for educational demonstration.

Analysis Results

{{analysis_placeholder}}

PriceSmart Products Analysis Report

Basic Analysis

  • Total products scraped: 1126
  • Total value: $125,999.00
  • Average price: $111.90

Database Changes

  • New products added: 3
  • Existing products updated: 1123
  • Price changes detected: 57
  • Stock/availability changes: 6
  • Discontinued products: 5

Top 5 Brands

Brand Count
Member's Selection 187
142
Badia 18
Swiss 13
Kirkland Signature 12

Recent Products

Title Brand Price (TTD) Availability
Member's Selection Pepperoni Pizza 2 Units / 656 g / 1.44 lb Member's Selection $123.95 true
Member's Selection Supreme Pizza 2 Units / 716 g / 1.57 lb Member's Selection $123.95 true
Pop-Tarts Snack Crunchy Poppers Filled with Strawberry and Brownie 2 Units 360 g / 12.6 oz Pop Tarts $174.95 true
McCain Mac and Cheese Triangles 2 Units / 400 g / 14 oz McCain $119.95 true
Kellogg's Chocolatey Chip Waffles 2 Pack / 349 g / 12.3 oz Kellogg's $97.95 true
Swiss Miss Dark Chocolate Flavor Cocoa Powder 50 Units 31 g / 1 oz Swiss Miss $88.84 true
Krusteaz Butter and Honey Pancake Mix 8 Units / 907 g / 2 lb Krusteaz $209.95 true
Pafritas Potatoes with Black Truffle Flavor 500 g / 1.1 lb Pafritas $112.95 true
Bombolo Biscotti Decorated Cookies with Autumn Patterns 18 Units Bombolo Biscotti $73.95 true
My Mochi Peppermint Ice Cream 420 g / 15 oz My Mochi $99.95 true

PriceSmart Price Analysis Report

Price Change Summary (Last 30 Days)

  • Total price changes: 1379
  • Price increases: 749
  • Price decreases: 545
  • Average increase: 4.3%
  • Average decrease: -4.8%

Recent Price Changes

Product Old Price New Price Change % Change Type
Jackson's Sweet Potato Chips 454 g / 16 oz $0.00 $77.29 $+77.29 +100.0% New
Pork Belly with Skin Frozen Vacuum Packaged $174.82 $179.88 $+5.06 +2.9% Increase
Member's Selection Freshly Made Chicken Salad Wraps 4 Units $0.00 $65.95 $+65.95 +100.0% New
Frozen Bone In Pork Shoulder Sliced Tray $71.69 $71.79 $+0.10 +0.1% Increase
Frozen Lamb Leg Whole Boneless Tray Pack $313.82 $314.55 $+0.73 +0.2% Increase
Sundays Peanut Ice Cream 3.78 L / 128 oz $69.95 $58.95 $-11.00 -15.7% Decrease
Frozen Skinless Boneless Beef Shoulder Clod Steaks Tray $97.73 $97.57 $-0.16 -0.2% Decrease
Brunswick Canned Mackerel in Tomato Sauce 3 Units / 425 g $34.95 $27.95 $-7.00 -20.0% Decrease
Member's Selection Frozen Skinless Boneless Beef Shoulder Clod Roast Tray Pack $137.68 $137.76 $+0.08 +0.1% Increase
Frozen Sliced Turkey Drumsticks $147.59 $147.40 $-0.19 -0.1% Decrease
Fresh Chicken Mixed Parts Tray $89.90 $89.79 $-0.11 -0.1% Decrease
Pascual Greek Yogurt 8 Units / 125 g / 4.4 oz $57.95 $48.95 $-9.00 -15.5% Decrease
Klim Powdered Milk 1.6 kg / 56.4 oz $86.95 $109.95 $+23.00 +26.5% Increase
Fresh Ground Chicken Tray $84.42 $85.95 $+1.53 +1.8% Increase
Pier 33 Coho Salmon Steak Frozen, Skin on 1 kg / 2.2 lb $224.95 $179.95 $-45.00 -20.0% Decrease

Biggest Price Increases (All Time)

Product Old Price New Price % Increase
Tropical Frying Cheese 907 g / 32 oz $19.70 $89.95 +356.6%
Frozen Boneless Pork Loin Vacuum Packaged $56.90 $253.53 +345.6%
Sabra Classic Hummus 850 g / 30 oz $24.70 $79.95 +223.7%
Avocado 2 Units $9.70 $29.95 +208.8%
Kozyshack Cinnamon Rice Pudding 1.36 kg / 3 lb $29.70 $89.95 +202.9%
Belgioioso Fresh Mozzarella Snack Cheese 18 Units / 28 g / 1 oz $34.70 $89.95 +159.2%
Nestos Capers in Brine 2 Units / 350 g / 12.4 oz $29.70 $64.95 +118.7%
Black Seedless Grapes 907 g / 2 lb $29.70 $64.95 +118.7%
Curly's Baby Back Pork Ribs 680 g / 24 oz $59.70 $129.95 +117.7%
Green Seedless Grapes 907 g / 2 lb $29.70 $62.95 +112.0%

Biggest Price Decreases (All Time)

Product Old Price New Price % Decrease
Tropical Frying Cheese 907 g / 32 oz $89.95 $19.70 -78.1%
Sabra Classic Hummus 850 g / 30 oz $79.95 $24.70 -69.1%
Avocado 2 Units $29.95 $9.70 -67.6%
Kozyshack Cinnamon Rice Pudding 1.36 kg / 3 lb $87.95 $29.70 -66.2%
Pillsbury Cookie Dough Mix 1.3 kg / 3 lb $109.95 $39.70 -63.9%
Belgioioso Fresh Mozzarella Snack Cheese 18 Units / 28 g / 1 oz $89.95 $34.70 -61.4%
Black Seedless Grapes 907 g / 2 lb $64.95 $29.70 -54.3%
Curly's Baby Back Pork Ribs 680 g / 24 oz $129.95 $59.70 -54.1%
Green Seedless Grapes 907 g / 2 lb $62.95 $29.70 -52.8%
Leclerc Summer Cookies with Raspberry and Berry Flavor 2 Units / 300 g $41.70 $19.70 -52.8%

Recently Discontinued Products

Product Brand Last Known Price Discontinued Date
Fresh Regular Tomato $33.61 2025-10-20
Russet Potato 2.2 kg / 5 lb $22.95 2025-10-20
Bokchoy, 680 g / 1.3 lb $32.95 2025-10-20
Samyang Four Cheese and Ramen Flavor Instant Noodle Soup 5 Units / 140 g Samyang $69.95 2025-10-20
Fresh Kiwi 907 g / 2 lb $89.95 2025-10-20
Golden Harvest Stuffed Flounder Pacifica 340 g / 12 oz Golden Harvest $29.70 2025-10-17
Member's Selection Frozen Fruit Chunks 2.2 kg / 5 lb Member's Selection $129.95 2025-10-17
Cantaloupe 2 Units $45.96 2025-10-17
Gala Apples 1.36 kg / 3 lb $49.95 2025-10-17
Arcor Mint Hard Candies 2.26 Kg Arcor $89.95 2025-10-16

New Products Added Today

Product Brand Price Category
Jackson's Sweet Potato Chips 454 g / 16 oz Jackson's $77.29 G10D03
Member's Selection Freshly Made Chicken Salad Wraps 4 Units Member's Selection $65.95 G10D03
Quaker Quick Oats 2 Units / 1.13 kg / 40 oz Quaker $85.95 G10D03

About

This is an automated Script that webscrapes grocery items from Pricesmart

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published