Amazon Orders Dataset Drops
A dataset described as real Amazon US orders, with 1,000 anonymized line items, is now available on Hugging Face. It is pitched at basket analysis, demand forecasting, and pricing models that fulfillment operators can use. Open data of this kind can speed up ML experiments for e-commerce tenants and 3PLs. (x.com)
The Hugging Face repository shows two related uploads by user saikumarchunduri, titled amazon_sales_first1000_rows_dataset and 1000_subset_amazon_sales_dataset, each listed as a CSV-format dataset on the Hugging Face Datasets Hub (huggingface.co). The amazon_sales_first1000_rows_dataset entry explicitly requires users to "agree to share your contact information" before accessing files, and its page recorded 9 downloads in the last month (huggingface.co).
The 1000_subset_amazon_sales_dataset viewer exposes a 1k-row train split. Its preview shows a sample row for a Wayona USB-to-Lightning cable priced at ₹399, discounted from ₹1,099, with a 4.2-star rating based on 24,269 reviews (huggingface.co). The same viewer page lists the dataset format as CSV and notes that the Dataset Viewer auto-converts files to Parquet and provides a built-in SQL console and an API embed for remote querying (huggingface.co).
For scale context, Hugging Face also hosts much larger Amazon-derived collections: McAuley Lab's Amazon-Reviews-2023 contains roughly 571.5 million reviews, which underscores that the saikumarchunduri uploads are small, sample-sized datasets by comparison (huggingface.co).
Hugging Face's Datasets library documents one-line loading with load_dataset(...) and supports remote querying and local conversion to Parquet or DuckDB. These are concrete tooling paths for turning a CSV/Parquet product or order file into SQL-ready tables for basket or SKU-level analysis (huggingface.co).
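As a rough illustration of that CSV-to-SQL path, the sketch below loads rows into a queryable table and computes a discount percentage. Several things here are assumptions rather than facts from the dataset page: the column names (product_name, discounted_price, actual_price, rating, rating_count) are hypothetical, the REPO_ID string is pieced together from the uploader and dataset names above, and Python's built-in sqlite3 is swapped in for DuckDB so the example runs offline with only the standard library. The load_hub_rows helper shows where load_dataset would fit, but it is not called, since it needs the datasets package, network access, and possibly Hub authentication. The single sample row reuses the preview values quoted above.

```python
import csv
import io
import sqlite3

# Assumed repo id (uploader/dataset name); verify on the Hub before using it.
REPO_ID = "saikumarchunduri/1000_subset_amazon_sales_dataset"


def load_hub_rows(repo_id):
    """Fetch the train split from the Hub as a list of dicts.

    Not called below: it needs `pip install datasets` and network access.
    """
    from datasets import load_dataset  # lazy import keeps the rest offline

    return [dict(row) for row in load_dataset(repo_id, split="train")]


def parse_rupees(text):
    """Turn a price string such as '₹1,099' into a float."""
    return float(text.replace("₹", "").replace(",", "").strip())


def parse_count(text):
    """Turn a count string such as '24,269' into an int."""
    return int(text.replace(",", "").strip())


def build_table(rows):
    """Load rows into an in-memory SQLite table (stand-in for DuckDB)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE products ("
        "product_name TEXT, discounted_price REAL, actual_price REAL, "
        "rating REAL, rating_count INTEGER)"
    )
    conn.executemany(
        "INSERT INTO products VALUES (?, ?, ?, ?, ?)",
        (
            (
                row["product_name"],  # hypothetical column names throughout
                parse_rupees(row["discounted_price"]),
                parse_rupees(row["actual_price"]),
                float(row["rating"]),
                parse_count(row["rating_count"]),
            )
            for row in rows
        ),
    )
    return conn


# One row built from the preview values quoted in the article.
SAMPLE_CSV = (
    "product_name,discounted_price,actual_price,rating,rating_count\n"
    'Wayona USB-to-Lightning cable,₹399,"₹1,099",4.2,"24,269"\n'
)

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
conn = build_table(rows)

# SKU-level query: discount as a percentage of the list price.
name, discount_pct = conn.execute(
    "SELECT product_name, "
    "ROUND(100.0 * (actual_price - discounted_price) / actual_price, 1) "
    "FROM products"
).fetchone()
print(name, discount_pct)
```

To run the same query against the real file, the rows from load_hub_rows(REPO_ID) could be passed to build_table once the actual column names are confirmed in the dataset viewer.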