pc-part-dataset
pc-part-dataset copied to clipboard
A dataset of PC parts scraped from PCPartPicker
PC Part Dataset
A dataset of PC parts scraped from PCPartPicker.
Part count: 60,506
Last updated: March 27, 2024
Download
The parts are available in JSON, JSON Lines, and CSV format. You can find them in the ./data
directory.
JSON Property Descriptions
Check out API.md for JSON property descriptions of each product category.
Contents
-
General
- CPUs
- CPU Coolers
- Motherboards
- Memory
- Storage
- Video Cards
- Cases
- Power Supplies
- Optical Drives
- Operating Systems
- Monitors
- External Storage
-
Accessories / Other
- Case Accessories
- Case Fans
- Fan Controllers
- Thermal Compound
- UPS Systems
-
Expansion Cards / Networking
- Sound Cards
- Wired Network Adapters
- Wireless Network Adapters
-
Peripherals
- Headphones
- Keyboards
- Mice
- Speakers
- Webcams
Running the Scraper
Warning Make sure to turn on your VPN before running this scraper!
- Run
npm install
- Run
npm run start
- Wait about 5-10 minutes for the scraper to finish doing its thing.
- Everything will be emitted to a directory named
data-staging
.
If you don't want to scrape every part, you can specify individual parts by passing them as arguments. You can find the list of part type IDs in API.md.
Example usage:
# Only get CPUs and video cards
npm run start -- cpu video-card
package.json
Scripts
-
start
=> Starts the scraper. -
dev
=> Starts the scraper without typechecking. -
count
=> Runscounter.ts
; gets the number of parts from./data-staging/json
or argv[1]. -
output
=> Runsoutput.ts
; outputs JSONL and CSV formats to./data-staging
or argv[1]. -
zip
=> Zips the JSON, JSONL, and CSV folders in./data
.
License
MIT