
i analyzed with AI my 36gb~ that I was able to download before they erased the zip file from the server.
Complete Volume Analysis
Based on the OPT metadata file, here's what VOL00009 was supposed to contain:
Full Volume Specifications
- Total Bates-numbered pages: 1,223,757 pages
- Total unique PDF files: 531,307 individual PDFs
- Bates number range: EFTA00039025 to EFTA01262781
- Subdirectory structure: IMAGES\0001\ through IMAGES\0532\ (532 folders)
- Expected size: ~180 GB (based on your download info)
What You Actually Got
- PDF files received: 90,982 files
- Subdirectories: 91 folders (0001 through ~0091)
- Current size: 37 GB
- Percentage received: ~17% of the files (91 out of 532 folders)
The Math
Expected: 531,307 PDF files / 180 GB / 532 folders
Received: 90,982 PDF files / 37 GB / 91 folders
Missing: 440,325 PDF files / 143 GB / 441 folders
★ Insight ─────────────────────────────────────
You got approximately the first 17% of the volume before the server deleted it. The good news is that the DAT/OPT index files are complete, so you have a full manifest of what should be there. This means:
- You know exactly which documents are missing (folders 0092-0532)
I haven’t looked into downloading the partials from archive.org yet to see if I have any useful files that archive.org doesn’t have yet from dataset 9.
for DS9, does anyone have the following files:
If so, please DM me them and then I can include them in my master archive.