updated readme

This commit is contained in:
2026-05-09 00:02:24 -04:00
parent f42183eeda
commit 771f11fd3c

View File

@@ -94,15 +94,15 @@ We selected gpt-5.4-mini for a good balance of quality, cost, and time.
- Each scraped forum is saved to `output/<forum-id>.jsonl` - Each scraped forum is saved to `output/<forum-id>.jsonl`
- Each report (forum + prompt) is saves to `reports/<forum-id-N>.json` - Each report (forum + prompt) is saves to `reports/<forum-id-N>.json`
- Each job is saved to `analysis/jobs/<report-id>`: - Each job is saved to `analysis/jobs/<report-id>`:
└─`forum.jsonl` is a copy of the scraped forum for convenience └─`forum.jsonl` is a copy of the scraped forum for convenience
└─`prompt.txt` is a copy of the prompt used └─`prompt.txt` is a copy of the prompt used
└─`report.json` is a copy of the report used └─`report.json` is a copy of the report used
└─`status.json` contains metadata about the job └─`status.json` contains metadata about the job
For each batch in the job, four files are created: For each batch in the job, four files are created:
└─`jobN-input.jsonl` contains the exact queries sent to the API, for troubleshooting └─`jobN-input.jsonl` contains the exact queries sent to the API, for troubleshooting
└─`jobN-output-raw.jsonl` contains the exact response from the API └─`jobN-output-raw.jsonl` contains the exact response from the API
└─`jobN-output.jsonl` contains the exact response from the API └─`jobN-output.jsonl` contains the exact response from the API
└─`jobN-output-errors.jsonl` when errors are returned (this file may not exist) └─`jobN-output-errors.jsonl` when errors are returned (this file may not exist)
- Once complete, the cleanup script saves `review.csv`, `review.pqt`, and `review.sqlite` in this folder. - Once complete, the cleanup script saves `review.csv`, `review.pqt`, and `review.sqlite` in this folder.