added scope for browser session pull task and cleanup
This commit is contained in:
22
pm/tasks.org
22
pm/tasks.org
@@ -254,6 +254,28 @@
|
||||
- commit: `c0054dc` on branch `cx`
|
||||
- tests: `./venv/bin/python -m unittest discover -s tests`; `./venv/bin/python scrape_costco.py --help`; verified Costco summary/detail flattening now uses composite receipt keys in unit tests
|
||||
- date: 2026-03-16
|
||||
* [ ] t1.8.6: add browser session helper (2-4 commits)
|
||||
|
||||
** acceptance criteria
|
||||
- create a separate Python module/script that extracts firefox browser session data needed for giant and costco scrapers.
|
||||
- support Firefox and Costco first, including:
|
||||
- loading cookies via existing browser-cookie approach
|
||||
- reading browser storage needed for dynamic auth headers (e.g. Costco bearer token)
|
||||
- copying locked browser sqlite/db files to a temp location before reading when necessary
|
||||
- expose a small interface usable by scrapers, e.g. cookie jar + storage/header values
|
||||
- keep retailer-specific parsing of extracted session data outside the low-level browser access layer
|
||||
- structure the helper so Chromium-family browser support can be added later without changing scraper call sites
|
||||
|
||||
** notes
|
||||
- goal is to replace manual `.env` copying of volatile browser-derived auth data
|
||||
- session bootstrap only, not full browser automation
|
||||
- prefer one shared helper over retailer-specific ad hoc storage reads
|
||||
- Firefox only; Chromium support later
|
||||
|
||||
** evidence
|
||||
- commit:
|
||||
- tests:
|
||||
- date:
|
||||
* [ ] t1.9: compute normalized comparison metrics (2-4 commits)
|
||||
|
||||
** acceptance criteria
|
||||
|
||||
Reference in New Issue
Block a user