From 490c642bd9252c7d18cc320c6e65019efc5d0e65 Mon Sep 17 00:00:00 2001 From: eulaly Date: Tue, 5 May 2026 15:03:25 -0400 Subject: [PATCH] added timestamp to tasks --- agents.md | 2 +- docs/tasks.org | 9 ++++++--- 2 files changed, 7 insertions(+), 4 deletions(-) diff --git a/agents.md b/agents.md index 0356fd7..03c0081 100644 --- a/agents.md +++ b/agents.md @@ -36,5 +36,5 @@ Description and PM notes ** evidence - commit: - tests: -- datetime: +- date: [2026-05-05 Tue 15:00] ``` diff --git a/docs/tasks.org b/docs/tasks.org index 8e7ccf1..4c6a0f6 100644 --- a/docs/tasks.org +++ b/docs/tasks.org @@ -31,7 +31,7 @@ Comments are hydrated in backend via js-cued button (AJAX?). - tests: 8 passing (`python -m pytest tests -q`) or (`python -m pytest tests/`) - `scrapy crawl forum -a forum_id=452 -s LOG_LEVEL=WARNING 2>&1` - retrieved 9083 comments -- datetime: 2026-05-05 +- datetime: [2026-05-05 Tue 14:00] * [ ] t1.2: initial 4o sentiment Write a simple manual pipeline for gpt-4o that reads one scraped forum jsonl file and roduces a separate analyzed jsonl file. this step must not mutate scraper output. analysis should classify each comment for regulatory stance, generic tone/sentiment, confidence, and enough rationale/evidence to support later dashboard drilldown. @@ -67,14 +67,17 @@ Should be run manually, separate from scraper. You may use scrapy, but are not r - MAX_COMMENT_CHARS=6000: covers >99% without truncation; outliers (e.g. 18k-char law firm brief) flagged with truncated=True. ** evidence -- commit: +- commit: d834d18 - tests: 20 passing (pytest tests/test_gpt4o_analysis.py), 28 total across suite python ./analysis/gpt4o/analysis.py --limit 5 ./output/f452.jsonl -- date: [2026-05-05] +- date: [2026-05-05 Tue 15:00] * [ ] t1.2.1: 4o with batch processing ** acceptance criteria 1. input scraped jsonl doc by filename/path, and process the whole thing via batch processing + +** notes + ** evidence - commit: - tests: