lazyflat

EiSiMo 0aa4c6c2bb enrichment: drop LLM for structured info, dedup images by sha + phash

Per user request, the LLM is no longer asked to extract rooms/size/rent/WBS; those come from the inberlinwohnen.de scraper, which is reliable. Haiku is now used for one narrow job: pick which <img> URLs from the listing page are actual flat photos (vs. logos, badges, ads, employee portraits). On any LLM failure the unfiltered candidate list passes through.

Image dedup runs in two tiers:
1. SHA256 of bytes: drops different URLs that point to byte-identical files
2. Perceptual hash (Pillow + imagehash, Hamming distance ≤ 5): drops the "same image at a different resolution" duplicates from srcset / CDN variants that were filling galleries with 2–4× copies

UI:
- Wohnungsliste falls back to scraper-only display (rooms/size/rent/wbs)
- Detail panel only shows images + "Zur Original-Anzeige →"; description / features / pros & cons / kv table are gone
- Per-row "erneut versuchen" link + the "analysiert…/?" status chips were tied to LLM extraction and are removed; the header "Bilder nachladen (N)" button still surfaces pending/failed batches for admins

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>		2026-04-21 15:29:55 +02:00
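
The two-tier image dedup described in the commit message could look roughly like the sketch below. It is not the code from enrichment.py: the names `dedup_images`, `hamming`, and the `phash` callable are hypothetical, and the perceptual hash is passed in as a plain integer so the sketch runs without Pillow or imagehash installed. In the real project that integer would presumably be derived from an imagehash perceptual hash of the decoded image.

```python
import hashlib
from typing import Callable, Iterable, List, Set, Tuple


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two integer hashes."""
    return bin(a ^ b).count("1")


def dedup_images(
    images: Iterable[Tuple[str, bytes]],
    phash: Callable[[bytes], int],
    max_distance: int = 5,
) -> List[str]:
    """Return the URLs to keep, in input order.

    images: (url, raw_bytes) pairs (hypothetical input shape).
    phash:  assumed helper mapping image bytes to an integer
            perceptual hash; supplied by the caller.
    """
    seen_sha: Set[str] = set()
    kept_hashes: List[int] = []
    kept_urls: List[str] = []

    for url, data in images:
        # Tier 1: exact duplicates (different URLs, byte-identical files).
        digest = hashlib.sha256(data).hexdigest()
        if digest in seen_sha:
            continue
        seen_sha.add(digest)

        # Tier 2: near duplicates (e.g. the same photo at another resolution).
        h = phash(data)
        if any(hamming(h, k) <= max_distance for k in kept_hashes):
            continue

        kept_hashes.append(h)
        kept_urls.append(url)

    return kept_urls
```

Running the cheap SHA256 check first means the more expensive perceptual hash is only computed for files that are not byte-identical to one already seen. The tier-2 comparison is linear in the number of kept images, which should be acceptable for a gallery of a few dozen photos.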
..
static	enrichment: Haiku flat details + image gallery on expand	2026-04-21 14:46:12 +02:00
templates	enrichment: drop LLM for structured info, dedup images by sha + phash	2026-04-21 15:29:55 +02:00
app.py	enrichment: drop LLM for structured info, dedup images by sha + phash	2026-04-21 15:29:55 +02:00
apply_client.py	multi-user: users, per-user profiles/filters/notifications, tab UI, apply forensics	2026-04-21 10:52:41 +02:00
auth.py	multi-user: users, per-user profiles/filters/notifications, tab UI, apply forensics	2026-04-21 10:52:41 +02:00
db.py	wohnungen: drive list info from LLM JSON, tidy header	2026-04-21 14:57:11 +02:00
Dockerfile	lazyflat: combined alert + apply behind authenticated web UI	2026-04-21 09:51:35 +02:00
enrichment.py	enrichment: drop LLM for structured info, dedup images by sha + phash	2026-04-21 15:29:55 +02:00
llm.py	enrichment: drop LLM for structured info, dedup images by sha + phash	2026-04-21 15:29:55 +02:00
matching.py	multi-user: users, per-user profiles/filters/notifications, tab UI, apply forensics	2026-04-21 10:52:41 +02:00
notifications.py	map debug + coord backfill, remove email channel, countdown label	2026-04-21 13:42:21 +02:00
requirements.txt	enrichment: drop LLM for structured info, dedup images by sha + phash	2026-04-21 15:29:55 +02:00
retention.py	multi-user: users, per-user profiles/filters/notifications, tab UI, apply forensics	2026-04-21 10:52:41 +02:00
settings.py	enrichment: Haiku flat details + image gallery on expand	2026-04-21 14:46:12 +02:00