chore: bootstrap repository with governance docs
Initialize git, add Apache-2.0 LICENSE, .gitattributes (LF line endings), AGENTS.md (entry points, stack, discovery order, baseline checks), RUNBOOK.md (dev boot, prod deploy with overlay, ingestion, failures, rollback, scaling notes), .env.prod.example with rotated credential placeholders, and dev-only warnings on .env.example. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
12
app/ingestion/normalizer.py
Normal file
12
app/ingestion/normalizer.py
Normal file
@@ -0,0 +1,12 @@
|
||||
"""Block-level normalization wrappers around utils.text_cleaning."""
|
||||
|
||||
from __future__ import annotations
|
||||
|
||||
from app.utils.text_cleaning import clean_ocr_text, normalize_for_search
|
||||
|
||||
|
||||
def normalize_block(text: str) -> tuple[str, str]:
|
||||
"""Return ``(display_text, normalized_text)``."""
|
||||
display = clean_ocr_text(text)
|
||||
norm = normalize_for_search(display)
|
||||
return display, norm
|
||||
Reference in New Issue
Block a user