Files
LegacyHUB/app/ingestion/normalizer.py
Vadim Malanov 7f72171572 chore: bootstrap repository with governance docs
Initialize git, add Apache-2.0 LICENSE, .gitattributes (LF line
endings), AGENTS.md (entry points, stack, discovery order, baseline
checks), RUNBOOK.md (dev boot, prod deploy with overlay, ingestion,
failures, rollback, scaling notes), .env.prod.example with rotated
credential placeholders, and dev-only warnings on .env.example.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-13 16:41:50 +03:00

13 lines
387 B
Python

"""Block-level normalization wrappers around utils.text_cleaning."""
from __future__ import annotations
from app.utils.text_cleaning import clean_ocr_text, normalize_for_search
def normalize_block(text: str) -> tuple[str, str]:
"""Return ``(display_text, normalized_text)``."""
display = clean_ocr_text(text)
norm = normalize_for_search(display)
return display, norm