fix(ocr): improve text layer alignment with width-based font sizing

- Create new hocr-transform.ts utility for parsing hOCR output
- Add line-aware text processing with baseline and rotation support
- Implement width-based font size calculation to match word bounding boxes
- Fix text selection not covering full characters issue
- Add proper type definitions for OcrLine, OcrPage, WordTransform
- Support RTL languages and CJK word break handling
This commit is contained in:
abdullahalam123
2026-01-10 13:09:52 +05:30
parent 1f7238d0b5
commit c5799954dc
3 changed files with 887 additions and 461 deletions

File diff suppressed because it is too large Load Diff