PDF Reader

i am work on a solution that require me to read govt issue pdfs. the pdfs have the standard govt issue quirks, un structured, items mix up, layered, un standard bullet poiints. etc. some data has to be extract from them. standard py script reader does agood job but fails in unstructured lines. deepseek ocr needs py3.12 it seems, im py3.10. venv could be option, yes. moondreams or something else, glm ocr, what i do?

2 Likes