How to convert a PNG, JPEG or JPG image of a document to MS Word?
I'm using pytesseract and docx, but it doesn't work.
I also need to keep the document layout, the way ABBYY FineReader does.
Please help me.
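import pytesseract
from PIL import Image
from docx import Document
mydoc = Document()
text1 = pytesseract.image_to_alto_xml(Image.open("image.png"), lang='mon+eng', config='-c preserve_interword_spaces=1')
mydoc.add_paragraph(text1)  # raises the TypeError below: text1 is bytes, not str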
The error message is:
File "/home/codelex/anaconda3/envs/ganbaagpu/lib/python3.7/site-packages/docx/oxml/text/run.py", line 156, in add_char
    elif char in '\n\r':
TypeError: 'in <string>' requires string as left operand, not int
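The traceback suggests that image_to_alto_xml returned bytes rather than str: iterating over bytes yields integers, and add_char then tests each one with `in '\n\r'`. Below is a minimal sketch of a workaround, assuming the OCR output is UTF-8 encoded. The helper name to_text is hypothetical and not part of pytesseract or docx.

```python
def to_text(ocr_output, encoding="utf-8"):
    """Return OCR output as str, decoding it first if it is bytes.

    Hypothetical helper; the UTF-8 default is an assumption.
    """
    if isinstance(ocr_output, bytes):
        return ocr_output.decode(encoding)
    return ocr_output


# Iterating over bytes yields ints, which is what the TypeError complains about.
sample = "сайн hello".encode("utf-8")
print(type(sample[0]))           # an int, not a one-character string
print(type(to_text(sample)[0]))  # a str after decoding
```

With this, mydoc.add_paragraph(to_text(text1)) should no longer raise this TypeError, although the paragraph would then contain raw ALTO XML. pytesseract.image_to_string returns plain text as str and may be the better fit if only the text is needed.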
Are there any other suggestions for converting an image to MS Word?
I need to keep the document formatting.
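One possible direction for keeping at least the paragraph and line structure is sketched below. It assumes pytesseract.image_to_data is called with output_type=pytesseract.Output.DICT and that the words are regrouped by its block_num, par_num and line_num fields. The function names group_paragraphs and image_to_docx are hypothetical. This only approximates the layout (no fonts, columns or tables), so it is not a FineReader equivalent.

```python
def group_paragraphs(data):
    """Group OCR words into paragraphs, one text line per OCR line.

    `data` is assumed to have the dict layout returned by
    pytesseract.image_to_data(..., output_type=pytesseract.Output.DICT).
    """
    paragraphs = {}  # (page, block, paragraph) -> {line number -> [words]}
    for i, word in enumerate(data["text"]):
        if not word.strip():
            continue  # entries for pages, blocks and lines carry no text
        key = (data["page_num"][i], data["block_num"][i], data["par_num"][i])
        lines = paragraphs.setdefault(key, {})
        lines.setdefault(data["line_num"][i], []).append(word)
    return [
        "\n".join(" ".join(words) for words in lines.values())
        for lines in paragraphs.values()
    ]


def image_to_docx(image_path, docx_path, lang="mon+eng"):
    """OCR an image and write one Word paragraph per detected paragraph."""
    # Imported here so group_paragraphs stays usable without these packages.
    import pytesseract
    from PIL import Image
    from docx import Document

    data = pytesseract.image_to_data(
        Image.open(image_path), lang=lang, output_type=pytesseract.Output.DICT
    )
    doc = Document()
    for text in group_paragraphs(data):
        doc.add_paragraph(text)  # str input; "\n" becomes a line break
    doc.save(docx_path)
```

A call such as image_to_docx("image.png", "output.docx") would then produce the Word file, assuming Tesseract and the mon and eng language data are installed.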