Linux systems have pdftotext
which I had reasonable success with. By default, it creates foo.txt
from a give foo.pdf
.
That said, the text mining packages may have converters. A quick rseek.org search seems to concur with your crantastic search.
与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…