It is. NET platform files on extraction framework, mainly to solve the problem of extracting the contents in various formats, such as PDF, doc, docx, xls, xlsx, and so on.
wsmwlh
2016-08-23
00
1
Extract text from Word, and PowerPoint
no vote
A simple tool for reading Office files, without using Microsoft Office software and Interop extraction. contents of the doc,.docx,.ppt,.pptx file, and summary information.
wsmwlh
2016-08-23
00
1
PDF text extraction c # source
4.0
1. Use the function of PDFtoTEXT in xpdf3.04 to extract the text in PDF file; 2. PDFtoTEXT was originally a console application, changed to C + + / CLR interface, which is convenient for. Net call; 3. Add the interface of PDFtoTEXT extraction parameter setting, which can be set when. Net call; 4. Extract the Chinese text in PDF, see the example for setting; 5. All the source codes are tested under the Chinese professional version of VS2008.