XpdImageExtract is a very affordable developer's library/SDK that makes it easy for a programmer to gain access to the images in PDF files. It can extract the images to memory or disk, as well as simply generating a list of the images along with their size and color-space information.
To clarify: XpdfImageExtract works with the raw bitmaps of images placed in a PDF file. It does not convert vector graphics or text to bitmaps.
XpdfImageExtract is easy to use.
#include "XpdfImageExtract.h"
PDFHandle pdf;
int i, mask, colorSpace, w, h;
err = pdfLoadFile(&pdf, "c:/test/file.pdf");
if (err != pdfOk) {
/* handle the error */
}
/* extract images on page 1 */
pdfGetImages(pdf, 1);
for (i = 0; i < pdfGetNumImages(pdf); ++i) {
pdfGetImageInfo(pdf, i, &mask, &colorSpace, &w, &h);
printf(" %d: %s %d x %d\n", i, mask ? "mask" : "image", w, h);
}
THIS PARTICULAR PRODUCT RUNS ON WINDOWS COMPUTERS. HOWEVER, MAC AND LINUX SHARED LIBRARIES ARE ALSO AVAILABLE; PORTABLE C++ SOURCE CODE IS AVAILABLE TOO. 32-BIT AND 64-BIT VERSIONS ARE AVAILABLE FOR ALL PLATFORMS. CONTACT US FOR DETAILS.