When a PDF file contains tables, you may need to convert it to Excel for further processing. In this article, you will learn how to convert each PDF page to a separate Excel worksheet, as well as how to convert multiple PDF pages to one Excel worksheet, by using Free Spire.PDF for Java.
Two Methods to Import the JAR Dependency
Method 1: Download the free library and unzip it, then add the Spire.Pdf.jar file to your project as a dependency.
Method 2: Add the JAR dependency directly to a Maven project by adding the following configuration to pom.xml.
Convert a PDF File Containing Three Pages to Three Excel Worksheets
Step 1: Create a PdfDocument object.
Step 2: Load a sample PDF file using the PdfDocument.loadFromFile() method.
Step 3: Save the PDF file to Excel using the PdfDocument.saveToFile() method.
import com.spire.pdf.FileFormat;
import com.spire.pdf.PdfDocument;

public class ToXLS {
    public static void main(String[] args) {
        //Create a PdfDocument object
        PdfDocument pdf = new PdfDocument();
        //Load a sample PDF file ("sample.pdf" is a placeholder path; use your own file)
        pdf.loadFromFile("sample.pdf");
        //Save to Excel; each PDF page becomes its own worksheet
        pdf.saveToFile("output/ToExcel.xlsx", FileFormat.XLSX);
    }
}
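The example saves to a relative path inside an output folder. This article does not say whether the library creates a missing folder, so the following minimal sketch assumes it may not and creates the folder before saving. It uses only the Java standard library. OutputPathHelper and prepareOutput are hypothetical names chosen for illustration and are not part of Free Spire.PDF for Java.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper, not part of Free Spire.PDF for Java.
public class OutputPathHelper {

    // Creates the parent folder of the given output file if it is missing
    // and returns the normalized absolute path of that file.
    public static Path prepareOutput(String outputFile) throws IOException {
        Path target = Paths.get(outputFile).toAbsolutePath().normalize();
        Path parent = target.getParent();
        if (parent != null) {
            // Does nothing when the folder already exists
            Files.createDirectories(parent);
        }
        return target;
    }

    public static void main(String[] args) throws IOException {
        Path target = prepareOutput("output/ToExcel.xlsx");
        System.out.println("Ready to write: " + target);
    }
}
```

The returned path can be passed on as a string, for example pdf.saveToFile(target.toString(), FileFormat.XLSX).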
Convert a PDF File Containing Three Pages to One Excel Worksheet
Free Spire.PDF for Java offers the PdfDocument.getConvertOptions().setConvertToOneSheet(true) method, which sets the PDF to XLSX conversion options so that multiple PDF pages are rendered on one Excel worksheet.
import com.spire.pdf.*;

public class ManyPagesToOneSheet {
    public static void main(String[] args) {
        //Create a PdfDocument object
        PdfDocument pdf = new PdfDocument();
        //Load a sample PDF file ("sample.pdf" is a placeholder path; use your own file)
        pdf.loadFromFile("sample.pdf");
        //Convert multiple PDF pages to one Excel worksheet
        pdf.getConvertOptions().setConvertToOneSheet(true);
        //Save to Excel
        pdf.saveToFile("output/ToOneSheet.xlsx", FileFormat.XLSX);
    }
}
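Both examples start by loading a PDF from disk. As an optional precaution, the input can be checked before it is passed to loadFromFile(). The sketch below uses only the Java standard library and tests whether a file exists and begins with the "%PDF-" signature. PdfInputCheck and looksLikePdf are hypothetical names, not library API, and the check is deliberately strict: a file with stray bytes before the signature is rejected even though some PDF readers tolerate it.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical pre-check, not part of Free Spire.PDF for Java.
public class PdfInputCheck {

    // Returns true when the path is a regular file whose first bytes are "%PDF-".
    public static boolean looksLikePdf(Path file) throws IOException {
        if (!Files.isRegularFile(file)) {
            return false;
        }
        byte[] expected = "%PDF-".getBytes(StandardCharsets.US_ASCII);
        byte[] actual = new byte[expected.length];
        try (InputStream in = Files.newInputStream(file)) {
            int total = 0;
            // Keep reading until the buffer is full or the file ends
            while (total < actual.length) {
                int n = in.read(actual, total, actual.length - total);
                if (n < 0) {
                    break;
                }
                total += n;
            }
            return total == expected.length && Arrays.equals(expected, actual);
        }
    }

    public static void main(String[] args) throws IOException {
        // "sample.pdf" is a placeholder path
        Path input = Paths.get(args.length > 0 ? args[0] : "sample.pdf");
        System.out.println(input + " looks like a PDF: " + looksLikePdf(input));
    }
}
```

Running the check first lets a program report a missing or mistyped input path in its own words instead of relying on whatever error the conversion step raises.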