date | export_on_save | html | print_background | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
2022-04-21 |
|
|
false |
This example demonstrates two methods for extracting tables from a PDF document using Microsoft Excel Power Query or Microsoft Power BI Desktop.
- The
PDF.Tables
method is demonstrated in the QueryIRSW2AppendixF and QueryIRSW2AppendixG queries. - The
XML.Document
method is demonstrated in the QueryIRSW2RecordTypes query.
- IRS EFW2 Publication 42-007, tax year 2021 version 2
- Microsoft VS Code, version 1.66.2
- Markdown Preview Enhanced extension, version 0.6.2
- Adobe Acrobat DC
- Microsoft Excel, version 2203 build 16.0.15028.20218 64-bit
- Microsoft Power BI Desktop, version 2.103.661.0 64-bit (March 2022)
- Microsoft Windows 10 Pro, version 10.0.19044 build 19044
The location of the IRS W-2 specification PDF document.
The location of the IRS W-2 specification PDF-XML document.
A query that imports a PDF document and creates a table from Appendix F found in the IRS W-2 specification.
A query that imports a PDF document and creates a table from Appendix G found in the IRS W-2 specification.
A query that imports a PDF-XML document and creates a table from the W-2 record types and their field specifications that are found in sections 4.5 through 4.13 of the IRS W-2 specification.