Microsoft Idea

Power BI

Completed

Tables in PDF files

Votes: 3044

Gogula Aryalingam on 17 Jan 2015 03:15:10

I have come across many public data repositories that hold data in PDF format. Other websites publish tables inside documents such as annual reports, also in PDF format. A data source for PDFs, or for tables from PDFs, would be awesome!

Administrator on 13 Apr 2019 00:56:11

The PDF connector is now generally available in the April release of Power BI Desktop. Learn more here: https://powerbi.microsoft.com/en-us/blog/power-bi-desktop-april-2019-feature-summary/#pdf

Comments (284)

Mattia Russo on 05 Jul 2020 23:37:22

RE: Tables in PDF files

Hello everybody,

The PDF connector works only in Power BI Desktop. When I try to use a gateway on a dataset that uses the PDF connector, the gateway doesn't work! Are you working on it? When will this bug be fixed?
Thanks in advance!
Mattia

Tiffany on 05 Jul 2020 23:36:00

RE: Tables in PDF files

This is great, but I've come across cases where the data becomes corrupted and produces errors in the editor.

My source is a folder, and I have 2 PDFs sent to me daily that I drop in that folder. The PDFs are identical except for the dollar amounts. Intermittently, Power BI corrupts one or a few of the documents when I refresh the dashboard.


DirkV on 05 Jul 2020 23:35:24

RE: Tables in PDF files

The feature is working fine in my applications.
When will this ship with Excel Get & Transform? I prepare my data in Excel and have to log the imported data.

Terry on 05 Jul 2020 23:35:24

RE: Tables in PDF files

It would be good if this could also read the data from form fields within the PDF. I believe they may be in FDF format. Since they are named fields, it should be relatively easy to show a list of fields by column and let you import them like other files. Currently the data is not imported at all.

Sharon Maxon on 05 Jul 2020 23:34:00

RE: Tables in PDF files

Beyond just importing a chart from a PDF, we need to be able to import a chart from a collection of PDFs with a consistent format in a folder. For example, a report in a standardized format is received on a weekly basis. We need to be able to save the PDFs to a SharePoint Online folder and then let Power BI find each chart and append them together. This is a powerful feature that works for multiple Excel files in a folder, so please replicate the same with PDFs.

Power BI Ideas Admin on 05 Jul 2020 23:30:43

RE: Tables in PDF files

Can this read handwritten tables if they are scanned and then saved as a PDF file?

KRIS WILLISON on 05 Jul 2020 23:29:36

RE: Tables in PDF files

SEPT 2018 UPDATE: I am testing the PDF Import/Connector and have already found minor issues. Who do I report them to, and how and where?


IN BRIEF - I have a 51-page "sample data" G/L ledger. PBI is not bringing in column headers, which is not a big deal, but in skipping the headers it is merging any data where there is only 1 space between columns. EXAMPLE: PERIOD & SOURCE of 1 PJ became 1PJ, and ACCOUNT_NUM & ACCOUNT_DESC of 21200 TRADE COLLECTORS became 21200TRADE COLLECTORS. These 2 are easy enough to fix with "split columns".

HOWEVER, AMOUNT & DESCRIPTION were also merged, so instead of -409.09 Pre-conversion purchase, I have -409.09Pre-conversion purchase. Not every amount has a decimal point, and the total number of digits in an amount can vary. While it is highly unlikely that our company will connect to PDFs on a regular basis, we feel that this is an important feature for PBI.

Our own software has no problems with this sample file. Tableau merges fields the same way as PBI, but at least it leaves the space so that the fields can be "split".
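
For anyone hitting the same merged AMOUNT/DESCRIPTION values, here is a rough sketch of how the two parts could be separated outside of Power BI. It is written in Python rather than Power Query, the function name `split_amount_description` is made up for illustration, and it assumes every value starts with an optionally signed number (with or without a decimal part) and has no thousands separators. It cannot recover a description that itself begins with a digit, because the missing space makes that case ambiguous.

```python
import re
from decimal import Decimal

# Hypothetical pattern: an optionally signed number with an optional decimal
# part at the start of the string, then whatever text follows it.
_AMOUNT_THEN_TEXT = re.compile(r"^\s*([-+]?\d+(?:\.\d+)?)\s*(.*)$")


def split_amount_description(merged):
    """Split a merged value such as '-409.09Pre-conversion purchase'.

    Returns (amount, description). If the value does not start with a
    number, amount is None and the input is returned unchanged.
    """
    match = _AMOUNT_THEN_TEXT.match(merged)
    if match is None:
        return None, merged
    return Decimal(match.group(1)), match.group(2)


# Example call using a value from the comment above.
amount, description = split_amount_description("-409.09Pre-conversion purchase")
```

The same pattern should also handle the 21200TRADE COLLECTORS style of value, since it only relies on where the leading digits end.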

Kawabata Yoshihiro on 05 Jul 2020 23:26:32

RE: Tables in PDF files

Nice, 'STARTED' status 😁

Sam on 05 Jul 2020 23:25:10

RE: Tables in PDF files

@Miguel
The summit is over, and it is mid-2018. When is the PDF connector scheduled?

Randy on 05 Jul 2020 23:24:15

RE: Tables in PDF files

This is obviously a much-needed data source; any update on its release would be appreciated.