Tables in PDF files
I have come across so many public data repositories that hold data in PDF format. Other websites have tables within documents such as annual reports etc., also in PDF format. A data source for PDFs or tables from PDFs would be awesome!
Thanks everyone for your feedback – We plan to release a PDF connector to import data from tables by the end of 2017 or early 2018. Stay tuned for more updates as we get closer.
Ken Puls commented
Please bring this to Excel as well. I get this question EVERY time I teach a course on using Power Query. It's a very big need!
Engaged User commented
Agree, sometimes you just dont have access to the nice to have CSV file. If the PDF was generated from an Excel file to begin with reverting it back would be awesome.
It would be great if PBI Desktop could load PDF files - both physical and scanned.
Kamel S. Abou Saleh commented
I also vote for PDF
This would be super for government data sources. Example: http://www.dfw.state.or.us/MRP/salmon/Historical_Data/docs/TrollEffTable.pdf
Gerry Baerman commented
I'll add a third vote for this. As Gogula indicates, PDFs are the rule for a lot of public domain data on the Web, especially from the US Gov. Personally, I hate PDFs and my choice would be to simply make them illegal :) , but if we have to live with them, we're going to need a way to mine the data from that hideous file format.
Chris Han commented
This is huge to CFO and CMO teams. Parsing financial reports is essential task toward any competition analysis and strategic planning.