How can we improve Power BI?

Tables in PDF files

I have come across so many public data repositories that hold data in PDF format. Other websites have tables within documents such as annual reports etc., also in PDF format. A data source for PDFs or tables from PDFs would be awesome!

3,000 votes
Sign in
Check!
(thinking…)
Reset
or sign in with
  • facebook
  • google
    Password icon
    Signed in as (Sign out)

    We’ll send you updates on this idea

    Gogula Aryalingam shared this idea  ·   ·  Flag idea as inappropriate…  ·  Admin →

    268 comments

    Sign in
    Check!
    (thinking…)
    Reset
    or sign in with
    • facebook
    • google
      Password icon
      Signed in as (Sign out)
      Submitting...
      • KrisW commented  ·   ·  Flag as inappropriate

        SEPT 2018 UPDATE: I am testing the PDF Import/Connector & have already found minor issues. Who/How/Where do I report?


        IN BRIEF - I have "sample data" G/L Ledger 51 pages. PBI is not bringing in column headers which is not a big deal, but in skipping the headers it is merging any data where there is only 1 space between columns. EXAMPLE: PERIOD & SOURCE of 1 PJ became 1PJ & ACCOUNT_NUM & ACCOUNT_DESC of 21200 TRADE COLLECTORS became 21200TRADE COLLECTORS - these 2 are easy enough to "split columns" to fix.

        HOWEVER, AMOUNT & DESCRIPTION were also merged so instead of -409.09 Pre-conversion purchase, I have -409.09Pre-conversion purchase. There is not a decimal in every amount & the amount total digits can vary. While it is highly unlikely that our company will connect to PDFs on a regular basis, we feel that this is an important feature for PBI.

        Our own software has no problems with this sample file. Tableau merges fields same as PBI, but at least it leaves the space so that the fields can be "split"

      • Sam commented  ·   ·  Flag as inappropriate

        @Miguel
        The summit is over - and It is Mid 2018 - when in the PDF connector scheduled

      • Randy commented  ·   ·  Flag as inappropriate

        This is obviously a much needed data source, any update on its release will be appreciated

      • Ryan commented  ·   ·  Flag as inappropriate

        I'm also looking forward to this feature. We currently do bluebeam take-off work for construction, after which I export to CSV and run through Power BI. I would love to get more granular access to the data contained within.

      • Dave commented  ·   ·  Flag as inappropriate

        Hey Miquel, can you provide an update? Surely your comment in August for one of the top ranked ideas warrants an update. What does Planned actually mean?
        We are now in April, can you comment that it will or won't be released in the May release?
        Come on give us a bone here...

      • AJ commented  ·   ·  Flag as inappropriate

        Any update on the ETA for this? Really looking forward to it!

      • Nate commented  ·   ·  Flag as inappropriate

        Any update on the ETA for this? Really looking forward to it!

      • Mike Honey commented  ·   ·  Flag as inappropriate

        While we wait for a Connector solution (which would be great), I'm having some success at this task using the open-source Tabula project.

        http://tabula.technology/

        It has supporting automation/scripting projects (e.g. tabula-py for python), but in almost all cases manual extract design and post-extract data transformation is required. I guess this is due to the imprecise nature of PDF tables.

      • Anonymous commented  ·   ·  Flag as inappropriate

        This is a very useful connector. The Internet has a lot of information in PDF, which can be used in complex analysis.

      • Andrew commented  ·   ·  Flag as inappropriate

        Hi - is there an expected release for this? Hopefully early 2018 = March update!

      ← Previous 1 3 4 5 13 14

      Feedback and Knowledge Base

      Ready to get started?

      Try new features of Power BI today by signing up and learn more about our powerful suite of apps.