How data.world handles different file types

Comments

2 comments

  • Avatar
    Mike Honey

    Can you explain what happens when we upload a PDF file? I can see a Schema by File and First 100 Data Points, but none of it makes any sense...  I was hoping to be able to query the table data from the PDF.

  • Avatar
    Rebecca Clay

    Hi Mike, we don't currently convert PDFs to queryable formats, so you'll only be able to store and view them.

    We hope to support querying any tables within PDFs in the future, but until then, you'll need to pull out the tables with another tool and upload as a separate file.I haven't used it, but Tabula appears to be an open source tool that you might try out: http://tabula.technology/.

Please sign in to leave a comment.