<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic ingest and parse PDF files in Administrative Discussions</title>
    <link>https://community.incorta.com/t5/administrative-discussions/ingest-and-parse-pdf-files/m-p/6115#M301</link>
    <description>&lt;P&gt;I know ( er, I'm pretty sure I know ) that Incorta has the ability to receive a PDF file and work with it - I expect that means to summarize info as in the CFO demo&amp;nbsp; - which is awesome.&amp;nbsp; &amp;nbsp;I'd also like to use it to pull data from a table w/in the PDF.&lt;/P&gt;&lt;P&gt;Can I do that w/ Incorta, and if so would it be using data studio, notebooks, something else?&lt;/P&gt;&lt;P&gt;Looking for a pointer to documentation, white papers, videos, whatnot.&lt;/P&gt;</description>
    <pubDate>Wed, 13 Nov 2024 19:23:34 GMT</pubDate>
    <dc:creator>RADSr</dc:creator>
    <dc:date>2024-11-13T19:23:34Z</dc:date>
    <item>
      <title>ingest and parse PDF files</title>
      <link>https://community.incorta.com/t5/administrative-discussions/ingest-and-parse-pdf-files/m-p/6115#M301</link>
      <description>&lt;P&gt;I know ( er, I'm pretty sure I know ) that Incorta has the ability to receive a PDF file and work with it - I expect that means to summarize info as in the CFO demo&amp;nbsp; - which is awesome.&amp;nbsp; &amp;nbsp;I'd also like to use it to pull data from a table w/in the PDF.&lt;/P&gt;&lt;P&gt;Can I do that w/ Incorta, and if so would it be using data studio, notebooks, something else?&lt;/P&gt;&lt;P&gt;Looking for a pointer to documentation, white papers, videos, whatnot.&lt;/P&gt;</description>
      <pubDate>Wed, 13 Nov 2024 19:23:34 GMT</pubDate>
      <guid>https://community.incorta.com/t5/administrative-discussions/ingest-and-parse-pdf-files/m-p/6115#M301</guid>
      <dc:creator>RADSr</dc:creator>
      <dc:date>2024-11-13T19:23:34Z</dc:date>
    </item>
    <item>
      <title>Re: ingest and parse PDF files</title>
      <link>https://community.incorta.com/t5/administrative-discussions/ingest-and-parse-pdf-files/m-p/6377#M312</link>
      <description>&lt;P&gt;Yes we have customers who have the same requirement and we use LLM capability to parse the PDF.&lt;/P&gt;
&lt;P&gt;We created a custom recipe in the DataStudio for loading the pdf files from cloud storage like S3 or Azure.&amp;nbsp; LLMs, like Gemini, have a good capability of recognizing text in images.&lt;/P&gt;
&lt;P&gt;We can provide this as part of a POC if you are interested.&lt;/P&gt;</description>
      <pubDate>Thu, 27 Mar 2025 13:55:46 GMT</pubDate>
      <guid>https://community.incorta.com/t5/administrative-discussions/ingest-and-parse-pdf-files/m-p/6377#M312</guid>
      <dc:creator>dylanwan</dc:creator>
      <dc:date>2025-03-27T13:55:46Z</dc:date>
    </item>
  </channel>
</rss>

