Process PowerPoint Slides as a whole image for VLM #2839
Steve-Allison started this conversation in Ideas
Replies: 0 comments
I feel that we've missed the ability to capture what a PowerPoint really is: a slide is not just a bunch of text and graphics, it's a gestalt whole. I would love to be able to run a VLM against the whole slide, use the result as an image description within Docling, and have Docling understand that this is the slide-level image description.
Currently, I do this as a post-conversion step and then inject the VLM image description back into the Docling file, but I would love to see this as a native capability.
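
To make the workaround concrete, here is a rough sketch of the post-conversion step in plain Python. It deliberately does not use the Docling API: `SlideRecord`, `annotate_slides`, and `fake_vlm` are hypothetical names invented for illustration, and the sketch assumes each slide has already been rendered to an image file and that the VLM is reachable through a simple callable that takes an image path and returns text.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class SlideRecord:
    """Hypothetical stand-in for one converted slide; not a Docling type."""
    page_no: int
    image_path: str  # rendered image of the whole slide (assumed to exist)
    elements: List[str] = field(default_factory=list)  # text the converter extracted
    slide_description: str = ""  # slide-level description, filled in afterwards


def annotate_slides(
    slides: List[SlideRecord],
    describe_image: Callable[[str], str],
) -> Dict[int, str]:
    """Describe each whole-slide image and attach the text to its slide."""
    descriptions: Dict[int, str] = {}
    for slide in slides:
        text = describe_image(slide.image_path).strip()
        slide.slide_description = text  # the "inject back" step
        descriptions[slide.page_no] = text
    return descriptions


def fake_vlm(image_path: str) -> str:
    """Placeholder for a real VLM call; returns a canned string."""
    return f"Description of {image_path}"


if __name__ == "__main__":
    deck = [SlideRecord(1, "slide1.png", ["Title"]), SlideRecord(2, "slide2.png")]
    for page_no, text in annotate_slides(deck, fake_vlm).items():
        print(page_no, text)
```

The point of the request is that the `slide_description` field here has no native home: the description is attached outside the conversion, so nothing downstream knows it describes the slide as a whole rather than one picture on it.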