You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have a group of 50 CIK's that I pulled historical 10-K/10-Q filings for and roughly 600/2700 filings support XBRL. Could the library support scraping the documents without relying on XBRL?
I could just create pyrantic objects and preprocess the html to markdown and use a reasoning model to extract to structured outputs but that creates the risk of hallucination/etc. hoping to ask around for a better way first
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
I have a group of 50 CIK's that I pulled historical 10-K/10-Q filings for and roughly 600/2700 filings support XBRL. Could the library support scraping the documents without relying on XBRL?
I could just create pyrantic objects and preprocess the html to markdown and use a reasoning model to extract to structured outputs but that creates the risk of hallucination/etc. hoping to ask around for a better way first
Beta Was this translation helpful? Give feedback.
All reactions