Parallel processing issue for same entity. #69
Closed
HannoLohuis
started this conversation in
General
Replies: 1 comment
-
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
When loading bronze, the files from the landing are processed in parallel by the main notebook. However, when you have to restart the load landing before loading the files in bronze, it could be that for the same entity you end-up with multiple files (let assume file t1 and t2). This is not a problem, but when the entity is loaded incrementally, it could be that file t1 is just processed, before file t2 (because in parallel). As a result you could potentially end-up with the wrong data (missing the final updates in t2).
In case multiple files needs to be processed for the same entity, the ordering of processing must be guaranteed.
Beta Was this translation helpful? Give feedback.
All reactions