PressMint PT Data#46
Conversation
|
Changes were accepted. |
|
Trying once again. |
|
All checks have passed now. |
|
@cluljoseaires, I thought you had provided only the source sample, but you provided the PressMint-PT sample without providing the source. |
|
I believe I have put the content in such a way because validation was
complaining and it stopped once I organized it in this way.
However, after taking a look at PressMint-CZ to try to better understand
what you mean, if by "sources" you mean the "images from which the content
has been extracted", I actually do not have those because I am told these
files are from a validated corpus and the images have not been kept.
Apparently, these files are the result of OCR done on several documents,
which were then validated by several people, including some metadata like
location and date, carried out a few years ago, having only kept the
resulting files.
Nevertheless, I am still in the process of including some other files,
which will include the images, as well as the corresponding URL links, but
I am still having issues.
Tell me if this is acceptable to you or if there is something else I could
or should do.
Matyáš Kopp ***@***.***> escreveu (sexta, 5/06/2026 à(s)
12:28):
… *matyaskopp* left a comment (clarin-eric/PressMint#46)
<#46 (comment)>
@cluljoseaires <https://github.com/cluljoseaires>, I thought you had
provided only the source sample, but you provided the PressMint-PT sample
without providing the source.
Can you move the content with the corpus sample to the PressMint-PT
directory? The validation is passing because no validation run is performed
on the Source folder.
—
Reply to this email directly, view it on GitHub
<#46?email_source=notifications&email_token=A24T6GLOJFWFR7E24C2IKHD46KVFFA5CNFSNUABFM5UWIORPF5TWS5BNNB2WEL2JONZXKZKDN5WW2ZLOOQXTINRTGEYTINZQG422M4TFMFZW63VHNVSW45DJN5XKKZLWMVXHJLDGN5XXIZLSL5RWY2LDNM#issuecomment-4631147075>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/A24T6GOHQ6QDRWTK2AXGW7346KVFFAVCNFSM6AAAAACX6PSIOWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHM2DMMZRGE2DOMBXGU>
.
Triage notifications, keep track of coding agent tasks and review pull
requests on the go with GitHub Mobile for iOS
<https://github.com/notifications/mobile/ios/A24T6GIPNMNOMR6DL25G32346KVFFA5CNFSNUABFM5UWIORPF5TWS5BNNB2WEL2JONZXKZKDN5WW2ZLOOQXTINRTGEYTINZQG422M4TFMFZW63VHNVSW45DJN5XKKZLWMVXHJKTGN5XXIZLSL5UW64Y>
and Android
<https://github.com/notifications/mobile/android/A24T6GNXG5EJGLTM3FJLD4D46KVFFA5CNFSNUABFM5UWIORPF5TWS5BNNB2WEL2JONZXKZKDN5WW2ZLOOQXTINRTGEYTINZQG422M4TFMFZW63VHNVSW45DJN5XKKZLWMVXHJLTGN5XXIZLSL5QW4ZDSN5UWI>.
Download it today!
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
by source, I mean source that is used for conversion to TEI, so everybody can see which kind of data and metadata are available The result sample should be placed directly under your corpus root folder |
Checking if possible.