Commit 17c2534
authored
Added the
* Add subcommand reconstruct
* Add a default reconstruct home folder to store compiled graph
* Copy and load enriched files to Blazegraph
* Implement code to query ASCT+B graph
* Implement code to load the graph
* Implement transformRecords to convert query result into ASCT+B table format
* Fix carriage return split
* Make sure the column order is correct
* Shorten concept IRI
* Fix the order between LABEL and ID
* Add header to the constructed table
* Fix references to remove duplicates and add PMID, if present
* Fix getting the reference ordering
* Normalized PubMed identifier
* Make ORCID field as string type
* Append only clonality and clone_id if they are present
* Implement reconstruct OMAP data
* Uncomment loadGraph
* Don't create any columns if the BM entry is empty
* Add at least a set of REF columns (e.g., REF/1, REF/1/ID) if no reference is present
* Implement reconstruct CTAnn data
* Implement reconstruct Ref-Organ data
* Get label from UBERON for accuracy
* Fix the data type for part_of_illustration property
* Implement reconstruct 2D FTU data
* Implement reconstruct Collection data
* Use Papaparse library to create the CSV content
* Remove quoteIfNeeded since we use Papaparse to handle quotes
* Move reconstruction artifacts to individual DO directories
* Cleanup reconstruction artifacts after generating the final output
Use the CLEANUP_ARTIFACTS flag to enable or disable this feature
* Convert query output from JSON to CSV using json2csv
* Change the output records to CSV format
* Fix missing general publications
* Remove unnecessary BIND(STR())
* Implement CSV parsing for ASCT+B DO outputs with Papaparse
* Remove the unnecessary BIND(STR()) for the rest of query files
* Implement CSV parsing for the rest DO outputs with Papaparse
* Add reference notes into the graph
* Add FTU columns into the graph
* Add FTU and reference notes to reconstruction process
* Fix sorting logic to handle numbers properly
* Fix malformed headers when reconstructing ASCT+B tables
* Add CT notes to the graph for use in reconstruction
* Remove unused folder to store the reconstruction artifacts
* Replace constant flag with --keep-artifacts CLI argument
* Extract queryGraph() into a reusable function
* Minor fix removing unnecessary quotes when reformatting date
* Add validation for reconstructed collection digital objects
* Fix array comparison to ignore item order
* Extract validation logging into its own module
* Implement validation technique through comparing CSV tables
* Reformat the warning log message
* Fix bug on locating the row number
* Reformat error log message
* Rename file
* Implement CTAnn validation that includes checks for table metadata section
* Add helper function to retrieve raw data name
* Undo comment out
* Implement OMAP validation
* Fix missing rationale for 'binds to antibody' relationships
* Add the Notes columns for AS and BM
* Make ASCT+B general publication query optional
* Modify ASCT+B record construction to include biomarker type for disambiguating duplicate entries
* Implement soft validation for the table header
* Enhance value comparison logic to handle array-like strings
* Cleanup the directory after validation
* Fix reference name and notes weren't captured properly
* Fix issue where protein presence is not retained in normalized form
* Add support for wildcard '*' to select columns for soft validation
* Ensure all columns from reconstructed CSV are present in the original CSV
* Skip row comparison if row sizes differ
* Rename variables
* Replace soft validation with per-cell comparison for unmatched rows
* Rename variablesreconstruct sub-command (#51)1 parent 9e4b6ae commit 17c2534
File tree
32 files changed
+2727
-48
lines changed- schemas/src/digital-objects
- src
- normalization
- asct-b-utils
- reconstruction
- queries
32 files changed
+2727
-48
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
299 | 299 | | |
300 | 300 | | |
301 | 301 | | |
| 302 | + | |
| 303 | + | |
| 304 | + | |
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
33 | 33 | | |
34 | 34 | | |
35 | 35 | | |
| 36 | + | |
36 | 37 | | |
37 | 38 | | |
38 | 39 | | |
| |||
41 | 42 | | |
42 | 43 | | |
43 | 44 | | |
| 45 | + | |
44 | 46 | | |
45 | | - | |
46 | | - | |
| 47 | + | |
47 | 48 | | |
48 | 49 | | |
49 | 50 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
175 | 175 | | |
176 | 176 | | |
177 | 177 | | |
| 178 | + | |
178 | 179 | | |
179 | 180 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
137 | 137 | | |
138 | 138 | | |
139 | 139 | | |
140 | | - | |
141 | | - | |
142 | | - | |
143 | | - | |
144 | | - | |
145 | | - | |
| 140 | + | |
| 141 | + | |
146 | 142 | | |
147 | 143 | | |
148 | 144 | | |
| |||
156 | 152 | | |
157 | 153 | | |
158 | 154 | | |
| 155 | + | |
159 | 156 | | |
160 | 157 | | |
161 | 158 | | |
| |||
171 | 168 | | |
172 | 169 | | |
173 | 170 | | |
| 171 | + | |
174 | 172 | | |
175 | 173 | | |
176 | 174 | | |
| |||
187 | 185 | | |
188 | 186 | | |
189 | 187 | | |
| 188 | + | |
| 189 | + | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
| 199 | + | |
| 200 | + | |
| 201 | + | |
| 202 | + | |
| 203 | + | |
| 204 | + | |
| 205 | + | |
| 206 | + | |
| 207 | + | |
| 208 | + | |
| 209 | + | |
| 210 | + | |
| 211 | + | |
| 212 | + | |
| 213 | + | |
| 214 | + | |
| 215 | + | |
| 216 | + | |
| 217 | + | |
| 218 | + | |
| 219 | + | |
| 220 | + | |
190 | 221 | | |
191 | 222 | | |
192 | 223 | | |
| |||
473 | 504 | | |
474 | 505 | | |
475 | 506 | | |
| 507 | + | |
| 508 | + | |
| 509 | + | |
| 510 | + | |
| 511 | + | |
| 512 | + | |
| 513 | + | |
| 514 | + | |
| 515 | + | |
| 516 | + | |
| 517 | + | |
| 518 | + | |
| 519 | + | |
| 520 | + | |
| 521 | + | |
| 522 | + | |
| 523 | + | |
| 524 | + | |
| 525 | + | |
| 526 | + | |
| 527 | + | |
| 528 | + | |
| 529 | + | |
| 530 | + | |
| 531 | + | |
| 532 | + | |
| 533 | + | |
| 534 | + | |
| 535 | + | |
| 536 | + | |
| 537 | + | |
| 538 | + | |
| 539 | + | |
| 540 | + | |
| 541 | + | |
| 542 | + | |
476 | 543 | | |
477 | 544 | | |
478 | 545 | | |
| |||
503 | 570 | | |
504 | 571 | | |
505 | 572 | | |
| 573 | + | |
| 574 | + | |
| 575 | + | |
| 576 | + | |
| 577 | + | |
| 578 | + | |
| 579 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
364 | 364 | | |
365 | 365 | | |
366 | 366 | | |
367 | | - | |
368 | 367 | | |
369 | 368 | | |
370 | 369 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
9 | 9 | | |
10 | 10 | | |
11 | 11 | | |
| 12 | + | |
12 | 13 | | |
13 | 14 | | |
14 | 15 | | |
| |||
118 | 119 | | |
119 | 120 | | |
120 | 121 | | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
121 | 131 | | |
122 | 132 | | |
123 | 133 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
135 | 135 | | |
136 | 136 | | |
137 | 137 | | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
138 | 148 | | |
139 | 149 | | |
140 | 150 | | |
| |||
146 | 156 | | |
147 | 157 | | |
148 | 158 | | |
149 | | - | |
150 | | - | |
| 159 | + | |
| 160 | + | |
151 | 161 | | |
152 | 162 | | |
153 | | - | |
| 163 | + | |
| 164 | + | |
| 165 | + | |
| 166 | + | |
| 167 | + | |
154 | 168 | | |
155 | 169 | | |
156 | 170 | | |
| |||
219 | 233 | | |
220 | 234 | | |
221 | 235 | | |
| 236 | + | |
| 237 | + | |
| 238 | + | |
| 239 | + | |
| 240 | + | |
| 241 | + | |
| 242 | + | |
| 243 | + | |
| 244 | + | |
222 | 245 | | |
223 | 246 | | |
224 | 247 | | |
| |||
0 commit comments