Train organoid label model by MikeLippincott · Pull Request #129 · WayScience/NF1_3D_organoid_profiling_pipeline

MikeLippincott · 2026-02-17T19:47:53Z

This pull request introduces significant improvements to the image segmentation and annotation workflows, including the addition of a new lightweight image annotation tool, updates to environment and dependency management, and enhancements to segmentation scripts. The changes streamline the featurization process, improve testing coverage, and provide detailed documentation for new and existing features.

This PR runs segmentation and all things segmentation related including a segmentation tool for annotating "morphology" of organoids.

This is non-complete work and I need to merge this into the repo and then refactor this repo to account for software gardening changes.

* ready for hPC * processed all segs * rerun organoid segs on HPC * fixed HPC script * fixed HPC script * update run list * update run list * update run list * update run list * update run list * segmentations re-completed * Update 2.segment_images/scripts/0.nuclei_segmentation.py Co-authored-by: Dave Bunten <ekgto445@gmail.com> * addressing comments --------- Co-authored-by: Dave Bunten <ekgto445@gmail.com>

review-notebook-app · 2026-02-17T19:47:58Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

Copilot

Pull request overview

This PR implements organoid image segmentation and classification infrastructure, including:

A morphology-dependent segmentation system with updated label handling
Machine learning models (logistic regression and random forest) to classify organoid morphology
A web-based annotation tool for labeling organoid images
Feature extraction pipelines using SAM-Med3D and MorphEm models
Scripts for preprocessing, training, prediction, and visualization

Changes:

Added morphology-aware segmentation logic supporting labels: globular, cluster, small, dissociated, elongated, blank, and failed
Created annotation tool with Flask-based UI for manual image labeling
Implemented model training and prediction pipelines for organoid classification
Added numerous utility scripts for preprocessing, feature extraction, and quality checking

Reviewed changes

Copilot reviewed 30 out of 46 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
`src/segmentation_utils/src/cell_segmentation.py`	Updated segmentation logic to handle new organoid labels (contains critical bug)
`src/file_utils/src/arg_parsing_utils.py`	Added output_features_subparent_name argument
`src/featurization_utils/src/sammed3d_featurizer.py`	Added logging suppression (stdout not restored)
`2.segment_images/scripts/*.py`	Added 8 new scripts for training, prediction, preprocessing, and checking
`2.segment_images/annotation_tool/*`	New web-based annotation tool with Flask
`3.cellprofiling/*`	Deleted README and shell script (moved/refactored)
`.gitignore`, `.pre-commit-config.yaml`	Updated configuration files

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

src/segmentation_utils/src/cell_segmentation.py

Copilot · 2026-02-17T19:54:03Z

2.segment_images/scripts/view_plate_level_segmentations.py

    import tqdm.notebook as tqdm
 else:
    import tqdm
+image_base_dir


The variable image_base_dir is used as a statement on line 30, which has no effect. This appears to be leftover debugging code that should be removed.

Copilot · 2026-02-17T19:54:03Z

2.segment_images/scripts/3.predict_whole_image_label.py

+        output_dict["label"].append(row["predicted_label"])
+        output_dict["predicted_or_gt"].append("predicted")
+    else:
+        output_dict["label"].append(row["label_name"])
+        output_dict["predicted_or_gt"].append("gt")


The logic for determining predicted vs ground truth labels is inverted. When predicted_label is NaN (missing), the code marks it as "predicted", but it should append the predicted label value. When predicted_label is not NaN, it appends label_name and marks it as "gt" (ground truth). This appears backwards - rows with actual predictions should be marked as "predicted", and rows with ground truth labels should be marked as "gt".

Suggested change

output_dict["label"].append(row["predicted_label"])

output_dict["predicted_or_gt"].append("predicted")

else:

output_dict["label"].append(row["label_name"])

output_dict["predicted_or_gt"].append("gt")

# No prediction available: use ground-truth label

output_dict["label"].append(row["label_name"])

output_dict["predicted_or_gt"].append("gt")

else:

# Prediction available: use predicted label

output_dict["label"].append(row["predicted_label"])

output_dict["predicted_or_gt"].append("predicted")

Copilot · 2026-02-17T19:54:05Z

src/featurization_utils/src/sammed3d_featurizer.py

+        # Suppress logging and stdout
+        import sys
+
+        logging.getLogger("transformers").setLevel(logging.ERROR)
+        logging.getLogger("torch").setLevel(logging.ERROR)
+        old_stdout = sys.stdout
+        sys.stdout = StringIO()


The stdout redirection to StringIO is never restored. After line 81, sys.stdout is redirected but there's no corresponding code to restore it to old_stdout. This will suppress all print statements for the rest of the program execution, which could hide important error messages or logs. Add sys.stdout = old_stdout after the model loading is complete.

2.segment_images/scripts/view_plate_level_segmentations.py

wli51

Nice PR! Love the utility UI you've made. The only thing that stood out to me was the potential bug with blank morphology src/segmentation_utils/src/cell_segmentation.py. Otherwise LGTM.

2.segment_images/annotation_tool/annotation_tool.py

2.segment_images/annotation_tool/README.md

2.segment_images/scripts/0.extract_whole_FOV_features.py

2.segment_images/scripts/2.train_models.py

2.segment_images/scripts/view_plate_level_segmentations.py

src/segmentation_utils/src/cell_segmentation.py

MikeLippincott and others added 30 commits January 13, 2026 14:07

running on HPC

9ad7a2a

push to HPC

2d1014a

push to HPC

1ccce77

push to HPC

813cf1f

run on HPC

e6303aa

rerun on HPC

cfa8599

push to HPC

a5810f9

push to HPC

24abc41

running ibp

3d84d63

ibp

1373b40

complete ibp

7818391

addressing comments

8e2ed48

running on HPC

4c48edd

push to HPC

d6101b9

push to HPC

a9f2638

push to HPC

bd54e63

run on HPC

aa5eda3

rerun on HPC

3a12e75

push to HPC

191976b

push to HPC

1405a7a

running ibp

f25de78

ibp

e77645f

complete ibp

6768cc2

addressing comments

5355647

Merge branch 'MikeLippincott-ibp_all_patients'

f7f09c9

Merge remote-tracking branch 'upstream/main'

c82595b

precommit

73c29cf

Merge remote-tracking branch 'upstream/main'

5c5cae8

MikeLippincott added 5 commits February 5, 2026 13:15

Merge remote-tracking branch 'upstream/main'

e145d51

Merge remote-tracking branch 'upstream/main'

84d996c

progress capture

f67bc65

progress capture

be001ef

updadting segmentations

d7a8f35

MikeLippincott requested a review from Copilot February 17, 2026 19:48

Copilot started reviewing on behalf of MikeLippincott February 17, 2026 19:48 View session

Copilot AI reviewed Feb 17, 2026

View reviewed changes

MikeLippincott requested a review from wli51 February 19, 2026 21:21

wli51 approved these changes Feb 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Train organoid label model#129

Train organoid label model#129
MikeLippincott wants to merge 35 commits intoWayScience:mainfrom
MikeLippincott:train_organoid_label_model

MikeLippincott commented Feb 17, 2026 •

edited

Loading

Uh oh!

review-notebook-app bot commented Feb 17, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI Feb 17, 2026

Uh oh!

Copilot AI Feb 17, 2026

Uh oh!

Copilot AI Feb 17, 2026

Uh oh!

Uh oh!

wli51 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

MikeLippincott commented Feb 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Feb 17, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Copilot AI Feb 17, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 17, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 17, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

wli51 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

MikeLippincott commented Feb 17, 2026 •

edited

Loading