Skip to content
This repository was archived by the owner on Apr 10, 2025. It is now read-only.

edge cases for cleaning .mbox #70

@isabelizimm

Description

@isabelizimm

Describe the bug
Some .mbox files are a list of strings and nested list, rather than just list of strings. This causes issues when using cleaning functions.

To Reproduce
Steps to reproduce the behavior:

  1. Go to ./notebooks/01_collect_data/raw_to_text.ipynb
  2. Run notebook end-to-end, using the entire variable mboxes rather than subset_mbox in IN[5]
  3. See issue below

Expected behavior
Expected to run through all .mbox files.

Screenshots
Screen Shot 2021-04-20 at 1 20 35 PM

Screen Shot 2021-04-20 at 1 20 20 PM

Additional context
Here is screenshot looking inside a problematic .mbox

unnamed

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions