TheNude: convert to Python scraper, add scene/gallery search#2718
TheNude: convert to Python scraper, add scene/gallery search#2718FlashSpazzbo wants to merge 1 commit intostashapp:masterfrom
Conversation
- Move scraper into TheNude/ folder; add Python implementation - Performer URLs normalized to canonical /_NNNNN.htm form - Filter performer's own name out of aliases (matches Stash validation) - Birthdate refuses month-only values rather than defaulting to 1st - Career length formatted as 'YYYY - YYYY' - Drop tattoos/piercings (TheNude data unreliable) - Fix fake_tits: keep 'Fake' as-is (Stash internal value) - Add scene-by-name + scene-by-query-fragment for magnifying glass picker - Add gallery-by-fragment (image field stripped, not in Stash schema) - Preserve XPath sceneByURL/galleryByURL/imageByURL handlers
|
I'm not seeing a good reason to convert to python, what can't be done in yml? |
Birthdate handling. Test case: Carli Banks (https://www.thenude.com/_6444.htm) lists her birthdate as "November 1985" — month and year, no day. YAML's Happy to revert the other changes (URL normalization, career length formatting, fake_tits mapping, dropping tattoos/piercings, and the new sceneByName/sceneByQueryFragment/galleryByFragment search operations) to YAML if the birthdate handling is the only thing that justifies the Python conversion. |
|
previous behaviour has been to force 01 date. you can also regex parse to nullify if it doesn't match |
|
Closing — will resubmit as YAML-only. |
Converts TheNude from YAML-only to a YAML+Python scraper. Adds scene and gallery search via the magnifying glass / "Scrape with..." UIs.
Scraper type(s)
Changes
Performer scraping (Python)
/_NNNNN.htmform, eliminating the space-in-URL problemfake_titsreturns "Fake" as-is (was being mapped to "Augmented" but "Fake" is the actual Stash internal value)Scene/gallery search (new)
sceneByName+sceneByQueryFragmentenables the magnifying glass search picker on scenes. Some studios provide both portrait and landscape versions of the same shoot, which TheNude lists as separate URLs; the picker lets the user choose which onegalleryByFragmentenables "Scrape with..." on galleriesPreserved from upstream
sceneByURL,galleryByURL,imageByURLhandlersTest cases
https://www.thenude.com/_28377.htm(Carly Lauren)