Skip to content

Presentation at the NYC Data Science Study Group on how to streamline your cross-validation and classification workflow using scikit-learn's Pipelines and FeatureUnions modules.

Notifications You must be signed in to change notification settings

Eric-Xu/nydssg_pipelines

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

As a prediction model grows in complexity, scikit-learn's Pipeline module and FeatureUnion module offers a convenient way to organize all of our data extraction, transformation, normalization, and training steps. By chaining transformers and estimators together, we can extract features into a single unit pipeline. Each feature pipeline can then be reordered and combined using FeatureUnion. This not only saves time, but allows us to keep our code better organized, while we look for the ideal combination of techniques for solving a modeling task.

About

Presentation at the NYC Data Science Study Group on how to streamline your cross-validation and classification workflow using scikit-learn's Pipelines and FeatureUnions modules.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published